Abstract: In this paper, we present a cluster-based algorithm for time series outlier mining. We use the discrete Fourier transform (DFT) to transform time series from the time domain to the frequency domain, so that each time series can be mapped to a point in k-dimensional space. A cluster-based algorithm is then developed to mine outliers from these points. The algorithm first partitions the input points into disjoint clusters and then prunes the clusters that are judged unable to contain outliers. The algorithm has been run on the electrical load time series of a steel enterprise and proved to be effective.
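The pipeline in this abstract (DFT mapping, clustering, pruning) can be illustrated with a minimal sketch. The abstract does not specify the clustering method or the pruning test, so everything below is an assumption made for illustration: the feature choice (magnitudes of the first k DFT coefficients), the greedy leader clustering, and the names `dft_features`, `leader_clusters`, `outlier_candidates`, `radius`, and `min_size` are all hypothetical.

```python
import cmath
import math

def dft_features(series, k):
    """Map one time series to a point in k-dimensional space.

    Assumed feature choice: magnitudes of the first k DFT coefficients.
    """
    n = len(series)
    feats = []
    for f in range(k):
        coeff = sum(x * cmath.exp(-2j * math.pi * f * t / n)
                    for t, x in enumerate(series))
        feats.append(abs(coeff))
    return feats

def leader_clusters(points, radius):
    """Greedy leader clustering (a stand-in, not the paper's method).

    A point joins the first leader within `radius`; otherwise it
    becomes a new leader. Returns one cluster label per point.
    """
    leaders, labels = [], []
    for p in points:
        for idx, c in enumerate(leaders):
            if math.dist(p, c) <= radius:
                labels.append(idx)
                break
        else:
            leaders.append(p)
            labels.append(len(leaders) - 1)
    return labels

def outlier_candidates(points, radius, min_size):
    """Prune clusters with at least `min_size` members; return the
    indices of the points left in the remaining small clusters."""
    labels = leader_clusters(points, radius)
    sizes = {}
    for lab in labels:
        sizes[lab] = sizes.get(lab, 0) + 1
    return [i for i, lab in enumerate(labels) if sizes[lab] < min_size]
```

In this sketch, pruning is a simple size threshold: large clusters are assumed to contain only normal series, so only members of small clusters remain as outlier candidates for closer inspection.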
Funding: Supported by the National Basic Research Program of China (973 Program: 2013CB329004)
Abstract: As data services rapidly penetrate daily life, mobile networks are becoming more complicated and the volume of transmitted data keeps increasing. Traditional statistical methods for anomalous cell detection cannot adapt to this evolution of networks, and data mining has become the mainstream approach. In this paper, we propose a novel kernel density-based local outlier factor (KLOF) that assigns each object a degree of being an outlier. First, the notion of KLOF is introduced, which captures exactly the relative degree of isolation. Then, by analyzing its properties, including the tightness of its upper and lower bounds and its sensitivity to density perturbation, we find that KLOF is much greater than 1 for outliers. Finally, KLOF is applied to a real-world dataset to detect anomalous cells with abnormal key performance indicators (KPIs) and thereby verify its reliability. The experiment shows that KLOF can find outliers efficiently and can guide operators toward faster and more efficient troubleshooting.
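The abstract does not give the KLOF formula, so the following is only a rough sketch of the general idea of a kernel-density-based local outlier score, not the paper's definition. It assumes a Gaussian kernel over the k nearest neighbours and scores each point by the ratio of its neighbours' average density to its own density, so that isolated points score well above 1. The names `knn`, `local_density`, `kernel_outlier_scores`, and the bandwidth `h` are hypothetical.

```python
import math

def knn(points, i, k):
    """Indices of the k nearest neighbours of point i (excluding i)."""
    others = [j for j in range(len(points)) if j != i]
    others.sort(key=lambda j: math.dist(points[i], points[j]))
    return others[:k]

def local_density(points, i, k, h):
    """Assumed density estimate: mean Gaussian kernel value over the k-NN."""
    return sum(math.exp(-0.5 * (math.dist(points[i], points[j]) / h) ** 2)
               for j in knn(points, i, k)) / k

def kernel_outlier_scores(points, k=3, h=1.0):
    """Ratio of the neighbours' mean density to the point's own density.

    Scores near 1 indicate a point as dense as its neighbourhood;
    scores much greater than 1 indicate relative isolation.
    """
    dens = [local_density(points, i, k, h) for i in range(len(points))]
    scores = []
    for i in range(len(points)):
        nbrs = knn(points, i, k)
        own = max(dens[i], 1e-300)  # guard against underflow to zero
        scores.append(sum(dens[j] for j in nbrs) / (k * own))
    return scores
```

For example, with five points packed around the unit square and one point at (4, 4), the packed points score close to 1 while the distant point scores several orders of magnitude higher.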
Funding: National Natural Science Foundation of China (No. 50705054)
Abstract: Assessing a machine's performance by comparing it with the same or similar machines is important for implementing intelligent maintenance of swarm machines. In this paper, an outlier mining based abnormal machine detection algorithm is proposed for this purpose. First, outlier mining based on clustering is introduced and the definition of the cluster-based global outlier factor (CBGOF) is presented. Then the modified swarm intelligence clustering (MSIC) algorithm is suggested and an outlier mining algorithm based on MSIC is proposed. The algorithm can not only cluster machines according to their performance but also detect possible abnormal machines. Finally, a comparison of the performance of mobile soccer robots shows that the algorithm is feasible and effective.
Abstract: Outlier mining is an important aspect of data mining, and outlier mining based on the Cook distance is the most commonly used approach. However, when the data exhibit multicollinearity, the traditional Cook method is no longer effective. Given the advantages of principal component estimation, we use it in place of least squares estimation and derive a Cook distance measure based on principal component estimation, which can be used in outlier mining. We also study related theory and application problems.
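For orientation, the standard least squares Cook distance and the standard principal component estimator are written out below. The abstract itself gives no formulas, so the notation is assumed: X is the n-by-p design matrix, y the response, \hat{\beta}_{(i)} the estimate with the i-th observation deleted, e_i the i-th residual, h_{ii} the i-th leverage, s^2 the residual mean square, and Q_r, \Lambda_r the eigenvectors and eigenvalues of X^{\top}X for its r largest eigenvalues. The paper's own measure, which replaces the least squares estimate with the principal component estimate, is not reproduced here.

```latex
% Least squares estimate and the classical Cook distance
\hat{\beta} = (X^{\top}X)^{-1}X^{\top}y, \qquad
D_i = \frac{(\hat{\beta}_{(i)}-\hat{\beta})^{\top} X^{\top}X\,(\hat{\beta}_{(i)}-\hat{\beta})}{p\,s^{2}}
    = \frac{e_i^{2}}{p\,s^{2}}\cdot\frac{h_{ii}}{(1-h_{ii})^{2}}

% Principal component estimator keeping r \le p components
\hat{\beta}_{PC} = Q_r \Lambda_r^{-1} Q_r^{\top} X^{\top} y
```

Under multicollinearity some eigenvalues of X^{\top}X are close to zero, which makes (X^{\top}X)^{-1} unstable; dropping those components in \hat{\beta}_{PC} is what stabilizes the estimate.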
Funding: Supported by the National High Technology Research and Development Program of China (863 Program) (2012AA040910)
Abstract: Outlier detection is a very important type of data mining and is used extensively in application areas. The traditional cell-based outlier detection algorithm not only takes a large amount of time to process massive data but also uses a large amount of machine resources, which results in an imbalanced machine load. This paper presents a MapReduce-based, cell-based outlier detection algorithm combined with a single-layer perceptron, which parallelizes outlier detection. Experiments show that the improved algorithm effectively improves both the efficiency and the accuracy of outlier detection.
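A minimal, single-process sketch of how cell-based pruning fits the map/reduce pattern is given below. It is not the paper's implementation: it assumes the classical distance-based setting (a point is an outlier if at most M other points lie within distance D), uses cells of side D / (2 * sqrt(d)) so that any two points in a cell and its immediate neighbours are within D of each other, and omits the single-layer perceptron entirely. The names `map_phase`, `reduce_phase`, and `undecided_cells` are hypothetical.

```python
import math
from collections import Counter
from itertools import product

def map_phase(points, side):
    """Map step: emit (cell, 1) for every point."""
    for p in points:
        yield tuple(math.floor(x / side) for x in p), 1

def reduce_phase(pairs):
    """Reduce step: sum the counts per cell."""
    counts = Counter()
    for cell, n in pairs:
        counts[cell] += n
    return counts

def undecided_cells(points, D, M):
    """Return cells that cannot be pruned by the first-layer rule.

    If a cell plus its immediate neighbours hold more than M points,
    no point in that cell can be an outlier, so the cell is pruned.
    """
    dim = len(points[0])
    side = D / (2 * math.sqrt(dim))
    counts = reduce_phase(map_phase(points, side))
    undecided = []
    for cell in counts:
        # Sum over the cell itself and all adjacent cells.
        near = sum(counts.get(tuple(c + o for c, o in zip(cell, off)), 0)
                   for off in product((-1, 0, 1), repeat=dim))
        if near <= M:
            undecided.append(cell)
    return sorted(undecided)
```

Because the map step touches each point independently and the reduce step only sums per-cell counts, both can be distributed across workers; only the points in the undecided cells then need exact distance checks.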