
Outlier detection data high performance preprocessing algorithm of statistical monitoring modeling
(Original Chinese title: 统计监控建模离群点检测数据预处理高效算法; cited by: 5)
Abstract: Statistical monitoring models based on multi-way principal component analysis (MPCA), including ordinary principal component analysis (PCA), are easily distorted by outliers in the modeling data. Using the k-nearest neighbor (k-NN) distance d^k of a data point as its outlyingness measure detects outliers effectively in nonlinear data sets. However, existing robust outlier detection algorithms based on this definition are very sensitive to the scale estimates used for centering and standardization, and they must compute d^k for every data point, which incurs a large computational cost. This paper proposes modified scale neighborhood pruning (MSNHP), a high-performance robust outlier detection algorithm for preprocessing statistical monitoring modeling data sets. MSNHP uses the modified scale to estimate the mean and standard deviation of the normal off-line modeling data, then centers and standardizes the data with them. During each d^k query it computes upper bounds on d^k for the other data points and uses these bounds to prune non-outliers directly, which reduces the number of d^k queries. It also optimizes the search order to improve pruning and to lower the cost of each d^k query. The algorithm was applied to outlier detection in a β-mannanase fermentation batch process. Compared with other robust outlier detection algorithms, it markedly reduced the computational cost and scaled better with both the number of data points and the algorithm parameters.
Source: Chinese Journal of Scientific Instrument (《仪器仪表学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2012, No. 12, pp. 2742-2746 (5 pages)
Funding: National Natural Science Foundation of China (61174123); Natural Science Foundation of Guangdong Province (9151063101000043)
Keywords: modified scale neighborhood pruning (MSNHP); high performance robust outlier detection algorithm; statistical monitoring modeling; data preprocessing
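The pruning idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the paper's MSNHP implementation: the abstract does not define the "modified scale" or the search order, so the sketch assumes a median/MAD standardization as a stand-in and visits points farthest from the coordinate-wise median first. The function names `robust_standardize` and `top_n_outliers` and the parameters `k` (neighbor count) and `n` (number of outliers to report) are hypothetical.

```python
import heapq
import numpy as np

def robust_standardize(X):
    """Center by the median and scale by 1.4826 * MAD (an assumed stand-in
    for the paper's "modified scale", which the abstract does not define)."""
    X = np.asarray(X, dtype=float)
    center = np.median(X, axis=0)
    scale = 1.4826 * np.median(np.abs(X - center), axis=0)
    scale[scale == 0] = 1.0  # avoid division by zero for constant columns
    return (X - center) / scale

def top_n_outliers(X, k, n):
    """Return (indices of the n points with the largest d^k, query count)."""
    X = np.asarray(X, dtype=float)
    m = len(X)  # requires m > k
    rows = np.arange(m)
    # known[i] keeps the k smallest distances seen so far from point i to
    # distinct other points; its maximum is an upper bound on d^k(i).
    known = np.full((m, k), np.inf)
    top = []        # min-heap of (d^k, index) holding the current top n
    cutoff = 0.0    # smallest d^k in the heap once it holds n entries
    queries = 0
    # Assumed ordering heuristic: visit points far from the median first so
    # the cutoff rises early and more points can be pruned.
    order = np.argsort(-np.linalg.norm(X - np.median(X, axis=0), axis=1))
    for p in order:
        if known[p].max() < cutoff:
            continue  # upper bound below cutoff: p cannot be in the top n
        queries += 1
        d = np.linalg.norm(X - X[p], axis=1)
        d[p] = np.inf                       # a point is not its own neighbor
        dk = np.partition(d, k - 1)[k - 1]  # exact k-th smallest distance
        # Reuse this query's distances to tighten every other point's bound.
        worst = known.argmax(axis=1)
        better = d < known[rows, worst]
        known[rows[better], worst[better]] = d[better]
        if len(top) < n:
            heapq.heappush(top, (dk, int(p)))
        elif dk > top[0][0]:
            heapq.heapreplace(top, (dk, int(p)))
        if len(top) == n:
            cutoff = top[0][0]
    return [i for _, i in sorted(top, reverse=True)], queries
```

A call such as `idx, queries = top_n_outliers(robust_standardize(X), k=3, n=3)` returns the reported indices in descending order of d^k, along with the number of full d^k queries actually run. Pruning is exact apart from ties, because a point is skipped only when its upper bound is strictly below the current cutoff, and the cutoff never decreases.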



