期刊文献+

基于Shannon熵的因子特征提取算法 被引量:9

Factor Analysis Feature Extraction Algorithm Based on Shannon Entropy
原文传递
导出
摘要 现存的数据提取算法,大都以方差贡献率作为评价准则,来衡量特征提取的效果.然而方差贡献率注重的是样本相关矩阵特征值的性质,并不能顾及到信息的度量问题.文中将Shannon信息熵理论引入提取算法,定义类概率、类信息函数,通过计算累计信息贡献率来确定提取特征维数,提取效果可以从信息论的角度评价.将此理论与因子分析(FA)结合,建立基于信息熵的FA特征提取算法,利用信息贡献率确定主因子提取的个数.通过实例分析,验证理论的有效性. The performance assessments of existing data extraction algorithms mostly use variance contribution rate calculated by eigenvalues of raw data to measure the effect of feature extraction. However, variance contribution rate emphasizes the characteristic of eigenvalues of correlation matrix of the sample and it can not take information measuring into account. The extraction effect can be assessed from the angle of information theory by introducing Shannon information entropy into extraction algorithm, defining class probability and class information function and determining feature dimensions by calculating total information contribution rate. The theory are combined with factor analysis (FA) and FA feature extraction algorithm of information function is established. The extracting number of main factors is determined by information contribution rate. Finally, the efficiency of the theory is tested by cases.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2011年第3期327-331,共5页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.60975039 41074003) 国家973重点基础研究发展计划(No.2007CB311004) 江苏省基础研究计划(自然科学基金)项目(No.BK2009093)资助
关键词 信息函数 Shannon熵 特征提取 方差贡献率 Information Function, Shannon Entropy, Feature Extraction, Variance Contribution Rate
  • 相关文献

参考文献18

  • 1毛勇,周晓波,夏铮,尹征,孙优贤.特征选择算法研究综述[J].模式识别与人工智能,2007,20(2):211-218. 被引量:95
  • 2Ding Shifei, Jia Weikuan, Su Chunyang, et al. A Survey on Statistical Pattern Feature Extraction//Proc of the 4th International Con- ference on Intelligent Computing. Shanghai, China, 2008:701 - 708.
  • 3Foman G. An Extensive Empirical Study of Feature Selection Metrics for Text Classification. Journal of Machine Learning Research, 2003, 3 : 1289 - 1305.
  • 4宋枫溪,高秀梅,刘树海,杨静宇.统计模式识别中的维数削减与低损降维[J].计算机学报,2005,28(11):1915-1922. 被引量:44
  • 5Ding Shifei, Jia Weikuan, Su Chunyang, et al. Research of Pattern Feature Extraction and Selection // Proc of the 7th International Conference on Machine Learning and Cybernetics. Kunming, Chi- na, 2008, I : 466 -471.
  • 6Kwon O W, Lee T W. Phoneme Recognition Using ICA-Based Fea- ture Extraction and Transformation. Signal Processing, 2004, 84 (6) : 1005 -1019.
  • 7杨进,文玉梅,李平.基于相关分析和近似熵的管道泄漏声信号特征提取及辨识方法[J].仪器仪表学报,2009,30(2):272-279. 被引量:51
  • 8Du Wei, Piater J. Tracking by Cluster Analysis of Feature Points U- sing a Mixture Particle Filter//Proc of the IEEE International Con- ference on Advanced Video and Signal Based Surveillance. London, UK, 2005:165-170.
  • 9Xu Yong, Zhang D, Song Fengxi, et al. A Method for Speeding up Feature Extraction Based on KPCA. Neurocomputing, 2007, 70 (4/ 5/6) : 1056 - 1061.
  • 10Bai Ling, Xu Anbang, Guo Ping, et al. Kernel ICA Feature Extraction for Spectral Recognition of Celestial Objects. IEEE Interna- tional Conference on Systems, Man and Cybernetics. Taipei, Chi- na, 2007 : 3922 - 3926.

二级参考文献144

  • 1宣国荣,柴佩琪.基于Chernoff上界的特征选择[J].模式识别与人工智能,1996,9(1):26-30. 被引量:2
  • 2刘伟权,王明会,钟义信.利用遗传算法实现手写体数字识别中特征维数的压缩[J].模式识别与人工智能,1996,9(1):45-51. 被引量:4
  • 3宣国荣,柴佩琪.基于巴氏距离的特征选择[J].模式识别与人工智能,1996,9(4):324-329. 被引量:16
  • 4姜旦.信息论[M].合肥:中国科技大学出版社,1987.14-96.
  • 5KU CHAO-CHEE, LEE, KWANG Y. Diagonal recurrent neural networks for dynamic systems control[ J]. IEEE Trans. Neural Networks, 1995,6( 1 ) : 144-156.
  • 6CHERN MING-JYH, WANG CHIN-CHENG, MA CHEN- HSUAN. Performance test and flow visualization of ball valve[J]. Experimental Thermal and Fluid Science, 2007, 31 (6) :502-512.
  • 7SCHRAMA C, HIRSCHBERG A. Application of vortex sound theory to vortex-pairing noise:sensitivity to errors in flow data[J]. Journal of Sound and Vibration, 2003, 266 : 1079-1098.
  • 8FUCHS H V, RIEHLE R. Ten years of experience with leak detection by acoustic signal analysis[ J]. Applied Acoustics, 1991,33 : 1-9.
  • 9MAKAR J M, KLEINER Y. Maintaining water pipeline integrity[ C]. AWWA Infrastructure Conference and Exhibition, Baltimore, Maryland, 2000:12-15.
  • 10MAKAR J M, CHAGNON N. Inspecting systems for leaks, pits, and corrosion [J].American Water Works Association, 1999,91 (7) :36-46.

共引文献240

同被引文献88

  • 1张沧生,崔丽娟,杨刚,倪志宏.集成学习算法的比较研究[J].河北大学学报(自然科学版),2007,27(5):551-554. 被引量:6
  • 2丁世飞,靳奉祥,史忠植.基于PLS的信息特征压缩算法[J].计算机辅助设计与图形学学报,2005,17(2):368-371. 被引量:7
  • 3王桂莲,盖立平,柴英,潘志达.心电图导联及其电学问题的分析[J].数理医药学杂志,2007,20(3):390-391. 被引量:4
  • 4段向阳 王永生 苏永生.基于奇异值分解的信号特征提取方法研究.振动与冲击,2009,28(11):30-33.
  • 5Merler S, Caprile B, Furlanello C. Parallelizing AdaBoost by weights dynamics [J]. Computational Statistics & Data Analysis, 2007, 51 (5): 2487-2498.
  • 6Zhang CX, Zhang JS. A local boosting algorithm for solving classification problems[J]. Computational Statistics & Data Analysis, 2008, 52 (4): 1928-1941.
  • 7Nishikawa T, Abe S. Maximizing margins of multilayer neural networks [C]//Singapore: Proceedings of the 9th International Conference on Neural Information Processing, 2002.
  • 8Chang Chihchung, Lin Chihjen. LIBSVM: A library for support vector machines [J]. ACM Transactions on Intelligent Systems and Technology, 2011, 2 (3): 27. Software available at http: //www. csie. ntu. edu. tw/-cjlin/libsvm.
  • 9Fan RE, Chang KW, Hsieh CJ, et al. LIBLINEAR: A library for large linear classification [J]. Journal of Machine Learning Research, 2008 (9): 1871-1874.
  • 10Weston J, Schoolkopf B, Eskin E, et al. Dealing with large diagonals in kernel matrices EG]. Lecture Notes in Computer Science 243: Principles of Data Mining and Knowledge Disco- very, Springer, 2002.

引证文献9

二级引证文献47

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部