To Solve Kernel Principal Component Analysis Using Iterative Method (使用迭代方法求解核主成分分析)

Cited by: 2


Abstract: Kernel Principal Component Analysis (KPCA) uses the kernel method to generalize the classical linear algorithm Principal Component Analysis (PCA) to a high-dimensional feature space, and is a widely used feature extraction algorithm for complex nonlinear data. It first computes the kernel matrix between all mapped samples in the high-dimensional space, and then applies eigen-decomposition to obtain the eigen-solutions of the kernel matrix. The space and time complexity of this procedure are O(m^2) and O(m^3), respectively, where m is the number of samples. For large-scale data sets the standard procedure therefore becomes infeasible because of its storage and computation requirements. This paper proposes to use power iteration to compute the dominant eigen-solution of the kernel matrix (the largest eigenvalue and its eigenvector), and then to apply Schur-Wielandt deflation repeatedly to obtain the remaining eigen-solutions one by one. Unlike the traditional approach, the kernel matrix does not need to be computed and stored in advance, so the space complexity of the proposed method is only O(m). Experimental results on toy and real data sets validate the effectiveness of the proposed method.
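The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the RBF kernel, the parameter `gamma`, the stopping rule, and the function names `rbf_row`, `kernel_matvec`, and `top_eigenpairs` are all assumptions made for illustration. The sketch also omits the centering of the kernel matrix that KPCA normally requires, and it uses the symmetric form of the deflation step (subtracting `lam * v * v^T`), which is what Wielandt-type deflation reduces to for a symmetric matrix with unit-norm eigenvectors. Each product of the kernel matrix with a vector is built from kernel rows computed on demand, so the m x m matrix is never stored.

```python
import numpy as np

def rbf_row(X, i, gamma):
    """Row i of the RBF kernel matrix, computed on demand (O(m) extra memory)."""
    d2 = np.sum((X - X[i]) ** 2, axis=1)
    return np.exp(-gamma * d2)

def kernel_matvec(X, u, gamma):
    """Compute K @ u one row at a time, never forming the m x m matrix K."""
    m = X.shape[0]
    out = np.empty(m)
    for i in range(m):
        out[i] = rbf_row(X, i, gamma) @ u
    return out

def top_eigenpairs(X, k, gamma=1.0, iters=1000, tol=1e-12, seed=0):
    """Power iteration with deflation for the k largest eigenpairs of K."""
    rng = np.random.default_rng(seed)
    m = X.shape[0]
    vals, vecs = [], []
    for _ in range(k):
        u = rng.standard_normal(m)
        u /= np.linalg.norm(u)
        lam = 0.0
        for _ in range(iters):
            w = kernel_matvec(X, u, gamma)
            # Deflation: remove the eigenpairs that were already found,
            # i.e. apply (K - sum_j lam_j v_j v_j^T) to u without storing it.
            for l, v in zip(vals, vecs):
                w -= l * (v @ u) * v
            lam_new = u @ w  # Rayleigh quotient of the deflated operator
            norm = np.linalg.norm(w)
            if norm == 0.0:
                break
            u = w / norm
            converged = abs(lam_new - lam) <= tol * max(1.0, abs(lam_new))
            lam = lam_new
            if converged:
                break
        vals.append(lam)
        vecs.append(u)
    return np.array(vals), np.column_stack(vecs)
```

In this sketch the working memory beyond the data itself is one kernel row plus the k eigenvectors found so far, so it grows linearly in m for a fixed k. The price is time: every iteration recomputes all kernel rows, so convergence speed depends on how well separated the leading eigenvalues are.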

Authors: Shi Weiya (史卫亚), Guo Yuefei (郭跃飞)

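Affiliations: School of Information Science and Engineering, Henan University of Technology; Key Laboratory of Grain Information Processing and Control, Ministry of Education; School of Computer Science, Fudan University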

Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 8, pp. 1882-1885 (4 pages)

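Funding: Natural Science Research Program of the Education Department of Henan Province (2010B520005); Doctoral Fund of Henan University of Technology (2009BS013); National Natural Science Foundation of China (60875003); Key Science and Technology Project of the Science and Technology Department of Henan Province (112102210190); Science and Technology Development Program of Zhengzhou (2010SFXM470)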

Keywords: kernel principal component analysis (KPCA); kernel matrix; large-scale data set; eigen-decomposition; power iteration

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]

