期刊文献+

使用迭代方法求解核主成分分析 被引量:2

To Solve Kernel Principal Component Analysis Using Iterative Method
下载PDF
导出
摘要 核主成分分析方法是使用核方法将经典的线性算法主成分分析推广到高维空间,用来处理复杂非线性数据的一种常用的特征提取算法,该算法首先在高维空间中计算所有样本之间的核矩阵,然后使用特征分解技术计算核矩阵的特征解,其计算的时间和空间复杂度分别为O(m2)和O(m3).然而在大规模数据集的情况下,由于储存和计算的问题无法进行正常的求解.文中提出首先使用幂迭代方法计算核矩阵的高阶特征解,然后重复使用Schur-Weilandt收缩方法分别计算出核矩阵的其它阶特征解.文中算法在计算过程中,不需要像传统的计算方法那样需要事先存储核矩阵,空间复杂度只有O(m).通过在模拟和真实数据的实验结果充分验证了算法的有效性. Kernel Principal Component Analysis (KPCA) is the generalized algorithm of famous Principal Component Analysis ( PCA), which uses the kernel method and treats with the complex nonlinear dataset. It firstly computes the kernel matrix between mapped samples in high dimensional space, and uses eigen-decomposition technique to compute the eigen-solution for kernel matrix. The space and time complexity of the KPCA is O( m2 ) and O( m3 ) , respectively. When faced with large-scale data set, the method is infeasible for the sake of the storage and computational problem. In this paper, the Power iteration is introduced to compute the highest eigen-solution. Then the Schur-Weilandt deflation is repeatedly applied to achieve other higher order eigenvectors. In the process of computation, the kernel matrix needs not to compute and store in advance. The space complexity of the proposed method is only O ( m ). The effectiveness of proposed method is validated from experimental results on toy and real dataset.
出处 《小型微型计算机系统》 CSCD 北大核心 2013年第8期1882-1885,共4页 Journal of Chinese Computer Systems
基金 河南省教育厅自然科学研究计划项目(2010B520005)资助 河南工业大学博士基金项目(2009BS013)资助 国家自然科学基金项目(60875003)资助 河南省科技厅重点科技攻关项目(112102210190)资助 郑州市科技发展计划项目(2010SFXM470)资助
关键词 核主成分分析 核矩阵 大数据集 特征分解 幂迭代 KPCA kernel matrix large-scale data set eigen-decomposition power iteration
  • 相关文献

参考文献2

二级参考文献13

  • 1Martinez A M, Kak A C. PCA versus LDA. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(2): 228-233.
  • 2Mika S, Scholkopf B, Smola A, Muller K R, Scholz M, Ratsch G. Kernel PCA and de-noising in feature spaces. In: Proceedings of the Conference on Advances in Neural Information Processing Systems. Denver, Colorado: MIT Press, 1999. 536-542.
  • 3Weng J Y, Zhang Y L, Hwang W S. Candid covariancefree incremental principal componet analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25(8): 1034-1040.
  • 4Zhang Y L, Weng J Y. Convergence Analysis of Complementary Candid Incremental Principal Component Analysis, Technical Report MSU-CSE-01-23, Michigan State University, USA, 2001.
  • 5Papadimitriou S, Sun J, Faloutsos C. Streaming pattern discovery in multiple time-series. In: Proceedings of the 31st International Conference on Very Large Data Bases. Trondheim, Norway: ACM, 2005. 697-708.
  • 6Scholkopf B, Smola A, Muller K R. Nonliear component analysis as a kernel eigenvalue problem. Neural Computation, 1998, 10(5): 1299-1319.
  • 7Hyvavinen A, Oja E. Independent component analysis: algorithms and applications. Neural Networks, 2000, 13(4-5): 411-430.
  • 8Blake C L, Merz C J. UCI repository of machine learning databases [Online], available: http://www.ics.uci.edu/ -mlearn/MLRepository.html, November 7, 2007.
  • 9Coenen F. LUCS-KDD DN software [Online], available: http://www.csc.liv.ac.uk/-frans/KDD/software/LUCS_K- DD_DN/, November 7, 2007.
  • 10Keogh E, Xi X P, Li W, Ratanamahatana C. The UCR time series data mining archive [Online], available: http://www. cs.ucr.edu/-eamonn/TSDMA/index.html, November 7, 2007.

共引文献29

同被引文献14

  • 1Jolliffe I T. Principal component analysis [ M ]. 2nd ed. New York : Springer, 2002.
  • 2Scholkopf B, Muller S A. Nonlinear component analysis as a kernel ei- genvalue problem [ J]. Neural Computation, 1998,10 (5) : 1299- 1319.
  • 3Shyu M L, Chen S C, Sarinnapakom K, et al. A novel anomaly detec- tion scheme based on principal component classifier[ C ]//Proc of the 3rd IEEE International Conference on Data Mining. 2003:172-179.
  • 4Hoffmann H. Kernel PCA for novelty detection[ J]. Pattern Recogni- tion, 2007,40 ( 3 ) : 863- 874.
  • 5Sehalkopf B, Williamson R C, Smola A, et al. Support vector method for novelty detection [ C ]//Advances in Neural Information Processing Systems. 2000:582-588.
  • 6Kwak N. Principal component analysis based on Ll-norm maximization [J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2008,30 ( 9 ) : 1672-1680.
  • 7Xiao Yingchao,Wang Huangang,Xu Wenli,et al. L1 norm based KP- CA for novelty detection [ J]. Pattern Recognition,2013,46 ( 1 ) : 389-396.
  • 8Zheng Wenming, Zou Cairong, Zhao Li. An improved algorithm for kernel principal components analysis [J]. Neural Processing Let- ters,2005,22( 1 ) :49-56.
  • 9Honeine P. Online kernel principal component analysis : a reduced-or- der model[ J]. IEEE Trans on Pattern Analysis and Machine In- telligence,2012,34 (9) : 1814-1826.
  • 10史卫亚,郭跃飞,薛向阳.一种解决大规模数据集问题的核主成分分析算法[J].软件学报,2009,20(8):2153-2159. 被引量:19

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部