期刊文献+

一种基于极大熵的快速无监督线性降维方法 被引量:2

Fast Unsupervised Dimension Reduction Method Based on Maximum Entropy
下载PDF
导出
摘要 现实世界中高维数据无处不在,然而在高维数据中往往存在大量的冗余和噪声信息,这导致很多传统聚类算法在对高维数据聚类时不能获得很好的性能.实践中发现高维数据的类簇结构往往嵌入在较低维的子空间中.因而,降维成为挖掘高维数据类簇结构的关键技术.在众多降维方法中,基于图的降维方法是研究的热点.然而,大部分基于图的降维算法存在以下两个问题:(1)需要计算或者学习邻接图,计算复杂度高;(2)降维的过程中没有考虑降维后的用途.针对这两个问题,提出一种基于极大熵的快速无监督降维算法MEDR. MEDR算法融合线性投影和极大熵聚类模型,通过一种有效的迭代优化算法寻找高维数据嵌入在低维子空间的潜在最优类簇结构. MEDR算法不需事先输入邻接图,具有样本个数的线性时间复杂度.在真实数据集上的实验结果表明,与传统的降维方法相比, MEDR算法能够找到更好地将高维数据投影到低维子空间的投影矩阵,使投影后的数据有利于聚类. High-dimensional data is widely adopted in the real world.However,there is usually plenty of redundant and noisy information existing in high-dimensional data,which accounts for the poor performance of many traditional clustering algorithms when clustering high-dimensional data.In practice,it is found that the cluster structure of high-dimensional data is often embedded in the lower dimensional subspace.Therefore,dimension reduction becomes the key technology of mining high-dimensional data.Among many dimension reduction methods,graph-based method becomes a research hotspot.However,most graph-based dimension reduction algorithms suffer from the following two problems:(1)most of the graph-based dimension reduction algorithms need to calculate or learn adjacency graphs,which have high computational complexity;(2)the purpose of dimension reduction is not considered in the process of dimension reduction.To address the problem,a fast unsupervised dimension reduction algorithm is proposed based on the maximum entropy-MEDR,which combines linear projection and the maximum entropy clustering model to find the potential optimal cluster structure of high-dimensional data embedded in low-dimensional subspace through an effective iterative optimization algorithm.The MEDR algorithm does not need the adjacency graph as an input in advance,and has linear time complexity of input data scale.A large number of experimental results on real datasets show that the MEDR algorithm can find a better projection matrix to project high-dimensional data into low-dimensional subspace compared with the traditional dimensionality reduction method,so that the projected data is conducive to clustering analysis.
作者 王继奎 杨正国 刘学文 易纪海 李冰 聂飞平 WANG Ji-Kui;YANG Zheng-Guo;LIU Xue-Wen;YI Ji-Hai;LI Bing;NIE Fei-Ping(College of Information Engineering,Lanzhou University of Finance and Economics,Lanzhou 730020,China;Center for Optical Imagery Analysis and Learning(OPTIMAL),Northwestern Polytechnical University,Xi’an 710072,China)
出处 《软件学报》 EI CSCD 北大核心 2023年第4期1779-1795,共17页 Journal of Software
基金 国家自然科学基金(61772427,11801345) 甘肃省高等学校创新能力提升项目(2019B-97) 兰州财经大学校级重点项目(Lzufe2020B-0010,Lzufe2020B-011)。
关键词 无监督学习 线性降维 邻接图 聚类 极大熵 unsupervised learning dimension reduction adjacency graph clustering maximum entropy
  • 相关文献

参考文献5

二级参考文献62

  • 1张志华,郑南宁,史罡.Maximum-entropy clustering algorithm and its global convergence analysis[J].Science China(Technological Sciences),2001,44(1):89-101. 被引量:3
  • 2李兴斯.一类不可微优化问题的有效解法[J].中国科学(A辑),1994,24(4):371-377. 被引量:137
  • 3吕庆文,陈武凡.基于互信息量的图像分割[J].计算机学报,2006,29(2):296-301. 被引量:40
  • 4Rose K,Gurewtiz E,Fox G.A deterministic annealing approach to clustering. Pattern Recognition . 1990
  • 5Karayiannis N B.An axiomatic approach to soft learn ingvector quantization and clustering. IEEE ACM Transactions on Networking . 1999
  • 6Karayiannis N,B Meca.Maximum entropy clustering algorithm. Proc IEEE Conf Fuzzy Syst . 1994
  • 7Li R P,Mukaidon M.A maximum entropy approach to fuzzy clustering. Proceedings of the 4th IEEE International Conference onFuzzy System . 1995
  • 8Karayiannis N B.Fuzzy partition entropies and entropy constrained fuzzy clustering algorithms. J Intell Fuzzy Syst . 1997
  • 9Miyamoto S,Mukaidono M.Fuzzy c-means as aregularization and maximum entropy approach. Proceedings of the 7th International Fuzzy SystemsAssociation World Congress (IFSA’97), Prague: Chech . 1997
  • 10Miyamoto S,,Umayahara K.Two methods of fuzzy c-meansand classification functions. Proceedings of the 6thConference of the International Federation of ClassificationSocieties . 1998

共引文献68

同被引文献16

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部