Manifold learning and visualization based on self-organizing map (cited by: 2)

Abstract: The Self-Organizing Map (SOM) tends to produce topological defects when learning and visualizing the intrinsic low-dimensional manifold structure of high-dimensional data sets. To address this problem, this paper presents a manifold learning algorithm, the Dynamic Self-Organizing Map (DSOM). DSOM expands the training data set gradually according to the neighborhood structure of the data and trains the map step by step, which avoids local minima and overcomes the topological defect problem. The map size is also increased dynamically, which reduces the time cost of the algorithm. Experimental results show that DSOM learns and visualizes the intrinsic low-dimensional manifold structure of high-dimensional data sets more faithfully than SOM. Compared with traditional manifold learning algorithms, DSOM also yields more concise visualization results and is less sensitive to the neighborhood size and to noise. The contribution of the paper is that DSOM expands the map size and the training data set synchronously according to the intrinsic neighborhood structure of the data, so that the low-dimensional manifold structure can be learned and visualized more concisely and faithfully.
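The progressive-training idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the expansion schedule, the map topology, or the growth rule, so all of the following are assumptions made for illustration: a 1-D chain of nodes, a k-nearest-neighbor graph as the "neighborhood structure", one hop of graph expansion per stage, one node appended per stage, and the hypothetical function names `knn_indices`, `expand`, `train_som`, and `dsom_sketch`.

```python
import numpy as np

def knn_indices(X, k):
    """Indices of the k nearest neighbours of every row of X (self excluded)."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)
    return np.argsort(d, axis=1)[:, :k]

def expand(active, nbrs):
    """Return a copy of the active mask with all k-NN neighbours of active points added."""
    grown = active.copy()
    grown[nbrs[active].ravel()] = True
    return grown

def train_som(W, X, epochs, rng, lr=0.2, sigma=1.0):
    """Online SOM updates on a 1-D chain of weights W (modified in place)."""
    pos = np.arange(len(W))
    for _ in range(epochs):
        for x in X[rng.permutation(len(X))]:
            bmu = np.argmin(np.linalg.norm(W - x, axis=1))     # best-matching unit
            h = np.exp(-((pos - bmu) ** 2) / (2 * sigma ** 2))  # chain neighbourhood
            W += lr * h[:, None] * (x - W)
    return W

def dsom_sketch(X, k=4, seed_index=0, nodes_per_step=1, epochs=5, rng=None):
    """Grow the training set along the k-NN graph and grow the map alongside it.

    Assumed growth rule (not from the paper): append copies of the last chain
    node each time the training set is expanded by one graph hop.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    nbrs = knn_indices(X, k)
    active = np.zeros(len(X), dtype=bool)
    active[seed_index] = True
    active = expand(active, nbrs)
    # Start with two nodes placed near the seed point.
    W = X[seed_index] + 1e-3 * rng.standard_normal((2, X.shape[1]))
    history = []  # (number of training points, number of map nodes) per stage
    while True:
        train_som(W, X[active], epochs, rng)
        history.append((int(active.sum()), len(W)))
        if active.all():
            break
        grown = expand(active, nbrs)
        if grown.sum() == active.sum():  # graph is disconnected: nothing left to reach
            break
        active = grown
        W = np.vstack([W, np.repeat(W[-1:], nodes_per_step, axis=0)])
    return W, history

# Usage: a 2-D spiral standing in for a 1-D manifold embedded in a higher dimension.
t = np.linspace(0, 3 * np.pi, 60)
spiral = np.c_[t * np.cos(t), t * np.sin(t)]
weights, stages = dsom_sketch(spiral)
```

Each entry of `stages` records how many data points and how many map nodes were in use at that stage, which makes the synchronous growth of the two visible. A fixed, small neighborhood width is used here so that later stages refine the chain instead of reordering it; that is a design choice of this sketch, not a claim about DSOM.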

Authors: SHAO Chao, WAN Chunhong

Affiliation: School of Computer and Information Engineering, Henan University of Economics and Law

Source: Journal of Computer Applications (《计算机应用》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 7, pp. 1917-1921, 1934 (6 pages)

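Funding: National Natural Science Foundation of China (61202285); Basic and Frontier Technology Research Program of Henan Province (112300410201); Key Science and Technology Research Project of the Education Department of Henan Province, Basic Research Program (13B520899)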

Keywords: manifold learning; Self-Organizing Map (SOM); topological defect; locally Euclidean nature; neighborhood structure

Classification: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]

