Instance transfer learning model based on sparse hierarchical probabilistic self-organizing graphs (cited by: 3)
Abstract: Instance-based transfer learning runs into a data-granularity mismatch when it relates data from multi-source heterogeneous domains. To address this, a Transfer Sparse unsupervised Hierarchical Probabilistic Self-Organizing Graph (TSHiPSOG) method is proposed, built on the single-domain Hierarchical Probabilistic Self-Organizing Graph (HiPSOG) clustering method. First, a hierarchical self-organizing model based on a probabilistic mixture of multivariate Gaussian distributions is generated separately in the source domain and the target domain, so that representation vectors of different granularities can be extracted in each domain; a sparse graph method controls model growth through a probabilistic criterion. Second, the maximal information coefficient (MIC) is used to find, in the information-rich source domain, the representation vectors most similar to those of the target domain, and the class labels of these source-domain representation vectors are used to refine the classification of the target-domain data. Finally, experiments on the 20 Newsgroups dataset and a spam detection dataset show that the algorithm can use useful information from the source domain to assist classification in the target domain, improving classification accuracy by up to about 15.26% and 9.05%. Compared with other classical transfer learning methods, mining representation vectors of different granularities through sparse hierarchical modeling improves classification accuracy by up to about 4.48% and 4.13%.
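The matching and label-transfer step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the representation vectors have already been extracted in both domains and are plain lists of floats of equal length, and it replaces the full MIC grid optimization with a crude MIC-style score (normalized mutual information maximized over small equal-frequency grids). The function names `equal_freq_bins`, `mutual_information`, `mic_like`, and `transfer_labels` are hypothetical.

```python
import math
from collections import Counter


def equal_freq_bins(values, k):
    """Assign each value a bin index in 0..k-1 by rank (equal-frequency binning)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    bins = [0] * n
    for rank, i in enumerate(order):
        bins[i] = min(k - 1, rank * k // n)
    return bins


def mutual_information(bx, by):
    """Mutual information (in nats) between two sequences of bin indices."""
    n = len(bx)
    joint = Counter(zip(bx, by))
    px, py = Counter(bx), Counter(by)
    return sum(
        (c / n) * math.log((c * n) / (px[a] * py[b]))
        for (a, b), c in joint.items()
    )


def mic_like(x, y, max_bins=4):
    """Crude MIC-style score: best normalized MI over small grids.

    Assumption: a stand-in for the maximal information coefficient.
    It omits the grid-size bound and partition search of the full MIC.
    """
    best = 0.0
    for kx in range(2, max_bins + 1):
        for ky in range(2, max_bins + 1):
            mi = mutual_information(equal_freq_bins(x, kx), equal_freq_bins(y, ky))
            best = max(best, mi / math.log(min(kx, ky)))
    return best


def transfer_labels(source_vectors, source_labels, target_vectors, max_bins=4):
    """Give each target vector the label of its highest-scoring source vector."""
    labels = []
    for t in target_vectors:
        scores = [mic_like(s, t, max_bins) for s in source_vectors]
        labels.append(source_labels[scores.index(max(scores))])
    return labels
```

Calling `transfer_labels(source_vectors, source_labels, target_vectors)` returns one source-domain label per target representation vector. The hierarchical model growth and the sparse-graph probabilistic criterion are outside the scope of this sketch.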
Source: Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core Journal, 2016, No. 3, pp. 692-696, 730 (6 pages)
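Funding: National Natural Science Foundation of China (61305018); National Social Science Foundation of China (15CTQ030); China Postdoctoral Science Foundation, 57th Batch General Program (2015M571183); Science and Technology Innovation Project of the Chinese Academy of Agricultural Sciences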
Keywords: machine learning; transfer learning; unsupervised learning; hierarchical method; sparse graphical method