
A Semi-supervised Replicated Softmax Model
Abstract: Probabilistic topic models are widely used in document analysis tasks because of their efficient dimensionality reduction and document-topic feature mining. However, most topic models are built on directed graphical models, which greatly limits their representational power. This paper studies distributed topic-feature representation and the Replicated Softmax Model (RSM), an undirected graphical model based on the Restricted Boltzmann Machine (RBM), and proposes a Semi-Supervised Replicated Softmax Model (SSRSM). Experimental results show that SSRSM outperforms LDA and RSM in topic extraction. Furthermore, when the topic features learned by SSRSM and RSM are applied to a multi-label classification task, SSRSM exhibits better multi-label discrimination than both LDA and RSM.
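The RSM on which the paper builds is an RBM whose visible layer is a softmax over the vocabulary, replicated once per word token, with the hidden-unit biases scaled by the document length D. As a rough illustration of that idea (not the authors' SSRSM, which additionally exploits label information), a minimal contrastive-divergence (CD-1) trainer on toy bag-of-words counts might look like this; all hyperparameters and variable names here are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Toy corpus: bag-of-words count vectors over a vocabulary of K terms.
K, H = 20, 10                            # vocabulary size, hidden topic units
docs = rng.poisson(1.0, size=(30, K)).astype(float)

W = 0.01 * rng.standard_normal((K, H))   # word-topic weights
b = np.zeros(K)                          # visible (word) biases
a = np.zeros(H)                          # hidden (topic) biases
lr = 0.01

for epoch in range(50):
    for v in docs:
        D = v.sum()                      # document length scales the hidden bias
        if D == 0:
            continue
        # Positive phase: hidden probabilities given the observed counts.
        h_prob = sigmoid(v @ W + D * a)
        h_samp = (rng.random(H) < h_prob).astype(float)
        # Negative phase (CD-1): resample D words from the softmax visible
        # layer, then re-infer the hidden probabilities.
        p_v = softmax(W @ h_samp + b)
        v_neg = rng.multinomial(int(D), p_v).astype(float)
        h_neg = sigmoid(v_neg @ W + D * a)
        # Contrastive-divergence parameter updates.
        W += lr * (np.outer(v, h_prob) - np.outer(v_neg, h_neg))
        b += lr * (v - v_neg)
        a += lr * D * (h_prob - h_neg)
```

After training, `sigmoid(v @ W + v.sum() * a)` gives a document's distributed topic-feature vector, which is the representation the paper feeds into the multi-label classifier.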
Source: Computer Engineering (《计算机工程》, CAS / CSCD / Peking University core journal), 2015, No. 9, pp. 209-214 (6 pages).
Funding: National Natural Science Foundation of China (71172219); National Innovation Fund for Technology-based SMEs (11C26213402013).
Keywords: topic model; undirected graphical model; Replicated Softmax Model (RSM); semi-supervised model; feature learning

References (22)

1. Blei D M, Ng A Y, Jordan M I. Latent Dirichlet Allocation[J]. Journal of Machine Learning Research, 2003, 3(3): 993-1022.
2. Wei Xing, Croft W B. LDA-based Document Models for Ad-hoc Retrieval[C]// Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 2006: 178-185.
3. Teh Y W, Newman D, Welling M. A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation[M]. Cambridge, USA: MIT Press, 2006.
4. Elman J L. Distributed Representations, Simple Recurrent Networks, and Grammatical Structure[J]. Machine Learning, 1991, 7(2/3): 195-225.
5. Ackley D H, Hinton G E, Sejnowski T J. A Learning Algorithm for Boltzmann Machines[J]. Cognitive Science, 1985, 9(1): 147-169.
6. Tieleman T. Training Restricted Boltzmann Machines Using Approximations to the Likelihood Gradient[C]// Proceedings of the 25th International Conference on Machine Learning. New York, USA: ACM Press, 2008: 1064-1071.
7. Freund Y, Haussler D. Unsupervised Learning of Distributions on Binary Vectors Using Two Layer Networks[J]. Neural Computation, 2002, 14(8): 1711-1800.
8. Hinton G E. Products of Experts[C]// Proceedings of the 9th International Conference on Artificial Neural Networks. Washington D. C., USA: IEEE Press, 1999: 1-6.
9. Younes L. On the Convergence of Markovian Stochastic Algorithms with Rapidly Decreasing Ergodicity Rates[J]. International Journal of Probability and Stochastic Processes, 1999, 65(3/4): 177-228.
10. Boureau Y, Cun Y L. Sparse Feature Learning for Deep Belief Networks[D]. New York, USA: New York University, 2007.

