期刊文献+

基于结构化事件的主题表示

Topic representation based on structured events
下载PDF
导出
摘要 基于词袋的主题模型其最终主题表示存在着表意不清、可读性差等问题,为解决此问题,提出将事件作为文档和主题描述的基本元素进行主题建模。鉴于事件的稀疏性,采用基于Biterm的主题模型,并在主题推断时结合generalized Pólya urn(GPU)模型加入事件间关联性的先验知识进行指导监督,从共现和语义两个层面削弱了事件稀疏性对主题生成的负作用。实验结果表明,该算法得到的主题可解释性较好且聚类效果提升明显。 Topic models based on bag-of-words got topics represented by individual words which were difficult to understand.This paper thus proposed using events as documents' terms,meanwhile,considered Biterm topic model combined with GPU model which incorporated event correlation knowledge for inference,alleviated the adverse effect of event sparsity from co-occurrence and semantic two aspects. Experimental results show that the proposed method guarantees both interpretable and coherent topics.
出处 《计算机应用研究》 CSCD 北大核心 2017年第3期671-674,共4页 Application Research of Computers
基金 国家自然科学基金面上资助项目(61373108) 国家社科重大招标计划资助项目(11&ZD189)
关键词 主题模型 主题表示 事件语义 GPU模型 topic model topic representation event semantic GPU model
  • 相关文献

参考文献1

二级参考文献14

  • 1Blei D,Ng A,Jordan M.Latent Dirichlet allocation[J].Journal of Machine Learning Research,2003:3,993-1022.
  • 2Griffiths T L,Steyvers M.A Probabilistic Approach to Semantic Representation[C]∥ Proceedings of the 24th Annual Conference of the Cognitive Science Society,2002:381-386.
  • 3Griffiths T L,Steyvers M.Prediction and Semantic Association[C]∥ Advances in Neural Information Processing Systems,2003,15:11-18.
  • 4Griffiths T L,Steyvers M.Finding Scientific Topics[C]∥ Proceedings of the National Academy of Science,2004:5228-5235.
  • 5Hofmann T.Probabilistic Latent Semantic Analysis[C]∥ Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence.1999:289-296.
  • 6Deerwester S,Dumais S,Furnas G,et al.Indexing by Latent Semantic Analysis[J].Journal of the American Society for Information Science,1990,41:391-407.
  • 7Hofmann T.Unsupervised Learning by Probabilistic Latent Semantic Analysis[J].Machine Learning Journal,2001,42(1):177-196.
  • 8Blei D,Lafferty J.Correlated Topic Models[C]∥ Advances in Neural Information Processing Systems,2006,18:147-154.
  • 9Blei D,Griffiths T,Jordan M,et al.Hierarchical Topic Models and the Nested Chinese Restaurant[C]∥ Advances in Neural Information Processing Systems,2004,16:17-24.
  • 10Li W,McCallum A.Pachinko Allocation:DAG-Structured Mixture Models of Topic Correlations[C]∥ 23th International Conference on Machine Learning,2006:577-584.

共引文献46

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部