Hierarchical topic modeling with nested hierarchical Dirichlet process
Authors: Yi-qun DING, Shan-ping LI, Zhen ZHANG, Bin SHEN. Journal of Zhejiang University-Science A (Applied Physics & Engineering), indexed in SCIE, EI, CAS, CSCD. 2009, Issue 6, pp. 858-867 (10 pages).
This paper deals with the statistical modeling of latent topic hierarchies in text corpora. The height of the topic tree is assumed to be fixed, while the number of topics on each level is unknown a priori and is inferred from the data. Taking a nonparametric Bayesian approach to this problem, we propose a new probabilistic generative model based on the nested hierarchical Dirichlet process (nHDP) and present a Markov chain Monte Carlo sampling algorithm for inferring the topic tree structure as well as the word distribution of each topic and the topic distribution of each document. Our theoretical analysis and experimental results show that this model can produce a more compact hierarchical topic structure and capture more fine-grained topic relationships than the hierarchical latent Dirichlet allocation model.
Keywords: Topic modeling; Natural language processing; Chinese restaurant process; Hierarchical Dirichlet process; Markov chain Monte Carlo; Nonparametric Bayesian statistics
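
The abstract leaves the number of topics on each level to be inferred from data, and the keywords name the Chinese restaurant process (CRP) as one of the building blocks. As a rough illustration of how such a prior avoids fixing the number of clusters in advance, the sketch below draws seating assignments from a plain, single-level CRP. It is not the paper's nHDP model or its MCMC sampler; the function name `crp_assignments` and the parameters `alpha` and `seed` are assumptions made for this example.

```python
import random


def crp_assignments(n_customers, alpha, seed=0):
    """Seat customers one by one under a Chinese restaurant process.

    alpha is the concentration parameter (assumed > 0): larger values
    make opening a new table more likely. Illustrative sketch only.
    """
    rng = random.Random(seed)
    table_sizes = []   # table_sizes[k] = number of customers at table k
    assignments = []   # assignments[i] = table index of customer i
    for _ in range(n_customers):
        # An existing table k is chosen with probability proportional to
        # its current size; a new table with probability proportional to alpha.
        weights = table_sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(table_sizes):
            table_sizes.append(1)      # open a new table
        else:
            table_sizes[k] += 1        # join an existing table
        assignments.append(k)
    return assignments, table_sizes


if __name__ == "__main__":
    assignments, table_sizes = crp_assignments(n_customers=100, alpha=1.0)
    print("number of tables:", len(table_sizes))
    print("table sizes:", table_sizes)
```

The number of occupied tables is a random outcome of the draw rather than an input, which is the property a nonparametric topic model relies on when the number of topics per level is unknown.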