期刊文献+

基于依存连接权VSM的子话题检测与跟踪方法 被引量:10

Sub-topic detection and tracking based on dependency connection weights for vector space model
下载PDF
导出
摘要 针对在新闻话题中报道突发、热点相似且子话题层次丰富的现象,依据增量TF-IDF值构造特征维,生成全局向量;然后在时间窗内生成特征连接权的局部邻接图,利用依存句法进行分析降维;最后采用领域词典加权,时间阈值衰减;从而构造出利用依存连接权VSM进行关联分析的子话题检测与跟踪(sTDT)计算方法。实验表明,利用依存关联分析使文本表示由线性变为平面结构,能够有效地提取描述子话题;在人工标注的测试语料下,其最小DET代价比经典方法至少降低2.2%。 Aiming at the phenomenon that there are abrupt reports, similar topics and abundant levels of subtopics in the news, a novel method based on relationship analysis using dependent sentence pattern was proposed for sub-topic detection and tracking (sTDT), which constructed feature dimensions to generate the global vectors according to the increment of TF-IDF, and then created the partial adjoin map based on the connection weights within the time window and decreased the dimensions through dependent sentence pattern. Finally, a novel method for sTDT computing was built with adjoins dictionary weights and time threshold attenuation. Experiments show that the proposed method transferrs the text from linear to plane structure, and extracts the subtopics effectively, of which the minimum DET cost is reduced by at least 2.2 percent than that of classical methods.
出处 《通信学报》 EI CSCD 北大核心 2013年第8期1-9,共9页 Journal on Communications
基金 海军工程大学科学研究基金资助项目(HGDYDJJ10008)~~
关键词 话题检测与跟踪 依存连接权 关联词对 报道关系检测 向量空间模型 topic detection and tracking dependency connection weights associating words group report relation detection vector space model
  • 相关文献

参考文献13

  • 1洪宇,张宇,刘挺,李生.话题检测与跟踪的评测及研究综述[J].中文信息学报,2007,21(6):71-87. 被引量:153
  • 2ALLAN J, JIN H, RAJMAN M, et al. Topic-based novelty detection[A]. Proceedings of the Johns Hopkins Summer Workshop[C]. CLSP, Baltimore, 1999.
  • 3ALLAN J, LAVRENKO V, JIN H. First story detection in TDT is hard[A]. Proceedings of 9th Conference on Information Knowledge Management[C]. Washington, DC, 2000. 374-381.
  • 4PONTE J, CROFT W B. Text segmentation by topic[A]. Proceedings of the European Conference on Research and Advanced Technology for Digital Libraries[C]. Europe: ECDL, 1997. 113-125.
  • 5TU Y N, SENG J L. Indices of novelty for emerging topic detection[J]. Information Processing and Management, 2012, 48:303-325.
  • 6ZENG J P, ZHANG S Y. Variable space hidden markov model for topic detection and analysis[J]. Knowledge-Based Systems, 2007, 20: 607-613.
  • 7ZENG J P, ZHANG S Y. Incorporating topic transition in topic detec- tion and tracking algorithms[J]. Expert Systems with Application, 2009, 36:227-232.
  • 8ZHAO Z Y, FENG S Z, WANG Q, et al. topic oriented community detection through social objects and link analysis in social networks[J]. Knowledge-Based Systems, 2012, 26: 164-173.
  • 9FLORIAN H, SVEN T. Towards automatic detection and tracking of topic change[A]. Proc of Cicling 2010[C]. Iasi, Romania, LNCS 2010. 327-339.
  • 10TESNIERE L, ELEMENTS D. Syntaxe Structurale[M]. Paris: Klincksieck, 1959.

二级参考文献68

共引文献152

同被引文献111

引证文献10

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部