Dynamic Generation Method for a Discriminative Topic Dictionary
Abstract: A topic dictionary is the subset of features that describes a topic. A discriminative topic dictionary reduces feature dimensionality while describing the topic more accurately, which in turn improves the overall performance of topic detection and tracking. Building on mutual information, this paper proposes an objective function for determining the size of the initial topic dictionary and solves it with coordinate descent. Because news topics change and develop over time, the paper also gives a dynamic updating method for the topic dictionary that incorporates time information, yielding a discriminative topic dictionary. Experiments on the TDT4 corpus use miss probability and false alarm probability as evaluation criteria to compare incremental TF-IDF with the proposed topic dictionary generation method. The results show that the proposed method performs better.
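The abstract does not state the objective function or the update rule, so the following is only a minimal sketch of the underlying building block: scoring each term by its mutual information with topic membership and keeping the top k terms as a candidate dictionary. The function names, the representation of documents as token sets, and the fixed k are all assumptions made for illustration; the paper determines the dictionary size through its own objective function, which is not reproduced here.

```python
import math
from collections import Counter


def mutual_information(docs, labels, term):
    """Mutual information (in bits) between the presence of `term`
    in a document and a binary on-topic label.

    docs   : list of sets of tokens (hypothetical representation)
    labels : list of 0/1 flags, 1 meaning the document is on-topic
    """
    n = len(docs)
    # Joint counts over (term present?, on-topic?)
    joint = Counter((term in doc, bool(label)) for doc, label in zip(docs, labels))
    mi = 0.0
    for (t, c), n_tc in joint.items():
        n_t = sum(v for (tt, _), v in joint.items() if tt == t)  # marginal for term
        n_c = sum(v for (_, cc), v in joint.items() if cc == c)  # marginal for label
        mi += (n_tc / n) * math.log2(n_tc * n / (n_t * n_c))
    return mi


def top_k_terms(docs, labels, k):
    """Return the k terms with the highest mutual information score.
    Ties are broken alphabetically so the result is deterministic."""
    vocab = sorted(set().union(*docs))
    scored = [(mutual_information(docs, labels, t), t) for t in vocab]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in scored[:k]]


# Toy data, invented purely for illustration
docs = [{"quake", "rescue"}, {"quake", "aid"}, {"election", "vote"}, {"vote", "aid"}]
labels = [1, 1, 0, 0]
print(top_k_terms(docs, labels, 2))
```

Note that mutual information is symmetric in direction: a term that appears only in off-topic documents scores as high as one that appears only in on-topic documents, so a practical dictionary builder would likely add a sign or association check on top of this score.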
Authors: Wu Shufang (吴树芳), Zhu Jie (朱杰), Xu Jianmin (徐建民) (College of Management; College of Computer Science & Technology, Hebei University, Baoding, Hebei 071000, China; College of Management & Economics, Tianjin University, Tianjin 300072, China; Dept. of Information Management, Central Institute for Correctional Police, Baoding, Hebei 071000, China)
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2017, No. 9, pp. 2723-2726 (4 pages)
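Funding: Youth Fund of the Hebei Provincial Department of Education (QN2015099); Hebei Province Social Science Fund (HB15TQ013); Hebei University Special Fund for Improving Comprehensive Strength in the Central and Western Regions; Natural Science Foundation of Hebei Province (F2015201142); National Social Science Fund of China (17BTQ068)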
Keywords: topic dictionary; mutual information; dynamic updating; objective function