期刊文献+

基于信息粒度的主题相似性信息检索

Topic similarity information retrieval based on information granularity
下载PDF
导出
摘要 主题相似性计算是信息检索、文本分类等领域内的一个研究热点。基于信息粒度原理,内容主题识别和事件主题识别是在不同粒度世界进行的识别计算。本研究将传统的内容主题识别算法与基于TDT的事件主题识别算法相结合,提出1种新的主题相似性计算模型,即先进行内容主题识别,再进行事件主题识别。试验结果表明:该模型具有良好的分类效果。 Topic similarity computation model is a research hotspot in fields of information retrieval and text classification.From the view of information granularity,it is very clear that similarity computations of topic identification are under different granularity.According to this principal,this paper integrates traditional content topic identification with event topic identification,and presents a new topic similarity computation model,i.e.,first carrying out content identification,secondly event identification.The experiments show that this model can obtain satisfactory classification result.
出处 《河北农业大学学报》 CAS CSCD 北大核心 2011年第1期114-118,共5页 Journal of Hebei Agricultural University
关键词 信息检索 信息粒度 TDT 主题识别 概念空间 information retrieval information granularity TDT topic identification concept space
  • 相关文献

参考文献8

  • 1Wayne Charles L.Multilingual Topic Detection and Tracking:Successful Research Enabled by Corpora and Evaluation[C] //In Proceedings of the Second International Language Resources and Evaluation Conference.Athens:e-society,2000:1487-1494.
  • 2朱靖波 姚天顺.面向TDT的主题相似性计算模型[C]..语言计算与基于内容的文本处理[C].北京:清华大学出版社,2004.476-481.
  • 3卜东波,白硕,李国杰.聚类/分类中的粒度原理[J].计算机学报,2002,25(8):810-816. 被引量:95
  • 4陆玉昌,鲁明羽,李凡,周立柱.向量空间法中单词权重函数的分析和构造[J].计算机研究与发展,2002,39(10):1205-1210. 被引量:126
  • 5Hsinchun Chen,Tobun D Ng.A Concept Space Approach to Addressing the Vocabulary Problem in Scientific Information Retrieval:An Experiment on the Worm Community System[J].Journal of the American Society for Information Science.1997,48(1):17-31.
  • 6董振东.《知网》进展与应用[EB/OL].http://www.keenage.com/html/cindex.html.2002-07-10.
  • 7Ellen Riloff,Rosie Jones.Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping[A] //In:Proceedings of the sixteenth national conference on Artificial intelligence (AAAI-99),CA:American Association for Artificial Intelligence,1999:474-479.
  • 8郑家恒,王兴义,李飞.信息抽取模式自动生成方法的研究[J].中文信息学报,2004,18(1):48-54. 被引量:22

二级参考文献12

  • 1王珏,苗夺谦,周育健.关于Rough Set理论与应用的综述[J].模式识别与人工智能,1996,9(4):337-344. 被引量:264
  • 2苗夺谦.Rough Set理论在机器学习中的应用研究:博士学位论文[M].北京:中国科学院自动化研究所,1997..
  • 3Vapnik V N.统计学习理论的本质(中文版)[M].北京:清华大学出版社,2000..
  • 4黄萱菁.大规模中文文本的检索、分类与摘要研究:博士学位论文[M].上海:复旦大学,1998..
  • 5[1]Ellen Riloff. Automatically Constructing a Dictionary for Information Extraction Tasks[C]. In: Proceedings of the Eleventh National Conference on Artificial Intelligence, 811-816. AAAI Press/ The MIT Press, 1993.
  • 6[2]Stephen Soderland, David Fisher, Jonathan Aseltine, and Wendy Lehnert. CRYSTAL: Inducing a conceptual dictionary[C]. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1314-1319, 1995.
  • 7[3]Ellen Riloff. Automatically Generating Extraction Patterns from Untagged Text[C]. In: Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI-96), 1044-1049. 1996.
  • 8[4]Ellen Riloff, Rosie Jones. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping[C]. In: Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99), Orlando FL. 1999.
  • 9[5]Roman Yangarber, Ralph Grishman, Pasi Tapanainen and Silja Huttunen. Unsupervised Discovery of Scenario-Level Patterns for Information Extraction[C]. In: Proceedings of Sixth Applied Natural Language Processing Conference (ANLP-2000), 282-289, Seattle WA. 2000.
  • 10邵健.基于Rough Sets的信息粒度计算及其应用:硕士学位论文[M].北京:中国科学院自动化研究所,2000..

共引文献237

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部