期刊文献+

基于多角度关联模型的实体检索方法 被引量:1

Entity Retrieval Method Based on Multi-perspective Association Model
下载PDF
导出
摘要 针对信息检索领域特定类型实体的检索问题,在传统搜索引擎的基础上,提出一种基于多角度关联模型的实体检索方法,综合运用实体名识别(NER)、文本向量、关联规则等技术以及Wikipedia、Stanford NER等工具,并在TREC2010实体检索项目中进行评测。实验结果表明,与基于BM25和贝叶斯模型的检索方法相比,该方法的nDCG@R值平均提高11.49%和18.09%。 This paper proposes an entity search method based on multi-perspective association model for the problem of searching particular type of entities in information retrieval field.The method employs Named Entity Recognition(NER),text vector,association rules,etc,and traditional search engines as well as Wikipedia,Stanford NER etc.Experimental result on the large Web data collection provided show that,compared with BM25 and traditional Bayesian model,this method increases nDCG@R by 11.49% and 18.09% separately.
作者 王东 牛军钰
出处 《计算机工程》 CAS CSCD 2013年第1期71-75,共5页 Computer Engineering
基金 国家"863"计划基金资助项目(2009AA01Z429)
关键词 文本挖掘 关联规则 实体检索 实体名识别 词频-逆文档频率 维基百科 搜索引擎 text mining association rule entity retrieval Named Entity Recognition(NER) Term Frequency Inverse Document Frequency(TF-IDF) Wikipedia search engine
  • 相关文献

参考文献11

  • 1王宏志,樊文飞.复杂数据上的实体识别技术研究[J].计算机学报,2011,34(10):1843-1852. 被引量:19
  • 2Imielinski A R T,Swami A. Mining Association Rules Between Sets of Items in Large Databases[A].Washington D.C,USA:ACM Press,1993.
  • 3邓志鸿,唐世渭,张铭,杨冬青,陈捷.Ontology研究综述[J].北京大学学报(自然科学版),2002,38(5):730-738. 被引量:765
  • 4拜战胜,徐德智,彭佳红,陈光仪.基于主题本体的信息采集模型研究[J].计算机技术与发展,2009,19(10):102-105. 被引量:4
  • 5Wang Zhanyi,Tang Chunsong,Sun Xueji. PRIS at TREC 2010:Related Entity Finding Task of Entity Track[A].Gaithersburg,USA:[s.n.],2010.
  • 6Wu Youzheng,Hori C,Kawai H. NiCT at TREC 2010:Related Entity Finding[A].Gaithersburg,USA:[s.n.],2010.
  • 7Lei Cao,Lu Bai,Cheng Xueqi. ICTNET at Entity Track TREC 2010[A].Gaithersburg,USA:[s.n.],2010.
  • 8Salton G,McGill M. Introduction to Modem Information Retrieval[M].New York,USA:McGraw-Hill,1983.
  • 9Liu Zhiyuan,Huang Wenyi,Zheng Yabin. Automatic Keyphrase Extraction via Topic Decomposition[A].Washington D.C,USA:[s.n.],2010.
  • 10姚静;郑佳谦;徐隽.Intranet中Web对象的属性挖掘[A]桂林:中国计算机学会,2008.

二级参考文献87

  • 1陈康,武港山.基于Ontology的信息检索技术研究[J].中文信息学报,2005,19(2):51-57. 被引量:29
  • 2宋峻峰,张维明,肖卫东,唐九阳.基于本体的信息检索模型研究[J].南京大学学报(自然科学版),2005,41(2):189-197. 被引量:44
  • 3Tijerino Y A, Sanati R. Onto TEMAS: an ontology based teaching materials search engine[J]. Journal of Computing Sciences in Colleges,2005,20(4) : 177 - 182.
  • 4[13]SENSUS.http://www.isi.edu/natural-language/resources/sensus.html
  • 5[14]Mikrokmos.http://crl.nmsu.edu/Research/Projects/mikro/
  • 6[15]Guarino N.Semantic Matching:Formal Ontological Distinctions for Information Organization,Extraction,and Integration.In:Pazienza M T,eds.Information Extraction:A Multidisciplinary Approach to an Emerging Information Technology,Springer Verlag,1997,139~170
  • 7[16]Perez A G,Benjamins V R.Overview of Knowledge Sharing and Reuse Components:Ontologies and Problem-Solving Methods.Workshop on Ontologies and Problem-Solving Methods:Lessons Learned and Future Trends (IJCAI99),de Agosto,Estocolmo,1999
  • 8[17]Gruber T R.Towards Principles for the Design of Ontologies Used for Knowledge Sharing.International Journal of Human-Computer Studies,1995,43:907~928
  • 9[18]Guarino N,Welty C.A Formal Ontology of Properties.In:Dieg R,Corby O,eds.the Proceedings of the 12th International Conference on Knowledge Engineering and Knowledge Management (EKAW'2000),Springer Verlag,2000,97~112
  • 10[19]Guarino N,Masolo C,Vetere G.OntoSeek:Content-Based Access to the Web.IEEE Intelligent Systems,1999,14(3):70~80

共引文献785

同被引文献1

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部