Design and Implementation of Automatic Extraction Relevance Terms System Based on Multi-context (Cited by: 6)
Abstract: Using a corpus, definition dictionaries, and users' query logs as contexts for identifying relevance terms, this paper designs and implements an automatic relevance term extraction system. Experimental results show that, although all three contexts target the same base vocabulary, the overlap ratio among the relevance terms extracted from different contexts is very low and the results are highly complementary, so integrating them is necessary. The system builds the final relevance term table by direct integration.
Source: 《现代图书情报技术》 (New Technology of Library and Information Service), CSSCI, PKU Core Journal, 2006, No. 9, pp. 23-28, 80 (7 pages)
Keywords: relevance term; multi-context; corpus; definition dictionary; query log
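The abstract's two central ideas, measuring overlap between per-context results and merging them by direct integration, can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the record does not give the paper's formula for the overlap ratio (the Jaccard index is used as a stand-in), the function names `overlap_ratio` and `integrate` are hypothetical, and the term sets are invented toy data rather than results from the paper.

```python
from itertools import combinations


def overlap_ratio(a, b):
    """Jaccard index of two term sets (an assumed definition of 'overlap ratio')."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def integrate(results):
    """Direct integration: union the terms per base word, recording source contexts."""
    table = {}
    for context, per_word in results.items():
        for base_word, terms in per_word.items():
            for term in terms:
                table.setdefault(base_word, {}).setdefault(term, set()).add(context)
    return table


# Hypothetical toy results for one base word, one entry per context.
results = {
    "corpus": {"computer": {"software", "hardware", "network"}},
    "dictionary": {"computer": {"machine", "calculator", "hardware"}},
    "query_log": {"computer": {"laptop", "notebook", "software"}},
}

# Pairwise overlap between contexts for the same base word.
for (c1, r1), (c2, r2) in combinations(results.items(), 2):
    print(c1, c2, overlap_ratio(r1["computer"], r2["computer"]))

# Merged relevance term table: term -> contexts that proposed it.
table = integrate(results)
print(sorted(table["computer"]))
```

Keeping the set of source contexts for each term is a design choice of this sketch, not something the abstract states; it makes it easy to see which terms only one context contributed, which is the complementarity the abstract describes.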
