
An approach to fuzzy query expansion based on synonymy thesaurus

Cited by: 17
Abstract: Query expansion (QE) is an effective way to improve the performance of an information retrieval (IR) system. This paper proposes a fuzzy QE method based on a synonymy thesaurus. The thesaurus is built from the synonym sets of the lexical database WordNet, and the similarity between two synonyms, a value in [0, 1], is computed with the Tanimoto coefficient. The thesaurus is then used to expand queries. The method is combined with a modified vector space model and applied to a text information retrieval system. The resulting retrieval model can be regarded as a simple semantic model, and the degree of expansion can be controlled through a threshold. Experimental results show that an IR system using this QE method achieves somewhat better retrieval performance than a conventional IR system.
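The abstract does not say which sets the Tanimoto coefficient is computed on, so the following is only a minimal sketch under one assumption: each word is represented by the set of WordNet synset identifiers it belongs to, and two words are compared by the Tanimoto coefficient of those sets, |A ∩ B| / (|A| + |B| - |A ∩ B|). The synset IDs, the toy table `SYNSETS`, and the function names are hypothetical illustrations and are not taken from the paper.

```python
from typing import Dict, List, Set


def tanimoto(a: Set[str], b: Set[str]) -> float:
    """Tanimoto coefficient of two sets; the result lies in [0, 1]."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def expand_query(
    terms: List[str],
    synsets: Dict[str, Set[str]],
    threshold: float,
) -> Dict[str, float]:
    """Expand a query with synonyms whose similarity reaches the threshold.

    Original query terms keep weight 1.0. Each added synonym gets its
    similarity to the closest query term as its weight (an assumption).
    """
    weights = {t: 1.0 for t in terms}
    for t in terms:
        if t not in synsets:
            continue  # no thesaurus entry: the term is left unexpanded
        for other, groups in synsets.items():
            if other == t:
                continue
            s = tanimoto(synsets[t], groups)
            if s >= threshold:
                weights[other] = max(weights.get(other, 0.0), s)
    return weights


# Hypothetical toy thesaurus: word -> IDs of the synsets that contain it.
SYNSETS: Dict[str, Set[str]] = {
    "car": {"s1", "s2"},
    "auto": {"s1"},
    "automobile": {"s1", "s2"},
    "railcar": {"s2", "s3"},
    "bank": {"s4"},
}

# A higher threshold gives a narrower expansion, a lower one a broader one.
print(expand_query(["car"], SYNSETS, threshold=0.5))
print(expand_query(["car"], SYNSETS, threshold=0.3))
```

In a vector space model, the returned weights could be used to scale the expanded terms in the query vector, so that weaker synonyms contribute less than the original terms. How the paper itself weights expanded terms is not stated in the abstract.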
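Source: Journal of Dalian University of Technology (indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2007, No. 3, pp. 439-443 (5 pages)
Funding: Project funded by JustSystems Corporation of Japan (日本佳思腾株式会社)
Keywords: fuzzy query expansion; synonymy thesaurus; information retrieval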

