期刊文献+

一种基于词向量的模糊查询扩展方法 被引量:1

Query Expansion Based on Word Embedding in Fuzzy Document Retrieval
下载PDF
导出
摘要 在中文文本信息中,同一个语义往往有多种不同的表达方法,不同的个体对同一个词语理解也会有一定的偏差,这将导致在信息检索时,出现查询项与检索数据"词不匹配"的问题.虽然,模糊检索是改善这一问题的有效方法之一,但仅仅利用已知信息进行模糊检索,已不能满足充斥着大规模无标定文本信息的网络时代的检索需要.提出一个基于词向量的模糊检索查询扩展方法,通过词向量计算查询项的相似词,进而进行查询项扩展.相比与传统的模糊检索方法,在同一测试集中,基于词向量的模糊查询扩展方法测评出的查全率、查准率以及两者的调和平均数均得到了有效提升. There are different ways to express the same word sense in Chinese.When different individuals learn and understand the same words,deviations will appear.This results in term mismatch between queries and documents.A fuzzy document retrieval system is one of the effective method to solve the problem.However,it can not achieve satisfying results,when we deal with large-scale unmarked data.An approach to query expansion based on word embedding in fuzzy document retrieval is proposed to settle the issue in this paper.The word embedding,being trained in a large number of corpus with the continuous bag-of-words model,is used to gain the similar word,and then the fuzzy query is expanded.Compared with the traditional fuzzy retrieval method,the recall ratio,precision ratio and the harmonic average of them are all increased.
作者 陈淑巧 邱东 江海欢 CHEN Shuqiao;QIU Dong;JIANG Haihuan(College of Mathematics and Physics, Chongqing University of Posts and Telecommunications, Chongqing 400065)
出处 《四川师范大学学报(自然科学版)》 CAS 北大核心 2019年第1期92-97,共6页 Journal of Sichuan Normal University(Natural Science)
基金 国家自然科学基金(11671001和61472056)
关键词 词向量 模糊查询项扩展 信息检索 word embedding fuzzy query expansion information retrieval
  • 相关文献

参考文献4

二级参考文献57

  • 1刘煜,郭利.ISI Web of Knowledge(R)平台上信息资源的收集[J].中国科技期刊研究,2003,14(z1):732-736. 被引量:4
  • 2孙思,张玉峰.基于模糊逻辑推理的信息检索方法[J].图书情报知识,1996,13(1):41-44. 被引量:2
  • 3吴友政,赵军,徐波.基于主题语言模型的句子检索算法[J].计算机研究与发展,2007,44(2):288-295. 被引量:8
  • 4Miyamoto S. Information Retrieval Based on Fuzzy Association. Fuzzy Sets and System, 1990, 38 (2):191-205.
  • 5NIST.Text Retrieval Conference[EB/OL].[2003-04-08] http:∥trec.nist.gov/
  • 6钱学森图书馆医学分馆.信息检索基础知识:检索效率及评价[EB/OL].[2007-01-10] http:∥202.117.24.24/html/xjtu/kejian/yxkj/pages/bjjc/chap-ter1/7.htm
  • 7LEE H M,LIN S K,HUANG C W.Interactive query expansion based on fuzzy association thesaurus for Web information retrieval[C]∥ Proceedings of the 10th IEEE International Conference on Fuzzy Systems.Australia:[s n],2001:724-727
  • 8LIM J,SEUNG H,HWANG J,et al.Query expansion for intelligent information retrieval on internet[C]∥ Proceedings of Parallel and Distributed Systems International Conference.Washington:IEEE Computer Society,1997:656-662
  • 9Cognitive Science Laboratory,Princeton University.WordNet[EB/OL].[2003-10-10] http:∥www.cogsci.princeton.edu/~wn/
  • 10MANDALA R,TOKUNAGA T,TANAKA H.Query expansion using heterogeneous thesauri[J].Inf Process and Manage,2000,36:361-378

共引文献30

同被引文献14

引证文献1

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部