期刊文献+
共找到6篇文章
< 1 >
每页显示 20 50 100
Query Expansion Based on Semantics and Statistics in Chinese Question Answering System 被引量:2
1
作者 JIA Keliang PANG Xiuling +1 位作者 LI Zhinuo FAN Xiaozhong 《Wuhan University Journal of Natural Sciences》 CAS 2008年第4期505-508,共4页
In Chinese question answering system, because there is more semantic relation in questions than that in query words, the precision can be improved by expanding query while using natural language questions to retrieve ... In Chinese question answering system, because there is more semantic relation in questions than that in query words, the precision can be improved by expanding query while using natural language questions to retrieve documents. This paper proposes a new approach to query expansion based on semantics and statistics Firstly automatic relevance feedback method is used to generate a candidate expansion word set. Then the expanded query words are selected from the set based on the semantic similarity and seman- tic relevancy between the candidate words and the original words. Experiments show the new approach is effective for Web retrieval and out-performs the conventional expansion approaches. 展开更多
关键词 Chinese question answering system query expansion relevance feedback semantic similarity semantic relevancy
下载PDF
Deep Neural Network and Pseudo Relevance Feedback Based Query Expansion
2
作者 Abhishek Kumar Shukla Sujoy Das 《Computers, Materials & Continua》 SCIE EI 2022年第5期3557-3570,共14页
The neural network has attracted researchers immensely in the last couple of years due to its wide applications in various areas such as Data mining,Natural language processing,Image processing,and Information retriev... The neural network has attracted researchers immensely in the last couple of years due to its wide applications in various areas such as Data mining,Natural language processing,Image processing,and Information retrieval etc.Word embedding has been applied by many researchers for Information retrieval tasks.In this paper word embedding-based skip-gram model has been developed for the query expansion task.Vocabulary terms are obtained from the top“k”initially retrieved documents using the Pseudo relevance feedback model and then they are trained using the skip-gram model to find the expansion terms for the user query.The performance of the model based on mean average precision is 0.3176.The proposed model compares with other existing models.An improvement of 6.61%,6.93%,and 9.07%on MAP value is observed compare to the Original query,BM25 model,and query expansion with the Chi-Square model respectively.The proposed model also retrieves 84,25,and 81 additional relevant documents compare to the original query,query expansion with Chi-Square model,and BM25 model respectively and thus improves the recall value also.The per query analysis reveals that the proposed model performs well in 30,36,and 30 queries compare to the original query,query expansion with Chi-square model,and BM25 model respectively. 展开更多
关键词 Information retrieval query expansion word embedding neural network deep neural network
下载PDF
A Novel Web Query Automatic Expansion Based on Rough Set
3
作者 YI Gaoxiang HU Heping +1 位作者 LU Zhengding LI Ruixuan 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1167-1171,共5页
One of important reasons caused low precision was presented, which was due to inaccurate express of the query. So a new method of automatic query expansion based on tolerance rough was put forward. In the algorithm, t... One of important reasons caused low precision was presented, which was due to inaccurate express of the query. So a new method of automatic query expansion based on tolerance rough was put forward. In the algorithm, the uncertain connection between query terms and retrial documents was described as term tolerance class. The upper approximation set of query sentence was considered as query expansion. The new additional terms were also given weight numbers. The results of experiment on collection of Google 5 000 Web pages showed that the approach was effective on query expansion and high search precision was gained. 展开更多
关键词 Web query query expansion rough set
下载PDF
Strength Pareto fitness assignment for pseudo-relevance feedback: application to MEDLINE 被引量:1
4
作者 Ilyes KHENNAK Habiba DRIAS 《Frontiers of Computer Science》 SCIE EI CSCD 2018年第1期163-176,共14页
Because of users' growing utilization of unclear and imprecise keywords when characterizing their informa- tion need, it has become necessary to expand their original search queries with additional words that best ca... Because of users' growing utilization of unclear and imprecise keywords when characterizing their informa- tion need, it has become necessary to expand their original search queries with additional words that best capture their actual intent. The selection of the terms that are suitable for use as additional words is in general dependent on the degree of relatedness between each candidate expansion term and the query keywords. In this paper, we propose two criteria for evaluating the degree of relatedness between a candidate expansion word and the query keywords: (1) co-occurrence frequency, where more importance is attributed to terms oc- curring in the largest possible number of documents where the query keywords appear; (2) proximity, where more im- portance is assigned to terms having a short distance from the query terms within documents. We also employ the strength Pareto fitness assignment in order to satisfy both criteria si- multaneously. The results of our numerical experiments on MEDLINE, the online medical information database, show that the proposed approach significantly enhances the re- trieval performance as compared to the baseline. 展开更多
关键词 information retrieval query expansion pseudo-relevance feedback PROXIMITY multi-objective optimization Pareto dominance MEDLINE
原文传递
The Application of the Comparable Corpora in Chinese-English Cross-Lingual Information Retrieval
5
作者 杜林 张毅波 +1 位作者 孙乐 孙玉芳 《Journal of Computer Science & Technology》 SCIE EI CSCD 2001年第4期351-358,共8页
This paper proposes a novel Chinese-English Cross-Lingual Information Retrieval (CECLIR) model PME, in which bilingual dictionary and comparable corpora are used to translate the query terms. The Proximity and mutua... This paper proposes a novel Chinese-English Cross-Lingual Information Retrieval (CECLIR) model PME, in which bilingual dictionary and comparable corpora are used to translate the query terms. The Proximity and mutual information of the term-pairs in the Chinese and English comparable corpora are employed not only to resolve the translation ambiguities but also to perform the query expansion so as to deal with the out-of-vocabulary issues in the CECLIR. The evaluation results show that the query precision of PME algorithm is about 84.4% of the monolingual information retrieval. 展开更多
关键词 cross-lingual information retrieval comparable corpus mutual information query expansion
原文传递
Visual polysemy and synonymy:toward near-duplicate image retrieval
6
作者 Manni DUAN Xiuqing WU 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2010年第4期419-429,共11页
that are duplicate or near duplicate to a query image.One of the most popular and practical methods in near-duplicate image retrieval is based on bag-of-words(BoW)model.However,the fundamental deficiency of current Bo... that are duplicate or near duplicate to a query image.One of the most popular and practical methods in near-duplicate image retrieval is based on bag-of-words(BoW)model.However,the fundamental deficiency of current BoW method is the gap between visual word and image’s semantic meaning.Similar problem also plagues existing text retrieval.A prevalent method against such issue in text retrieval is to eliminate text synonymy and polysemy and therefore improve the whole performance.Our proposed approach borrows ideas from text retrieval and tries to overcome these deficiencies of BoW model by treating the semantic gap problem as visual synonymy and polysemy issues.We use visual synonymy in a very general sense to describe the fact that there are many different visual words referring to the same visual meaning.By visual polysemy,we refer to the general fact that most visual words have more than one distinct meaning.To eliminate visual synonymy,we present an extended similarity function to implicitly extend query visual words.To eliminate visual polysemy,we use visual pattern and prove that the most efficient way of using visual pattern is merging visual word vector together with visual pattern vector and obtain the similarity score by cosine function.In addition,we observe that there is a high possibility that duplicates visual words occur in an adjacent area.Therefore,we modify traditional Apriori algorithm to mine quantitative pattern that can be defined as patterns containing duplicate items.Experiments prove quantitative patterns improving mean average precision(MAP)significantly. 展开更多
关键词 near-duplicate image retrieval bag-of-words(BoW)model visual synonymy visual polysemy extended similarity function query expansion visual pattern
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部