期刊文献+

基于数据挖掘的面向话题搜索引擎研究 被引量:4

Research on Topic Oriented Search Engine Based on Data Mining Technology
下载PDF
导出
摘要 为了解决面向话题的搜索问题,提出一种新的面向话题的检索技术。首先分析了面向话题的搜索技术所面临的问题,然后基于数据挖掘技术提出了解决方案。利用数据挖掘技术抽取文本的多层次语义特征,形成对文本的多精度表示,抽取的特征不仅包括单个词特征也包括多词特征。建立了一个示例检索系统,实验表明利用多层次文本特征能够很好地实现面向话题的文本检索。 A novel topic-oriented text retrieval approach is proposed in this paper. In this approach,data mining techniques are used to extract multi-level semantic features from texts, generating multi-precision representation on text. Features extracted from text include both single word features and multi-word features. With this approach, more significant feature in text can be discovered and used. Extracted features are closed to the essence of texts. Experiments show that multi-level features can be used to create a topic-oriented text retrieval system.
出处 《无线电通信技术》 2011年第5期38-40,共3页 Radio Communications Technology
关键词 信息检索 数据挖掘 文本分析 information retrieval data mining text analysis
  • 相关文献

参考文献5

  • 1KENNETH L V, MOSES C M. Information retrieval in document spaces using clustering [ M ]. Master' s Thesis, Department of Informatics and Mathematical Modelling Technical University of Denmark, Auguest, 2005.
  • 2LIU Shuang, LIU Fang, YU Clement. An Effective Approach to Document Retrieval via Utilizing WordNet and Recognizing Phrases [ C ]//SIGIR' 04, July 25 29, Sheffield, Yorkshire, UK, 2004 : 266-272.
  • 3ZAMIR O, ETZIONI O. Web Document Clustering: A Feasibility Demonstration [ C ] // Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR98 ), 1998 : 46 - 54.
  • 4SALTON G, YANG C S, YU C T. A Theory of Term Importance in Automatic Text Analysis [ J ]. JASIS, 1975, 26(1):33-44.
  • 5HAN Jia-wei, MICHELINE KAMBER. Data Mining: Concepts and Techniques [ M ]. San Fransisco: Morgan Kaufmann Publishers, 2002.

同被引文献38

引证文献4

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部