期刊文献+

基于决策树方法的特定主题Web搜索策略 被引量:3

Subject-specific Web search strategy based on decision tree method
下载PDF
导出
摘要 基于数据挖掘中决策树方法提出了一种新的W eb搜索策略。在该策略中,通过对预先采集的W eb页面进行学习得到一棵决策树,然后对其进行剪枝,得到简化布尔表达式形式的主题内在规律性信息,在其基础上进行查询修改,把修改后的查询转发到通用搜索引擎上,最终得到查询结果。实验结果表明,提出的查询策略对于特定主题的W eb搜索,查询结果的质量有明显的改善和提升。 A new Web search strategy based on decision tree method in data mining was proposed. In this strategy, a decision tree was obtained by studying the Web pages sampled in advance. Then the tree was pruned to get the inherent regularity of the subject in the form of simplified boolean expression, on the basis of which the query was modified and forwarded to general search engines, and finally the searching results were got. The experiment performed indicated that for subject-specific Web search, the quality of query results improved obviously.
作者 李新安 石冰
出处 《计算机应用》 CSCD 北大核心 2006年第1期223-226,共4页 journal of Computer Applications
关键词 查询修改 决策树 信息检索 数据挖掘 机器学习 query modification decision tree information retrieval data mining machine learning
  • 相关文献

参考文献14

  • 1.[EB/OL].http://www.iresearch.com. cn/html/search_engine/detail freeid_12038. html[EB/OL],.
  • 2BUTLER D, Souped-Up Search Engines[J]. Nature, 2000,405:112-115.
  • 3MCCALLUM A, NIGAM K, RENNIE J, et al. A Machine Learning Approach to Building Domain-Specific Search Engines[A]. Pmc,16th Int'l Joint Conf. Artificial Intelligence (IJCAI-99)[C]. 1999.662 - 667.
  • 4GLOVER E, FLAKE G, LAWRENCE S, et al. Improving Category Specific Web Search by Learning Query Modifications[A]. Proc.2001 Symp. Applications and the Internet (SAINT 2001)[C].2001.23 - 31.
  • 5PAHLEVI SM, KITAGAWA H. Taxonomy-Based Adaptive Web Search Method[A]. Proc. Third IEEE Int'l Conf. Information Technology: Coding and Computing( ITCC 2002)[C]. 2002. 320 -325.
  • 6OYAMA S, KOKUBO T, ISHIDA T. Domain-Specific Web Search with Keyword Spices[J]. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(1) : 17 -27.
  • 7韩客松,王永成,陈桂林.无词典高频字串快速提取和统计算法研究[J].中文信息学报,2001,15(2):23-30. 被引量:36
  • 8金翔宇,孙正兴,张福炎.一种中文文档的非受限无词典抽词方法[J].中文信息学报,2001,15(6):33-39. 被引量:28
  • 9HANJ KAMBERM 范明 孟小峰译.数据挖掘概念与技术[M].北京:机械工业出版社,2001..
  • 10OYAMA S, KOKUBO T, ISHIDA T, et al. Keyword Spices: A New Method for Building Domain-Specific Web Search Engines[A]. Proc. 17th Int'l Joint Conf. Artificial Intelligence (IJCAI-01)[C]. 2001. 1457 - 1463.

二级参考文献10

共引文献94

同被引文献45

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部