期刊文献+

基于伪爬行器的主题式元搜索引擎研究与设计

Research and Design of Topic-specific Meta-search Engine Based on Bogus Crawler
下载PDF
导出
摘要 为提高搜索的查准率和查全率,设计一个主题式的元搜索引擎和一个类似于爬行器的伪爬行器,通过调用通用搜索引擎采集信息,查全率高于通用搜索引擎。利用反馈机制,参考用户查询历史记录,搜索结果更加接近用户的要求。通过采用主题式策略,改进文档相似度算法,提高分类的正确率和搜索引擎的查准率与搜索范围,同时减少系统响应时间,降低对服务器性能的要求。 To improve the correct-rate and completeness-rate of search, a topic-specific meta-search engine is designed. A bogus crawler is invented, which collects information by the normal search engines, so that the search-area is wider than the normal search engine. The feedback mechanism is adopted and the search-history of user is considered, which make the search result is more imminent to the purpose of the user. Owing to the strategy of topic-specific and mending the arithmetic of similitude-degree of the texts, the correct-rate is improved. Both the correct-rate and completeness-rate of searching are improved, the response time is decreased as well, at the same time, the request of capability of the server is reduced.
出处 《计算机工程》 CAS CSCD 北大核心 2008年第22期70-72,76,共4页 Computer Engineering
基金 国家"863"计划基金资助项目(2006AA706103) 航空基金资助项目(05F2037)
关键词 元搜索 主题式 搜索引擎 伪爬行器 meta-search topic-specific search engine bogus crawler
  • 相关文献

参考文献5

  • 1Frost R Building Better Search Engines[J]. Computing in Science & Engineering, 2007 9(4): 7-11.
  • 2李志.搜索引擎的缺陷及其完善[J].现代情报,2007,27(1):154-156. 被引量:3
  • 3Keyhanipour A H, Moshiri B, Piroozmand M, et al. Web Fusion: Fundamentals and Principals of a Novel Meta-search Engine[C]// Proceedings of 2006 IEEE international Joint Conference on Neural Network. [S. l.]: IEEE Press, 2006: 4126-4131.
  • 4郭岩,白硕,杨志峰,张凯.网络日志规模分析和用户兴趣挖掘[J].计算机学报,2005,28(9):1483-1496. 被引量:62
  • 5Zhang Qirui, Zhang Ling, Dong Shoubin, et al. Document Indexing in Text Categorization[C]//Proceedings of the 4th International Conference on Machine Learning and Cybernetics. Guangzhou, China: [s. n.], 2005: 3792-3796.

二级参考文献14

  • 1郭岩.基于网络用户行为的搜索引擎系统SISI[J].计算机工程,2004,30(16):9-11. 被引量:1
  • 2龚蛟腾.元搜索引擎研究[J].情报杂志,2004,23(10):77-78. 被引量:15
  • 3叶弈乾 孔克勤.个性心理学[M].上海:华东师范大学出版社,1993.349,181.
  • 4Perkowitz M., Etzioni O.. Towards adaptive Web sites: Conceptual framework and case study. Artificial Intelligence, 2000, 118: 245~275.
  • 5Schechter S., Krishnan M., Smith M.D.. Using path profiles to predict HTTP requests. In: Proceedings of the 7th International World Wide Web Conference Computer, Networks and ISDN Systems, Brisbane, Australia, 1998, 30: 457~467.
  • 6Cooley R., Mobasher B., Srivastava J.. Data preparation for mining world wide Web browsing patterns. Knowledge and Information Systems, 1999, 1(1): 5~32.
  • 7化柏林.搜索引擎面面观-技术系列之一[EB].http://inux.ccidnet.com/pub/article/c322-a1 32826-pl.him/.2004-7-20 (Visit January.10.2006).
  • 8百度搜索引擎[EB].http://www.baidu.com(Visit January.11.2006).
  • 9Google搜索引擎[EB].http://www.goodle.com.cn(visit January.11.2006).
  • 10宋擒豹,沈钧毅.Web日志的高效多能挖掘算法[J].计算机研究与发展,2001,38(3):328-333. 被引量:115

共引文献63

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部