期刊文献+

基于搜索引擎日志发现相近Web查询 被引量:4

Discovering Related Web Queries Based on Search Engine's User Log
下载PDF
导出
摘要 提出了一种利用搜索引擎日志发现高质量相近Web查询的新方法.对一个给定的查询,从日志中抽取候选查询的一些量化指标,如被查询的不同用户量、被查询的次数、用户在反馈结果中的点击次数、与给定查询间的共有词项个数、点击相同URL的个数及其分布等,用手工标记部分训练数据,进而建立一个发现有较好反馈结果的相近查询的回归模型.实验显示用该方法可得到较高的结果精度.
机构地区 北京大学
出处 《北京邮电大学学报》 EI CAS CSCD 北大核心 2005年第z1期44-48,共5页 Journal of Beijing University of Posts and Telecommunications
基金 国家自然科学基金项目(60435020) 教育部博士点基金项目(20030001076) 中国博士后科学基金项目(2004036182)
  • 相关文献

参考文献11

  • 1[1]CNMC.第十五次中国互联网络发展状况统计报告(2005/1)[EB/OL]. http:∥www. cnnic. net. cn/, 2005-01.CN NIC. The fifteenth statistical report on the development of China Internet network information [EB/OL]. http:∥www. cnnic. net. cn/. 2005- 01.
  • 2[3]Cui Hang, Wen Jirong, Nie Jianyun, et al. Query expansion by mining user logs[J]. IEEE Transactions on Knowledge and Data Engineering, 2003,15(4): 829 - 839.
  • 3[4]Baeza-Yates R, Ribeiro-Neto B. Modern information retrieval [M]. England: Addison-Wesley, 1999.
  • 4[5]Wen Jirong, Nie Jianyun, Zhang Hongjiang. Query clustering using user logs[A]. Proceedings of the 10th World Wide Web Conference[C]. New York: ACM Press, 2001. 162 - 168.
  • 5[6]Beeferman D, Berger A. Agglomerative clustering of a search engine query log [A]. Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[C]. 2000. 407-415.
  • 6[7]Chan W S, Leung W T, Lee D L. Clustering search engine query log containing noisy clickthroughs[A].Proceedings of the 2004 International Symposium on Applications and the Internet[C]. 2004. 305 -308.
  • 7[8]Fonseca B M, Golgher P B, De Moura E S, et al.Using association rules to discovery search engines related queries [A]. First Latin American Web Congress[C]. Santiago, Chile, 2003. 66-71.
  • 8[9]Zeng Huajun, He Qicai, Chen Zheng, et al. Learning to cluster Web search results [A]. Proceedings of SIGIR'04[C]. New York: ACM Press, 2004. 210-217.
  • 9[11]Xie Yinglian, O'Hallaron D. Locality in search engine queries and its implications for caching[A]. Proc IEEE Infocom 2002 [C]. 2002. 1238-1247.
  • 10[12]Kang I H, Kim G. Query type classification for Web document retrieval[A]. Proceedings of SIGIR ' 03 [C].New York: ACM Press, 2003. 64-71.

同被引文献50

  • 1余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 2第23次中国互联网络发展状况统计报告[EB].http:∥www.cnnic.net.cn/index/0E/00/11/index.htm,2009-04-05.
  • 3Broder A.Taxonomy of web search.SIGIR Forum,2002,36(2):3-10.
  • 4Silverstein C,Marais H,Henzinger M,et al.Analysis of a very large web search engine query log.SIGIR Forum,1998,33(1):6-12.
  • 5Ruthven I,Lalmas M.A survey on the use of relevance feedback for information access systems.Knowledge Engineering Review,2003,18(2):95-145.
  • 6Beeferman D,Berger A L.Agglomerative clustering of a search engine query log.In:Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,Boston,USA,2000.407-416.
  • 7Jones R.Generating query substitutions.In:Proceedings of the 15th International Conference on World Wide Web,Edinburgh,Scotland,2006.387-396.
  • 8Zhang Z Y,Nasraoui O.Mining search engine query logs for query recommendation.In:Proceedings of the 15th International Conference on World Wide Web,Edinburgh,Scotland,2006.1039-1040.
  • 9Shen X,Tan B,Zhai C.Context-sensitive information retrieval using implicit feedback.In:Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,Salvador,Brazil,2005.43-50.
  • 10Cucerzan S,White R W.Query suggestion based on user landing pages.SIGIR Forum,2007,875-876.

引证文献4

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部