期刊文献+

基于搜索引擎日志的用户查询意图分类 被引量:6

User Query Intention Classification Based on Search Engine Log
下载PDF
导出
摘要 为了搜索引擎能提供高质量检索,提出了一种查询意图自动分类模型。该模型将用户查询分为咨询、学术、资源、服务、导航和热点6类,建立了一套查询意图分类体系;在传统搜索引擎系统中加入查询意图处理模块,通过对用户查询意图的查询词信息(Qi)、点击URL信息(Cu)和基于某分类的URL点击排序(Cr)3个分类特征进行统计分析,提取其特征向量,进而推断出用户查询意图。通过在Sogou数据集上的试验表明,各类信息的查询分类效果F值均大于0.8,取得了较好的分类效果。 To provide high quality search for the search engine, a query intention automatic classification model is proposed. In the model, user query is classified into consultation, academia, resource, service, navigation and hotpot six types, and a suit of query intention classification system is constructed. The query intention process module is added into the traditional search engine system. Three classification features of the user query intention include query information (Qi), clicked URL information (Cu) and sequence of clicked URLs based on a classification (Cr) are counted and analyzed, and the feature vector is extracted. Thus, the user query intention is concluded. The experiment on Sogou dataset shows that each type of information′s value F of query classification effect is more than 0.8. Therefore, the model achieves great classification effect.
作者 杨杰 徐越 余建桥 蒋建华 YANG Jie;XU Yue;YU Jianqiao;JIANG Jianhua(College of Mobile Telecommunications, Chongqing University of Posts and Telecommunications, Chongqing 401520, China;Unit 31102 of PLA, Nanjing 210016, China;School of Computer Engineering, Chongqing College of Humanities, Science & Technology, Chongqing 401524, China)
出处 《指挥信息系统与技术》 2019年第2期74-79,共6页 Command Information System and Technology
基金 国家自然科学基金(61303267)资助项目
关键词 信息搜索 分类 查询意图 支持向量机 information search classification query intention support vector machine
  • 相关文献

参考文献6

二级参考文献123

  • 1高洁,吉根林.文本分类技术研究[J].计算机应用研究,2004,21(7):28-30. 被引量:36
  • 2牛强,王志晓,陈岱,夏士雄.基于支持向量机的Web文本分类方法[J].微电子学与计算机,2006,23(9):102-104. 被引量:19
  • 3凌波,周水庚,周傲英.P2P信息检索系统的查询结果排序与合并策略[J].计算机学报,2007,30(3):405-414. 被引量:13
  • 4Broder A. A taxonomy of Web search[C]//SIGIR Forum. New York, N Y, USA: ACM Press, 2002 : 3-10.
  • 5Rose D E, Levinson D. Understanding user goals in web search [C] //WWW ' 04 : Proceedings of the 13the international confe- rence on World Wide Web. New York, N Y, USA: ACM Press, 2004: 13-19.
  • 6Jansen B J,Booth D L,Spink A. Determining the user intent of Web search engine queries[C] // Williamson CL, Zurko ME, Patel-Schneider PF,et al. , eds. Proc. of the 16th Int'l Conf. on World Wide Web. New York: ACM Press, 2007:1149-1150.
  • 7Ricardo A, Liliana C B, Cristina N. The intention behind Webqueries[C]//Crestani F, Ferragina P, Sanderson M, eds. Proc. of the 13th Int'l Conf. on String Processing and Information Re- trieval (SPIRE 2006 ). Berlin, Heidelberg: Springer-Verlag, 2006 :98-109.
  • 8Qi G, Eugene A. Exploring mouse movements for inferring que- ry intent[-C]//Myaeng SH, Oard DW, Sebastianj F, et al. , eds. Proc. of the 31st Annual Int' 1 ACM SIGIR Conf. on Research and Development in Information Retrieval. 2008:707-708.
  • 9Liu YQ, Fu Y P, Zhang M, et al. Automatic search engine per- formance evaluation with click-through data analysis[C] ffWil- liamson CL, Zurko ME, Patel-Schneider PF, et al. , eds. Proc. of the 16th Int'l Conf. on World Wide Web. New York: ACM Press, 2007 : 1133-1134.
  • 10Broder A, Fontoura M, Gabrilovich E, et al. Robust classification of rare queries using Web knowledge [C] //Proc of ACM SIGIR 2007. New York: ACM, 2007: 231-238.

共引文献62

同被引文献30

引证文献6

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部