期刊文献+

用户查询意图切分的研究 被引量:11

Study on Segmentation of User's Query Intents
下载PDF
导出
摘要 用户查询意图是指用户在构建查询时,希望搜索引擎能够返回的信息.如果搜索引擎可以判断用户当前查询与之前输入的查询是否属于同一查询意图,那么可以为用户提供更适当的查询建议、查询扩展或者个性化检索服务等.该文提出了基于点击相似度切分用户查询意图,在决策树模型和CRF模型上都取得了一定的提升.利用用户点击信息可以提高查询意图切分的效果,引入用户点击信息后,基于决策树的方法,F值提高1%,基于CRF模型的F值提高1.4%. Query intent denotes the specific information that user wants to look for when he sub- mit a query to a search engine. If search engine can judge whether the intent of current query is same as that of previous query, they would refine current query and offer users more accurate results and services, such as query suggestion, query expansion or personalized information retrieval. This paper proposed click data-based similarity, and this similarity is effective with decision tree model and CRF model. Users' click data is helpful for this task, with users' click data, F-measure is improved by 1% and 1.4% for decision tree-based method and CRF-based method respectively.
作者 江雪 孙乐
出处 《计算机学报》 EI CSCD 北大核心 2013年第3期664-670,共7页 Chinese Journal of Computers
基金 国家自然科学基金(60736044,90920010) 重大科技专项经费(2010ZX01037-001-002)资助~~
关键词 信息检索 查询日志 查询意图切分 information retrieval query log segmentation of query intents
  • 相关文献

参考文献2

二级参考文献33

  • 1余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 2Bin Tan, Fuchun Peng. Unsupervised query segmentation using generative language models and Wikipedia[C]//Proceeding of the 17th international conference on World Wide Web. Beijing, China, 2008:347-356.
  • 3Craig Silverstein, Monika Henzinger, Hannes Marais, et al. Analysis of a very large Web search engine query log[J]. In SIGIR Forum, fall 1998, 33(1):6-12.
  • 4Daqing He, Ays, e Goker. Detecting session boundaries from Web user logs[C]//Proceedings of the 22nd annual colloquium on information, 2000.
  • 5H. Cenk Ozmutlu , Fatih cavdur, Application of automatic topic identification on excite web search engine data logs.[J]Information Processing and Management: an International Journal, 2005, 41(5) : 1243-1262.
  • 6Jing Bai, Jian-Yun Nie, Guihong Cao, Hugues Bouchard. Using query contexts in information retrieval[J]. SIGIR'07, July 23-27, 2007.
  • 7Jinhui Yuan, Huiyi Wang, Lan Xiao, Wujie Zheng, Jianmin Li, Fuzong Lin, and Bo Zhang. A Formal Study of Shot Boundary Detection. [C]//IEEE transactions on circuits and systems for video technology, VOL. 17, NO. 2, pp. 168-186. February 2007.
  • 8Qingsong Yao, Xiangji Huang and Aijun An. Applying Language Modeling to Session Identification from Database Trace Logs[C]//Knowledge and Information Systems, 2006-Springer.
  • 9S Ozmutlu, F Cavdur. Neural network applications for automatic new topic identification[J]. Online Information Review,2005, 29(1):34-53.
  • 10Seda Ozmutlu, H. Cenk Ozmutlu, Amanda Spink. Automatic New Topic Identification in Search Engine Transaction Logs using Multiple Linear Regression [C]//Proceedings of the 41st Hawaii International Conference on System Sciences. 2008: 140.

共引文献124

同被引文献66

  • 1张鹏飞,李赟,刘建毅,钟义信.基于相对词频的文本特征抽取方法[J].计算机应用研究,2005,22(4):23-26. 被引量:9
  • 2余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 3董振东.[EB/OL].知网http://www.keenage.com,1999.
  • 4Crammer K, Gentile C. Multiclass classification with ban- dit feedback using adaptive regularization [ J ]. Machine Learning,2013,90:357 - 383.
  • 5Wenbin Zheng, Lixin An, Zhanyi Xu. Dimensionality Re- duction by Combining Category Information and Latent Semantic Index for Text Categorization [ J]. Journal of In- formation & Computational Science, 2013,10 ( 8 ) : 2463 - 2469.
  • 6Bin Zhang, Alex Marin, Brian Hutchinson. Learning Phrase Patterns for Text Classification [ J ]. IEEE Trans- actions on audio, speech, and language processing,2013, 21 (6) :1180 - 1189.
  • 7Baccianella S, Esuli A, Sebastiani F. Using micro-docu- ments for feature selection: The case of ordinal text classi- fication [ J ]. Expert Systems with Applications, 2013,40 : 4687 - 4696.
  • 8Djeddi C, Siddiqi I, Souici-Meslati L. Text-independent writer recognition using multi-script handwritten texts [ J ]. Pattern Recognition Letters,2013,34 : 1194 - 1202.
  • 9刘群,李素建.基于《知网》的词汇语义相似度计算[J].计算语言学及中文信息处理,2002,7:59-76.
  • 10Bahojb I M, Reza K M, Reza A. A novel embedded fea- ture selection method:Acomparative study in the applica- tion of text categorization [ J ]. Applied Artificial Intelli- gence ,2013,27(5) :408 -427.

引证文献11

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部