Funding: the National Natural Science Foundation of China (No. 61702526), the Defense Industrial Technology Development Program of China (No. JCKY2017204B064), and the National Advanced Research Project of China (No. 6141B0801010b).
Abstract: Query auto-completion (QAC) facilitates query formulation by predicting completions for a given query prefix. Most web search engines use behavioral signals to customize query completion lists for users. To be effective, such personalized QAC models rely on access to sufficient context about each user's interests and intentions. Hence, they often suffer from data sparseness problems. For this reason, we propose the construction and application of cohorts to address context sparsity and to enhance QAC personalization. We build an individual's interest profile by learning his or her topic preferences through topic models and then aggregate users who share similar profiles. As conventional topic models are unable to learn cohorts automatically, we propose two cohort topic models that handle topic modeling and cohort discovery in the same framework. We present four cohort-based personalized QAC models that employ four different cohort discovery strategies. Our proposals use cohorts' contextual information together with query frequency to rank completions. We perform extensive experiments on the publicly available AOL query log and compare the ranking effectiveness with that of models that discard cohort contexts. Experimental results suggest that our cohort-based personalized QAC models can solve the sparseness problem and yield significant relevance improvements over competitive baselines.
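As a rough illustration of ranking completions by cohort context together with query frequency, the following minimal Python sketch interpolates two normalized counts. It is not the authors' model: the linear mixing rule, the weight `lam`, the function name `rank_completions`, and the plain count dictionaries are all assumptions made for illustration, and the abstract does not specify how the two signals are actually combined.

```python
def rank_completions(prefix, global_counts, cohort_counts, lam=0.5, k=10):
    """Rank completions of `prefix` by a mix of global and cohort frequency.

    Hypothetical scoring rule (an assumption, not taken from the paper):
        score(q) = lam * P_global(q) + (1 - lam) * P_cohort(q)
    where both probabilities are normalized over the candidate set.
    """
    # Candidates are all logged queries that start with the typed prefix.
    candidates = [q for q in global_counts if q.startswith(prefix)]
    if not candidates:
        return []

    g_total = sum(global_counts[q] for q in candidates)
    c_total = sum(cohort_counts.get(q, 0) for q in candidates)

    def score(q):
        g = global_counts[q] / g_total
        # If the cohort has never issued any candidate, its signal is zero
        # and the ranking falls back to global frequency alone.
        c = cohort_counts.get(q, 0) / c_total if c_total else 0.0
        return lam * g + (1 - lam) * c

    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(candidates, key=lambda q: (-score(q), q))[:k]


# Toy counts, invented for illustration only.
global_counts = {"java tutorial": 50, "java island": 30, "jaguar car": 20}
travel_cohort = {"java island": 8, "java tutorial": 2}

ranked = rank_completions("java", global_counts, travel_cohort, lam=0.5)
```

In this toy setup, a user with little personal history inherits the preferences of the cohort they belong to, so the cohort's frequent query can outrank a globally more popular one; setting `lam=1.0` discards the cohort context and reduces the ranker to a frequency-only baseline.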