摘要
用户在使用Web搜索引擎进行信息查询时,可能包含单个或多个主题。该文针对大规模中文搜索引擎系统——北大天网的多任务Web查询,进行了研究和分析。结果显示:多于1/3的用户进行多任务Web查询;超过1/2的多任务会话包含2个不同的主题并进行2~7次查询;多任务会话时间的均值是一般会话时间均值的2倍;天网用户的多任务查询主要有3个主题:计算机,娱乐和教育;近1/4的多任务会话中包含不确定的信息。该文用关联分析的方法发现了用户查询主题之间的一些关系。
The Web queries submitted to search engines usually involves single or multiple topics. This paper investigates and analyzes the characteristics of multitasking Chinese Web searches based on the query logs of Tianwang system, a large-scale Chinese search engine. The results shows: More than one third of users often perform multitasking Web searching; More than half of multitasking sessions include two topics, with two to seven queries per session; The mean duration of multitasking sessions is twice that of regular sessions; Most multitasking searches in Tianwang systems have three topics: computer/network, entertainment and education; Nearly one fourth of multitasking sessions include inexplicit information. In addition, this paper also carries out the analysis of association rule and finds some relationships among session topics.
出处
《计算机工程》
CAS
CSCD
北大核心
2006年第14期25-26,68,共3页
Computer Engineering
基金
国家自然科学基金资助重点项目(60435020)
教育部博士点基金(20030001076)
中国博士后科学基金(2004036182)