Abstract
A user's interest in a website can be inferred from the order in which the user browses its pages, and the site's access logs record the details of each page visit. This paper introduces background on web access log mining, defines new measures of interest and similarity together with a new clustering center, and proposes a path clustering algorithm based on users' access interest. Experimental results show that the algorithm is effective.
Source
《计算机应用与软件》
CSCD
Peking University Core Journal
2008, No. 8, pp. 205-206, 226 (3 pages)
Computer Applications and Software
Keywords
Web access information
Access interest
Path clustering