Abstract
Analyzing the access sequences of Web users can reveal their hobbies, interests, and habits, and provides the information needed to upgrade and revise Web sites. This article proposes a data mining method based on analysis of user access sequences. The method uses Web page duration time as a parameter to reduce the number of pages in each session sequence and to compress the size of the frequent traversal sequences. Experimental results show that the algorithm reduces the cost of mining and offers a useful reference for mining commercial data about Web users.
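The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each session is a list of (page, duration in seconds) pairs, drops pages whose duration falls below a threshold, and then counts contiguous page subsequences across the reduced sessions. The names `prune_session`, `frequent_sequences`, `min_dwell`, and `min_support`, and the sample data, are hypothetical.

```python
from collections import Counter


def prune_session(session, min_dwell):
    """Keep only pages whose duration (seconds) is at least min_dwell.

    Assumption: short visits are treated as navigation steps, not content of interest.
    """
    return [page for page, dwell in session if dwell >= min_dwell]


def frequent_sequences(sessions, length, min_support):
    """Return contiguous page subsequences of the given length that occur
    in at least min_support sessions (each session counted once)."""
    counts = Counter()
    for pages in sessions:
        seen = {
            tuple(pages[i:i + length])
            for i in range(len(pages) - length + 1)
        }
        counts.update(seen)
    return {seq: n for seq, n in counts.items() if n >= min_support}


# Hypothetical sessions: (page, duration in seconds)
sessions = [
    [("home", 40), ("nav", 2), ("product", 95), ("cart", 30)],
    [("home", 35), ("product", 80), ("nav", 1), ("cart", 25)],
    [("home", 50), ("nav", 3), ("about", 60)],
]

pruned = [prune_session(s, min_dwell=5) for s in sessions]
result = frequent_sequences(pruned, length=2, min_support=2)
print(pruned)
print(result)
```

In this toy data, removing the short `nav` visits makes the first two sessions identical, so fewer distinct candidate sequences need to be counted. That is the kind of size reduction the abstract describes, though the choice of threshold here is arbitrary.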
Source
Computer Engineering (《计算机工程》)
CAS
CSCD
PKU Core (北大核心)
2010, No. 24, pp. 45-47, 50 (4 pages)
Funding
Supported by the Natural Science Research Program of Jiangsu Higher Education Institutions (Grant No. 06KJB520022)
Keywords
Web page duration time
data mining
sequence