期刊文献+

一种网络日志挖掘的高效算法 被引量:2

An Efficient Algorithm with Incremental Data Mining for Web Usage Mining
下载PDF
导出
摘要 提出了一种网络日志挖掘算法PWU,其采用了异构树结构。通过对异构树叶子节点进行编号,使得对候选集计数时只需对具有相同编号的叶子节点进行计数,极大地简化了候选集计数过程。在此基础上,算法还具有增量挖掘功能。最后,从理论分析和实验两方面证明了算法的高效性以及增量挖掘功能的高效性和完备性。 Mining server access data can provide significant and useful information. This paper presents an algorithm called PWU,which adopts the data structure named Heterogeneity. The data structure uses a set of rules to number the branches of the Heterogeneity Tree. The rules simplify the process of counting the support of candidates. Finally ,the completeness of the mined set and efficiency of the algorithm PWU are proved both in theory and experiment.
作者 张兵
出处 《广西师范大学学报(自然科学版)》 CAS 北大核心 2006年第1期26-29,共4页 Journal of Guangxi Normal University:Natural Science Edition
基金 国家自然科学基金资助项目(60463003) 北京市教育委员会科技发展计划项目(KM200510016002)
关键词 网络日志挖掘PWU算法 增量挖掘 Web usage mining PWU algorithm incremental data mining
  • 相关文献

参考文献10

  • 1COOLEY R,MOBASHER B,SRIVASTAVA J.Web mining:information and pattern discovery on the world wide Web[C]//Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence.Los Alamitos,Calif.:IEEE Computer Society,1997:558-567.
  • 2PERKOWITZ M,ETZIONI O.Adaptive web sites:Conceptual cluster mining[C]//Proceedings of the 16th Int.Joint Conf.on Artificial Intelligence.San Francisco,Calif.:Morgan Kaufmann Publishers,1999:264-269.
  • 3TAUSCHER L,GREENBERG S.How people revisit web pages:Empirical findings and implications for the design of history systems[J].International Journal of Human-Computer Studies,1997,47:97-137.
  • 4ZAIANE O R,XIN Man,HAN Jia-wei.Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs[C]//Proc.of the 5th International Forum on Research and Technology Advances in Digital Libraries.Los Alamitos,Calif.:IEEE Computer Society,1998:19-29.
  • 5TAMAKRISHNAN Srikant,RAKESH Agrawal.Mining sequential patterns:generalizations and performance improvements[C]//Proceedings of the 5th International Conference on Extending Database Technology.Berlin:Springer-Verlag,1996:3-17.
  • 6苏毅娟,严小卫.一种改进的频繁集挖掘方法[J].广西师范大学学报(自然科学版),2001,19(3):22-26. 被引量:10
  • 7刘美玲,徐章艳,卢景丽,区玉明,袁鼎荣,吴信东.利用项集有序特性改进Apriori算法[J].广西师范大学学报(自然科学版),2004,22(1):33-37. 被引量:11
  • 8NEUSS C,VROMAS J.Applications CGI en Perl pour les Webmasters[M].Paris:International Thomson Publishing France,1996.
  • 9张兵,聂永红,林士敏.NPSP:一种高效的序列模式增量挖掘算法[J].广西师范大学学报(自然科学版),2004,22(4):22-26. 被引量:4
  • 10SRIKANT R,AGRAWAL R.Mining Generalized Association Rules[J].Future Generation Computer Systems,1997,13(2/3):161-180.

二级参考文献20

  • 1苏毅娟,严小卫.一种改进的频繁集挖掘方法[J].广西师范大学学报(自然科学版),2001,19(3):22-26. 被引量:10
  • 2Agrawal R ,Imielinski T ,Swami A. Mining association rules between sets of items in large databases[A]. Proceedings of the ACM SIGMOD international conference on management of data[C]. New York:ACM Press, 1993. 207-216.
  • 3Jong Soo Park,Ming-Syan Chen ,Philip S Yu. Using a hash based method with transaction trimming for mining association rules [J]. IEEE Transactions on Knowledge and Data Engineering, 1997,9(5):813-825.
  • 4Agrawal R,Srikant R. Fast algorithms for mining association rules[A]. Proceedings of the 20th VLDB conference[C].San Mateo:Morgan Kaufmann Publishers,1994.487-499.
  • 5Han J,Pei J,Yin Y. Mining frequent patterns without candidate generation[A]. Proceedings of the 2000 ACM SIG-MOD international conference on management of data[C]. New York:ACM Press,2000.1-12.
  • 6Agrawal Rakesh,Srikant Ramakrishnan.Mining sequential patterns[A].Proceedings of the 11th international conference on data engineering[C].Los Alamitos,CA:IEEE Computer Society Press,1995.3-14.
  • 7Srikant Ramakrishnan,Agrawal Rakesh.Mining sequential patterns:generalizations and performance improvements[A].Proceedings of the 5th international conference on extending database technology[C].Berlin:Springer-Verlag,1996.3-17.
  • 8Masseglia F,Cathala F,Poncelet P.The PSP approach for mining sequential patterns[A].Proceedings of the 2nd European symposium on principles of data mining and knowledge discovery[C].Berlin:Springer-Verlag,1998.176-184.
  • 9Mueller A.Fast sequential and parallel algorithms for association rule mining:a comparison(technical report CS-TR-3515)[R].College Park:University of Maryland,1995.
  • 10Agrawal R,Srikant R.Fast algorithms for mining association rules in large databases[A].Proceedings of the 20th international conference on very large databases[C].San Mateo:Morgan Kaufmann Publishers,1994.487-499.

共引文献22

同被引文献11

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部