Server Session Constraint-based Serial Pattern Growth Mining Research

Cited by: 1


Abstract: For serial (sequential) pattern mining in Web usage mining (WUM), a server session constraint-based serial pattern growth mining algorithm is proposed. First, to better mine patterns and discover knowledge from Web server log files, a server session-based server log file format is proposed. Next, a server session-based constraint is introduced. Because this constraint reduces the size of the initial sequence patterns and of the candidate itemsets, it shrinks the suffix database that has to be scanned in each pass. The serial patterns of frequent access paths in WUM are then mined from the preprocessed log files. Finally, two experiments demonstrate the validity and superiority of the proposed algorithm.
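The abstract does not give the log format, the exact constraint, or the algorithm's details, so the following is only a minimal sketch of the general idea: group log records into sessions, filter the sessions with a constraint, then grow patterns over progressively smaller suffix databases in the style of PrefixSpan. Everything specific here is an assumption for illustration and not the authors' method: the `(client, timestamp, page)` record layout, the 30-minute `SESSION_TIMEOUT`, the function names `build_sessions` and `prefix_span`, and the `min_session_len` filter standing in for the server session constraint.

```python
from collections import defaultdict

SESSION_TIMEOUT = 30 * 60  # seconds; an assumed cut-off, not taken from the paper


def build_sessions(records, timeout=SESSION_TIMEOUT):
    """Group (client, timestamp, page) records into per-client page sequences.

    A new session starts whenever two consecutive requests from the same
    client are more than `timeout` seconds apart.
    """
    by_client = defaultdict(list)
    for client, ts, page in sorted(records, key=lambda r: (r[0], r[1])):
        by_client[client].append((ts, page))

    sessions = []
    for visits in by_client.values():
        current = [visits[0][1]]
        for (prev_ts, _), (ts, page) in zip(visits, visits[1:]):
            if ts - prev_ts > timeout:
                sessions.append(current)
                current = []
            current.append(page)
        sessions.append(current)
    return sessions


def prefix_span(sessions, min_support, min_session_len=2):
    """Mine frequent access paths by pattern growth over suffix databases.

    `min_session_len` is a hypothetical stand-in for a session constraint:
    sessions shorter than it are dropped before mining, which shrinks the
    initial database and every suffix database derived from it.
    """
    database = [s for s in sessions if len(s) >= min_session_len]
    patterns = {}

    def grow(prefix, projected):
        # Count each page at most once per suffix sequence.
        counts = defaultdict(int)
        for suffix in projected:
            for page in set(suffix):
                counts[page] += 1
        for page, support in counts.items():
            if support < min_support:
                continue
            pattern = prefix + (page,)
            patterns[pattern] = support
            # Project: keep what follows the first occurrence of `page`.
            next_db = [s[s.index(page) + 1:] for s in projected if page in s]
            grow(pattern, next_db)

    grow((), database)
    return patterns


records = [
    ("10.0.0.1", 0, "/home"), ("10.0.0.1", 60, "/list"), ("10.0.0.1", 120, "/item"),
    ("10.0.0.2", 10, "/home"), ("10.0.0.2", 70, "/item"),
    ("10.0.0.3", 5, "/home"), ("10.0.0.3", 4000, "/list"), ("10.0.0.3", 4050, "/item"),
]
frequent_paths = prefix_span(build_sessions(records), min_support=2)
```

Note that dropping short sessions also lowers the support counted for single-page patterns, so a filter like this is only reasonable when the paths of interest contain at least `min_session_len` pages. Each recursive call scans a suffix database that is no larger than its parent's, which is where a constraint that removes sessions early saves work.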

Authors: Cai Hongguo (蔡宏果), Yuan Chang'an (元昌安), Luo Jinguang (罗锦光), Zhang Zengyin (张增银), Shi Yabing (石亚冰)

Affiliation: College of Computer and Information Engineering, Guangxi Teachers Education University (广西师范学院)

Source: Journal of Zhengzhou University: Natural Science Edition (郑州大学学报（理学版）), indexed in CAS and the Peking University Core Journals list, 2010, No. 1, pp. 24-28 (5 pages)

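Funding: National Natural Science Foundation of China (Grant No. 60763012); Guangxi Scientific Research and Technology Development Program, Major Project (No. 0815007-1-15); Guangxi Graduate Student Innovation Program (No. 2009106030774M03)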

Keywords: serial pattern; server log file; server session; Web usage mining; data mining

Classification code: TP392 [Automation and Computer Technology: Computer Application Technology]


