期刊文献+

用页组拓扑平均距离改善页面聚类算法 被引量:1

Enhanced Algorithm for Page Clustering by Using Topology Average Distance of Web Pages Group
下载PDF
导出
摘要 提出一种支持站点结构优化的页面聚类改进算法,通过引入图论中的拓扑平均距离,量化评估与挖掘站点结构中访问效率较低的内容文档集合为结构优化的兴趣页组,挖掘的页组具有更高的兴趣性,并将兴趣页组挖掘算法融入到拓扑优化算法中。实验结果表明改进算法能更好地优化站点结构,较一般算法收敛性好。 An enhanced algorithm which supports Website structure optimization was proposed for page clustering. A quantitative criteria was proposed by introducing the average distance in graph and the low access efficiency Web content pages group was discovered as interesting page group for Website structure optimization. Thus, the enhanced algorithm can find out more interesting page groups than the normal algorithm. Meanwhile the mining algorithm was integrated into the topology optimization algorithm. Experiment results show that the enhanced algorithm can improve Website structure better and it converges more rapidly.
出处 《计算机科学》 CSCD 北大核心 2008年第10期200-203,共4页 Computer Science
基金 国家自然科学基金项目(70672097) 国家自然科学基金重点项目(70631003)
关键词 WEB使用挖掘 页面聚类 频繁访问页组 自适应站点 Web usage mining,Page clustering,Frequently visited page group,Self-adaptive Website
  • 相关文献

参考文献13

  • 1Jaideep S, Robert C, Mukund D, et al. Web usage mining:discovery and applications of usage patterns from Web data[C]. ACM SIGKDD Explorations Newsletter, 2000,1 (2) : 12-23
  • 2陈新中,李岩,杨炳儒,谢永红,张运涛.Web日志挖掘技术进展[J].系统工程与电子技术,2003,25(4):492-495. 被引量:17
  • 3刘业政,林文龙,焦宁,姜元春.WEB站点结构优化仿真[J].系统仿真学报,2007,19(20):4685-4688. 被引量:5
  • 4Wu K L, Yu P S, Ballman A. SpeedTracer: a Web usage mining and analysis tool[J]. IBM System Journal(S0018-8670), 1998, 37(1) :89-105
  • 5Perkowitz M, Etzioni O. Adaptive Web sites:automatically synthesizing Web pages//Fifteenth National Conference on Artificial Intelligence. Madison, 1998
  • 6杨怡玲,管旭东,尤晋元.基于页面内容和站点结构的页面聚类挖掘算法[J].软件学报,2002,13(3):467-469. 被引量:20
  • 7Fayyad U M,Piatetsky S G,Smyth P. The KDD process for extracting useful knowledge from volumes of data[J]. Communications of the ACM,1996,39(11):27-34
  • 8Fu Y,Creado M,Shih M Y. Adaptive Web Sites by Web Usage Mining//International Conference on Internet Computing 2001. Las Vegas, USA, 2001
  • 9Doyle J K, Graver J E. Mean distance in a graph[J]. Discrete Math. , 1977,17:147-154
  • 10West D B. Introduction to Graph Theory (2nd Edition) [M].Published by Prentice Hall 1996,2001:67-107

二级参考文献33

  • 1[1]Cooley,R.,Srivastava,J.Data preparation for mining World Wide Web browsing patterns.Journal of Knowledge and Information Systems,1999,1(1):5~32.
  • 2[2]Fayyad,U.M.,Piatetsky-Shapiro,G.,Smyth,P.The KDD process for extracting useful knowledge from volumes of data.Communications of the ACM,1996,39(11):27~34.
  • 3[3]Mobasher,B.,Jain,N.,Han,E.H.,et al.Web mining:pattern discovery and from World Wide Web transactions.Technical Report,96-050,University of Minnesota,1996.
  • 4[4]Wu,K.L.,Yu,P.S.,Ballman,A.SpeedTracer:a web usage mining and analysis tool.IBM System Journal,1998,37(1):89~105.
  • 5Bunchner A G, Mulvenna M D. Discovering Internet Marketing Intelligence Through Online Analytical Web Usage Mining[ J ]. SIGMOD.Record, 1998, 27(4).
  • 6Chen M S, Park J S. Efficient Data Mining For Traversal Patterns[J].IEEE. Trans. Knowledge and Data Engineering, 1998, 10(2).
  • 7Borges J, Levene M. Data Mining of User Navigation Patterns[C]. In:Proc Web Usage Analysis and User Profiling Workshop, San Diego,California, 1999.
  • 8Myra Spiliopoulou, Faulstich Lukas C.WUM: A Tool for Web Utilization Analysis[M]. In EDBT Workshop WebDB'98, Valencia,Spain Springer Verbs, 1998.
  • 9Zaiane R. Xin M, Han J. Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology oa Web Logs[C].Proc. Advances in Digital Libraries Conf. (ADL'98), Santa Barbara,CA, April 1998:19-29.
  • 10Cyrus Shahabi, Amir Zalkesh, Jafar Adibi, et al. Knowledge Discovery from Users Wen-Page Navigatioa [ C ]. In Proceedings of the IEEE RIDE97 Workshop, April 1997.

共引文献37

同被引文献8

  • 1Hou. J. Zhang. Y.. 2003. Utilizing Hyperlink Transitivity to Improve Web Page Clustering. In Proceedings of the 14th Australasian Database Conference-Volume 17 (Adelaide, Australia). K. Schewe, X. Zhou, Eds. ACM International Conference Proceeding Series, vol. 143. Australian Computer Society, Darlinghurst, Australia, 49-57.
  • 2Cutting, D. R., Karger, D. R., Pedersen, J. O., Tukey, J. W. 1992. Scatter/Gather: A Cluster-Based Approach to Browsing Large Document Collections. In Proceedings of the 15th Annual international ACM SIGIR Conference on Research and Development in Information Retrieval (Copenhagen, Denmark, June 21 - 24, 1992). N. Belkin, P. Ingwersen, A. M. Pejtersen, Eds. SIGIR "92. ACM, New York, NY, 318-329.
  • 3Zamir O,Etzioni O.Web Document Clustering:A Feasibility Demonstration[C].Proceedings of the 17th International ACM SIGIR Conference on Research and Development of Information Retrieval, 1998:46-54.
  • 4计算所汉语词法分析分析系统ICTCLAS.http://www.nlp.org.cn/project/project.php?proj_id=6.
  • 5G.Salton, A Wong, C Yang. A Vector Space Model for Automatic Indexing[J]. Communications of the ACM, 1998, 18 (11): 613-620.
  • 6Wang Y., Kitsuregawa. M.. 2002. Evaluating Contents-Link Coupled Web Page Clustering for Web Search Results. In Proceedings of the Eleventh International Conference on Information and Knowledge Management (McLean, Virginia, USA, November 04 - 09, 2002). CIKM '02. ACM, New York, NY, 499-506.
  • 7A. Strehl, E. Strehl, J. Ghosh, R. Mooney. Impact of Similarity Measures on Web-Page Clustering. In Workshop on Artificial Intelligence for Web Search (AAAI 2000), 2000, pp.58-64.
  • 8谢艳玲,何丕廉,于鷃,孙越恒.一种高效的网页聚类方法[J].计算机工程与设计,2007,28(17):4229-4232. 被引量:7

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部