A Weight Matrix Clustering Algorithm (权值矩阵聚类算法)

Cited by: 2


Abstract: Earlier algorithms cannot cluster the pages that users are interested in well, so site visit frequency is introduced as a parameter and a new concept, the weighted association matrix, is proposed. A URL-UserID association matrix is built with the Web server's URLs as rows and UserIDs as columns. Unlike ordinary matrix clustering algorithms, the method then generates a weighted association matrix according to each user's degree of interest in each page, and uses it to discover similar user groups and similar Web pages. In practical experiments, compared with traditional matrix clustering algorithms, the algorithm shows higher identification accuracy, describes user vector features more accurately, and reflects site visits more accurately. It also paves the way for personalized recommendation services.
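The abstract does not give the interest formula or the clustering procedure, so the following is only a minimal sketch of the general idea under stated assumptions: interest is approximated as the share of a user's visits that go to a page, and similar users (matrix columns) are grouped by a cosine-similarity threshold. The function names, the sample log, and the threshold value are all hypothetical and are not taken from the paper.

```python
from math import sqrt

def build_weighted_matrix(visits):
    """Build a URL x UserID matrix from (user_id, url) log records.

    Assumption: the weight is the fraction of a user's visits spent on a URL.
    """
    counts = {}
    for user, url in visits:
        counts[(url, user)] = counts.get((url, user), 0) + 1
    urls = sorted({url for url, _ in counts})
    users = sorted({user for _, user in counts})
    totals = {v: sum(counts.get((u, v), 0) for u in urls) for v in users}
    # Rows are URLs, columns are UserIDs, as in the abstract.
    matrix = [[counts.get((u, v), 0) / totals[v] for v in users] for u in urls]
    return urls, users, matrix

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def group_similar(labels, vectors, threshold):
    """Greedy grouping: join the first group whose first member is similar enough."""
    groups = []  # each group is a list of indices; index 0 is the representative
    for i, vec in enumerate(vectors):
        for group in groups:
            if cosine(vectors[group[0]], vec) >= threshold:
                group.append(i)
                break
        else:
            groups.append([i])
    return [[labels[i] for i in group] for group in groups]

# Hypothetical access log: (UserID, URL) pairs.
visits = [
    ("u1", "/a"), ("u1", "/a"), ("u1", "/b"),
    ("u2", "/a"), ("u2", "/a"), ("u2", "/a"), ("u2", "/b"),
    ("u3", "/c"), ("u3", "/c"), ("u3", "/b"),
]

urls, users, matrix = build_weighted_matrix(visits)
user_vectors = [list(col) for col in zip(*matrix)]  # one column per user
user_groups = group_similar(users, user_vectors, threshold=0.8)
page_groups = group_similar(urls, matrix, threshold=0.8)  # one row per page
print(user_groups)
print(page_groups)
```

Clustering the rows groups similar pages and clustering the columns groups similar users, so the same routine serves both purposes. A plain 0/1 association matrix would treat a single visit and a hundred visits alike, which is the limitation the weighting is meant to address.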

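Authors: Liu Lina (刘丽娜), Sun Tieli (孙铁利)

Affiliation: School of Computer Science, Northeast Normal University (东北师范大学计算机学院)

Source: Computer Simulation (《计算机仿真》), CSCD, PKU Core Journal, 2009, No. 5, pp. 115-117, 149 (4 pages)

Keywords: clustering algorithms; degree of interest; association matrix; personalized recommendation

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]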


References (9)

1 J Srivastava, et al. Web usage mining: Discovery and applications of usage patterns from web data [J]. SIGKDD Explorations, 2000, 1(2): 12-23.
2 R Cooley, B Mobasher, J Srivastava. Data preparation for mining world wide web browsing patterns [J]. Knowledge and Information Systems, 1999, 1(1): 5-32.
3 B Mobasher, R Cooley. Creating adaptive Web sites through usage-based clustering of URLs [C]. Proc of the 1999 IEEE Knowledge and Data Engineering Exchange Workshop. New York: IEEE Press, 1999: 32-37.
4 G Paliouras, et al. Clustering the users of large web sites into communities [C]. Proc of the 17th Int Conf on Machine Learning. San Mateo: Morgan Kaufmann, 2000: 719-728.
5 Y Fu, K Sandhu, M Shih. A generalization-based approach to clustering of Web usage session [C]. Web Usage Analysis and User Profiling. New York: Springer-Verlag, 2000: 21-38.
6 C Shahabi, A M Zarski, J Shah. Knowledge discovery from users web-page navigation [C]. Proc of 7th Int Conf on Research Issues in Data Engineering. Birmingham: IEEE Computer Society Press, 1997: 20-29.
7 Su Zhong, Ma Shaoping, Yang Qiang, Zhang Hongjiang. Web document clustering based on Web-log mining (基于Web-Log Mining的Web文档聚类) [J]. Journal of Software (软件学报), 2002, 13(1): 99-104. Cited by: 29
8 M Perkowitz, O Etzioni. Adaptive websites: Automatically synthesizing Web pages [C]. Proc of AAAI 98. Madison: AAAI Press, 1998: 35-40.
9 Song Qinbao, Shen Junyi. An efficient multi-purpose mining algorithm for Web logs (Web日志的高效多能挖掘算法) [J]. Journal of Computer Research and Development (计算机研究与发展), 2001, 38(3): 328-333. Cited by: 115


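Citing documents (2)

1 Du Lijia, Dong Lili, He Hao, Shen Yanfen. Research on optimization of concurrent transaction scheduling algorithms for multidatabases (多数据库事务并发调度算法优化技术研究) [J]. Computer Simulation (计算机仿真), 2011, 28(2): 393-396. Cited by: 9
2 Li Xu, Liu Fang'ai, Liu Haoran. Analysis of a user interest degree algorithm for campus wireless LANs (校园无线局域网用户兴趣度算法分析) [J]. Journal of Shandong Normal University (Natural Science) (山东师范大学学报（自然科学版）), 2016, 31(1): 25-30.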

