一种改进的缺失数据协同过滤推荐算法被引量：2

An improved collaborative filtering recommendation algorithm for missing data

下载PDF

导出

摘要协同过滤推荐算法是推荐系统研究的热点,近年来,在亚马逊、淘宝等商业系统中获得应用。在实际应用过程中,协同过滤推荐面临数据稀疏和准确性低的问题。作为推荐基础的用户-产品(项目)矩阵通常非常稀疏(存在大量缺失数据),从而导致推荐结果不准确。文章试图在缺失数据情况下提高协同过滤推荐的准确性,聚焦以下两个方面:(1)用户相似度、产品(项目)相似度计算;(2)缺失数据预测。首先,用增强的皮尔森相关系数算法,通过增加参数,对相似度进行修正,提高用户、产品(项目)相似度计算的准确率。接着,提出一种同时考虑了用户和产品(项目)特征的缺失数据预测算法。算法中,对用户和产品(项目)分别设置相似度阈值,只有当用户或产品(项目)相似度达到阈值时,才进行缺失数据预测。预测过程中,同时使用用户和产品(项目)相似度信息,以提高准确度。在模型基础上,用淘宝移动客户端的数据集进行了验证,实验结果表明所提算法比其他推荐算法要优异,对数据稀疏性的鲁棒性要高。 Collaborative filtering recommendation algorithm has been widely studied, and widely applied in recent years in many business sys- tems, such as Amazon, Taobao, etc. In practice, collaborative filtering recommendation algorithm faces the problem of data sparsity and low accuracy. The user-item matrix, which is the basic of collaborative filtering, is usually very sparse （with a large number of missing data）, and this leads to inaccurate results. This paper attempts to improve the accuracy of collaborative filtering recommendation from two aspects：（ 1 ） the similarity between users and items ; （2） the prediction of missing data. Firstly, we used the enhanced Pearson Correlation Algorithm to improve the accuracy of user, item similarity calculation by increasing parameters. Then we proposed a new method for predicting missing data, which is based on both the information of users and the information of items. In our algorithm, we set similarity threshold respectively for the user and the item, and only when users or items similarity meet or exceed the threshold, the missing data is predicted. In the prediction process, we used both the user and the item similarity information to improve the accuracy of the algorithm. Finally, through the experimental analysis of the data set of Taobao mobile client, we found that our algorithm is superior to other collaborative filtering algorithms, and the robustness of da- ta sparsity is much higher.

作者周明升韩冬梅

机构地区上海财经大学信息管理与工程学院上海外高桥保税区联合发展有限公司

出处《微型机与应用》 2016年第17期17-19,共3页 Microcomputer & Its Applications

基金国家自然科学基金资助项目(41174007) 上海财经大学研究生教育创新计划项目(CXJJ-2014-440)

关键词协同过滤推荐系统缺失数据预测数据稀疏性 collaborative filtering recommender system missing data prediction data sparsity

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1RESNICK P, IACOVOU N, SUCHAK M, et al. Grouplens: an open architecture for collaborative filtering of netnews [ C ]. Pro- ceedings of ACM Conference on Computer Supported Coopera- tive Work, 1994 : 175-186.
2BREESE J S, HECKERMAN D, KADIE C. Empirical analysis of predictive algorithms for collaborative filtering[ C ]. In Pro- ceedings of the 14th Conference on Uncertainty in Articifical In- telligence, 1998:43-52.
3Wang Jun, DEVRIES A P, REINDERS M J T. Unifying user- based and item-based collaborative filtering approaches by simi- larity fusion[ A ]. Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Infor- mation Retrieval[ C ]. USA : Seatole, 2006:501-508.
4XUE G R, LIN C, YANG Q, et al. Scalable collaborative filte- ring using cluster-based smoothing [ C]. Proceedings 28th In- ternational ACM SIGIR Conference on Research and Develop- ment in Information Retrieval, 2005:114-121.
5Ma Hao, KING I, LYU M R. Effective missing data prediction for collaborate filtering [ C ]. SIGIR 2007 : Proceedings of the Intermational ACM SIGIR Conference on Research and Devel- opment in Information Retrieval, Amsterdam the Netherlands, 2007:39-46.
6黄创光,印鉴,汪静,刘玉葆,王甲海.不确定近邻的协同过滤推荐算法[J].计算机学报,2010,33(8):1369-1377. 被引量：217
7邓爱林,朱扬勇,施伯乐.基于项目评分预测的协同过滤推荐算法[J].软件学报,2003,14(9):1621-1628. 被引量：558
8MCLAUGHLIN M R, HERLOCKER J L. A collaborative filte- ring algorithm and evaluation metric that accurately model the user experience[ C]. International ACM SIGIR Conference on Reseach and Development in Information Retrieral. ACM, 2004 : 329-336.
9HOFMANN T, HOFMANN T. Latent semantic models for col- laborative filtering[J]. ACM Transactions on Information Sys- tems, 2004, 22(1) :89-115.

二级参考文献33

1陈健,印鉴.基于影响集的协作过滤推荐算法[J].软件学报,2007,18(7):1685-1694. 被引量：59
2Goldberg D,Nichols D,Oki B,Terry D.Using collaborative filtering to weave an information tapestry.Communications of the ACM,1992,35(12):61-70.
3Resnick P,Iacovou N,Suchak M,Bergstorm P,Riedl J.GroupLens:An open architecture for collaborative filtering of netnews//Proceedings of the 1994 ACM Conference on Computer Supported Cooperative Work.Chapel Hill,North Carolina,United States,1994:175-186.
4Shardanand U,Maes P.Social information filtering:Algorithms for automating "word of mouth"//Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.Denver,Colorado,United States,1995:210-217.
5Hill M,Stead L,Furnas G.Recommending and evaluating choices in a virtual community of use//Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.Denver,Colorado,United States,1995:194-201.
6Sarwar B M,Karypis G,Konstan J A,Riedl J.Application of dimensionality reduction in recommender system-A case study//Proceedings of the ACM WebKDD Web Mining for E-Commerce Workshop.Boston,MA,United States,2000:82-90.
7Massa P,Avesani P.Trust-aware collaborative filtering for recommender systems.Lecture Notes in Computer Science,2004,3290:492-508.
8Vincent S-Z,Boi Faltings.Using hierarchical clustering for learning the ontologies used in recommendation systems//Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.San Jose,California,United States,2007:599-608.
9Park S-T,Pennock D M.Applying collaborative filtering techniques to movie search for better ranking and browsing//Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.San Jose,California,United States,2007:550-559.
10Tomoharu I,Kazumi S,Takeshi Y.Modeling user behavior in recommender systems based on maximum entropy//Proceedings of the 16th International Conference on World Wide Web.Banff,Alberta,Canada,2007:1281-1282.

共引文献737

1陈晋鹏,李海洋,张帆,李环,魏凯敏.基于会话的推荐方法综述[J].中文信息学报,2023,37(3):1-17. 被引量：4
2查琇山,刘方方.基于缺失值补全和SVD的手游推荐方法[J].计算机应用研究,2020,37(S02):166-169. 被引量：1
3刘美博,满君丰,彭成,刘鸣.引入隐式反馈的多维度推荐算法[J].计算机应用研究,2020,37(1):158-162. 被引量：2
4张光卫,康建初,李鹤松,刘常昱,李德毅.面向场景的协同过滤推荐算法[J].系统仿真学报,2006,18(z2):595-601. 被引量：27
5龚松杰.个性化推荐中一种新的相似性计算方法[J].计算机系统应用,2008,17(7):87-89. 被引量：1
6娄建玮,刘红军,郑伟.C#/SQL实现基于项目评分预测的推荐算法[J].职大学报,2007(4):22-23.
7谢瑗瑗,胡祥光,刘军,谷发平.P2P网络中信任模型研究综述[J].军事通信技术,2009,30(2):38-42. 被引量：4
8王恒.基于协同过滤的电子农务推荐系统模型研究[J].宁夏大学学报（自然科学版）,2009,30(4):358-360. 被引量：2
9王茜,杨莉云,杨德礼.面向用户偏好的属性值评分分布协同过滤算法[J].系统工程学报,2010,25(4):561-568. 被引量：24
10张明磊,韩明,王震洲.基于安全多方计算的系统间隐私保持推荐算法[J].河北工业大学学报,2012,41(4):14-18. 被引量：1

同被引文献21

1廖志平.数据挖掘在学校图书馆的应用[J].科技创新导报,2012,9(12):211-211. 被引量：6
2姚舜.关联规则算法在图书自动推荐系统中的应用[J].四川图书馆学报,2012(6):55-58. 被引量：3
3冷亚军,陆青,梁昌勇.协同过滤推荐技术综述[J].模式识别与人工智能,2014,27(8):720-734. 被引量：194
4黄震华,张佳雯,田春岐,孙圣力,向阳.基于排序学习的推荐算法研究综述[J].软件学报,2016,27(3):691-713. 被引量：108
5梁婧文,蒋朝惠.一种基于用户交易行为的隐语义模型推荐算法[J].微型机与应用,2017,36(21):15-18. 被引量：1
6黄立威,江碧涛,吕守业,刘艳博,李德毅.基于深度学习的推荐系统研究综述[J].计算机学报,2018,41(7):1619-1647. 被引量：420
7吴雪君,米红娟,李欣.一种基于随机游走的歌曲推荐算法[J].信息技术与网络安全,2019,38(10):35-39. 被引量：1
8邝耿力.基于“图书足迹”的阅读推荐系统研究[J].四川图书馆学报,2020(2):38-43. 被引量：1
9杨彦荣,张莹.基于用户聚类的图书协同推荐算法研究[J].科技资讯,2020,18(9):198-199. 被引量：1
10刘超慧,韩传福,陈天成,孔先进.融合惩罚因子和时间权重的协同过滤推荐算法[J].信息技术与网络安全,2020,39(5):17-21. 被引量：10

引证文献2

1杨玉枝.一种改进的缺失数据协同过滤图书自动推荐模型研究[J].科技资讯,2021,19(10):181-186.
2张天蔚.基于深度网络的推荐系统偏置项改良研究[J].信息技术与网络安全,2021,40(8):42-46. 被引量：1

二级引证文献1

1边琳丽,刘泽惠,李琦.基于反馈神经网络的财务服务机器人研究[J].自动化与仪器仪表,2021(12):167-171.

1彭新东,杨勇.双犹豫模糊软集的研究[J].计算机工程,2015,41(8):262-267. 被引量：8
2俞五炎.基于特征权值系数算法的网页分类方法研究[J].中国电子商务,2012(8):34-35.
3马敏,陆成超,江锋.基于ECT系统的信号完整性分析[J].传感器与微系统,2016,35(7):30-31.
4靖辉.基于DSP的嵌入式车辆图像监控算法的优化[J].吉林建筑工程学院学报,2009,26(6):80-82.
5许华荣,杨怡,洪朝群.基于颜色概率模型的交通标志识别算法研究[J].漳州师范学院学报（自然科学版）,2012,25(4):19-23. 被引量：3
6刘骥,曹凤莲,甘林昊.基于叶片形状特征的植物识别方法[J].计算机应用,2016,36(A02):200-202. 被引量：14
7郑仁富,刘杰.基于DSP的广播节目自动识别系统的实现[J].电子技术应用,2008,34(7):35-38. 被引量：1
8陈凯俊.自适应快速最大信息系数算法实现[J].微电子学与计算机,2016,33(9):70-73.
9张利强.市政管网无线监测系统中位置选择的优化方法[J].机电产品开发与创新,2012,25(6):131-133. 被引量：1
10王娅丹,李鹏,金瑜,刘宇.标签共现的标签聚类算法研究[J].计算机工程与应用,2015,51(2):146-150. 被引量：3

微型机与应用

2016年第17期

浏览历史

内容加载中请稍等...

一种改进的缺失数据协同过滤推荐算法被引量：2

参考文献9

二级参考文献33

共引文献737

同被引文献21

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种改进的缺失数据协同过滤推荐算法 被引量：2

参考文献9

二级参考文献33

共引文献737

同被引文献21

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种改进的缺失数据协同过滤推荐算法被引量：2