摘要
协同过滤推荐算法通过研究用户的喜好,实现从海量数据资源中为用户推荐其感兴趣的内容。衡量用户(资源)的相似性是协同过滤算法的核心内容,在数据量大的系统中,用户(资源)的相似性度量会面临准确性和计算复杂性等问题,影响到推荐效果。提出一种改进的协同过滤推荐算法,提取用户兴趣偏好的多值信息,运用改进Minhash算法度量用户相似性,并结合Mapreduce分布式计算,合理、高效地产生用户邻居,实现对用户的评分推荐。实验结果表明:改进算法能有效改善大数据集的推荐准确性并提高推荐效率,降低了推荐耗时。
Collaborative filtering recommendation algorithm recommends interesting content for users from a massive data resource, by studying the user’s preferences. Measuring similarity of user(resource)is the core of collaborative filtering algorithms. In the large volume of data systems, the accuracy and computational complexity are faced in similarity measuring, which thus affect the recommendation results. This paper proposes an improved collaborative filtering algorithm by extracting multi-valued information of user interest preferences, uses improved Minhash algorithm to measure user similarity, and combines with Mapreduce distributed computing, to generate neighbor rationally and effectively, and finishes user ratings recommendations. Experimental results show that the improved algorithm can improve the recommendation accuracy and efficiency, reduce the recommended time-consuming for large data sets.
出处
《计算机工程与应用》
CSCD
北大核心
2016年第12期95-100,共6页
Computer Engineering and Applications
关键词
协同过滤
兴趣偏好
相似度计算
分布式计算
collaborative filtering
interest preferences
similarity calculation
distributed computing