期刊文献+

基于集群的协同过滤实时推荐系统研究

Collaborative Filtering Recommendation System Based on Cluster
下载PDF
导出
摘要 大数据环境下的信息挖掘已成为推荐系统研究较为活跃的领域,通过对现有大数据处理框架的对比,采用Spark大数据计算处理引擎,结合基于隐式反馈的ALS协同过滤推荐算法,提出一种Spark框架下ALS算法并行化解决方案,设计了分布式流式计算系统(Spark Distributed-ALS,SD-ALS)。实验结果验证了ALS算法在Spark集群环境下预测精度与单机环境基本保持一致,随迭代次数的增大,RMSE逐渐趋于稳定,并且计算效率显著提升,满足实时推荐的性能要求。 Information mining has become an active research field of recommender system under big data environment. A Spark framework ALS algorithm parallelization solution,which is called Distributed Flow Computing System( Spark Distributed-ALS,SD-ALS)is proposed through the comparison of the existing Big Data processing framework and the usage of Spark Big Data calculation processing engine,combining with implicit feedback ALS collaborative filtering recommendation algorithm. The experimental results verify that the prediction accuracy of the ALS algorithm in the Spark cluster environment is consistent with that in the Stand-alone environment. As the number of iterations increases,RMSE tends to be stable and the computational efficiency is significantly improved to meet the performance requirements of Real-time Recommendation.
作者 舒贵阳 辜丽川 冯娟娟 陈卫 赵子豪 王超 SHU Guiyang;GU Liehuan;FENG Juanjuan;CHEN Wei;ZHAO Zihao;WANG Chao(Anhui Agricultural University,Hefei 230036,Chin)
出处 《洛阳理工学院学报(自然科学版)》 2018年第2期71-77,共7页 Journal of Luoyang Institute of Science and Technology:Natural Science Edition
基金 国家自然科学基金项目(31371533)
关键词 流式数据 SPARK ALS 协同过滤 推荐系统 streaming data Spark ALS collaborative filtering recommender system
  • 相关文献

参考文献8

二级参考文献134

  • 1贾丽会,张修如.BP算法分析与改进[J].计算机技术与发展,2006,16(10):101-103. 被引量:47
  • 2陈刚,刘发升.基于BP神经网络的数据挖掘方法[J].计算机与现代化,2006(10):20-22. 被引量:14
  • 3邢春晓,高凤荣,战思南,周立柱.适应用户兴趣变化的协同过滤推荐算法[J].计算机研究与发展,2007,44(2):296-301. 被引量:146
  • 4黄海清,张平,张曦文.基于用户偏好的智能业务选取研究[J].电子学报,2006,34(B12):2537-2540. 被引量:3
  • 5Takacs G, Pilaszy I, Nemeth B, et al. Matrix factorization and neighbor based algorithms the nettlix prize problem [ C ]//Pro- ceedings of the 2008 ACM conference on recommender sys- tems. Lausanne, Switzerland : ACM, 2008 : 267-274.
  • 6Pilaszy I,Zibriczky D, Tikk D. Fast ALS-basedmatrix factori- zation for explicit and implicit feedback datasets [ C ]//Pro- ceedings of the fourth ACM conference on recommender sys-terns. New York : ACM ,2010:71-78.
  • 7Zhou Yunhong, Wilkinson D, Schreiber R, et al. Large- scale parallel collaborative filtering for the netflix prize [ C ]//Proc of the 4th international conference on algorthmic aspects in in- formation and management. Shanghai: Springer, 2008:337- 348.
  • 8Apache Mahout[ EB/OL]. 2013-12-20. http://mah- out. a- pache, org,/.
  • 9Apache Hadoop[ EB/OL]. 2013-12-20. http://hado- op. a- pache, org.
  • 10Dean J, Ghemawat S. MapReduce:simplified data processing on large clusters [ J]. Communication of the ACM, 2008,51 (1) :107-113.

共引文献509

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部