
Distributed Recommendation System Based on Flink (cited by: 1)
Abstract: As data volumes keep growing, traditional recommendation systems suffer from low computational efficiency, slow real-time recommendation, and unsatisfactory recommendation quality. To address these problems, we use Apache Flink, a new-generation stream computing engine, as the computing platform for recommendation, and combine it with open-source big data technologies such as Hadoop, Hive, Redis, ZooKeeper, and Kafka to build a distributed recommendation system. Alink is used to improve the efficiency of the offline recommendation algorithm in distributed scenarios. The real-time recommendation algorithm is improved by using each user's most recent historical ratings and incorporating a time decay function to generate a Top-N real-time recommendation list. The results show improvements in accuracy, recall, and normalized discounted cumulative gain (NDCG), indicating that the improved algorithm recommends more effectively.
Authors: ZHENG Jiangwen (郑江文); ZHAO Chao (赵超) (School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei 056038, China)
Source: Information & Computer (《信息与电脑》), 2022, No. 19, pp. 108-112 (5 pages)
Keywords: Apache Flink; collaborative filtering; Alink; time decay function; recommendation system
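
The abstract describes the real-time step only in outline: recent ratings, a time decay function, and a Top-N list. The sketch below illustrates one common way such a step can be built; it is not the authors' formula, which this record does not give. The exponential half-life decay, the item-to-item similarity table, and the names `decayed_top_n`, `recent_ratings`, `similarity`, and `half_life` are all assumptions made for illustration.

```python
from collections import defaultdict


def decayed_top_n(recent_ratings, similarity, now, n=3, half_life=3600.0):
    """Rank unseen items from a user's recent ratings with time decay.

    Hypothetical illustration, not the paper's algorithm.
    recent_ratings: list of (item_id, rating, timestamp) for one user.
    similarity: dict mapping item_id -> {candidate_id: similarity}.
    now, timestamp, half_life: same time unit (e.g. seconds).
    """
    rated = {item for item, _, _ in recent_ratings}
    weighted_sum = defaultdict(float)
    weight_total = defaultdict(float)

    for item, rating, ts in recent_ratings:
        # Assumed decay: a rating loses half its weight every half_life.
        decay = 0.5 ** ((now - ts) / half_life)
        for candidate, sim in similarity.get(item, {}).items():
            if candidate in rated:
                continue  # do not recommend items the user already rated
            weighted_sum[candidate] += sim * decay * rating
            weight_total[candidate] += sim * decay

    # Predicted score: decay-and-similarity weighted average of ratings.
    scores = {
        c: weighted_sum[c] / weight_total[c]
        for c in weighted_sum
        if weight_total[c] > 0
    }
    # Highest score first; ties broken by item id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


if __name__ == "__main__":
    ratings = [("a", 5.0, 100.0), ("b", 1.0, 0.0)]
    sims = {"a": {"x": 1.0, "y": 0.5}, "b": {"x": 1.0, "z": 1.0}}
    print(decayed_top_n(ratings, sims, now=100.0, n=2, half_life=100.0))
```

In this toy input the older rating of item `b` counts half as much as the fresh rating of item `a`, so candidates linked to `a` dominate the ranking. In a deployment like the one the abstract names, the recent ratings would plausibly arrive via Kafka and be cached in Redis, but that wiring is likewise an assumption here.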