期刊文献+

结合强化学习和用户短期行为的新闻推荐算法

NEWS RECOMMENDATION ALGORITHM COMBINING REINFORCEMENT LEARNING AND USER SHORT-TERM BEHAVIOR
下载PDF
导出
摘要 针对传统的协同过滤推荐算法仅根据用户历史评分矩阵进行推荐,存在矩阵稀疏和无法动态观察用户兴趣变化的问题,提出一种将用户短期行为和强化学习相结合的新闻推荐方法。将新闻文本向量化后,通过聚类提取类别特征,再根据强化学习中的状态、动作和奖励等概念,以Double DQN算法为框架来建立推荐模型,利用循环神经网络近似动作值函数来进行计算。最后在财新网的真实新闻浏览数据集上对提出的算法进行验证,对比传统算法,实验结果表明,提出的算法在推荐准确率、召回率等指标上都有明显提高,能够更加有效地进行推荐。 The traditional collaborative filtering recommendation algorithm only makes recommendations based on the user history score matrix,which has the problems of sparse matrix and inability to dynamically observe user interest changes.A news recommendation method that combines user short-term behavior and reinforcement learning is proposed.After the news text was vectorized,the category features were extracted through clustering.Based on the concepts of state,action and reward in reinforcement learning,the Double DQN algorithm was used as the framework to establish a recommendation model,and the recurrent neural network was used to approximate the action value function for calculation.The proposed algorithm was verified on the real news browsing data set of Caixin.Compared with the traditional algorithm,the experimental results show that the proposed algorithm has significantly improved the recommendation precision rate,recall rate and other indicators,and can perform more effectively.
作者 姚楠 何山 赵越 李任花 Yao Nan;He Shan;Zhao Yue;Li Renhua(School of Computer Science,Southwest Petroleum University,Chengdu 610500,Sichuan,China;Chengdu Advanced Power Semiconductor Co.,Ltd.,Chengdu 611730,Sichuan,China)
出处 《计算机应用与软件》 北大核心 2024年第4期284-290,共7页 Computer Applications and Software
关键词 推荐系统 强化学习 新闻推荐 神经网络 聚类 Recommender system Reinforcement learning News recommendation Neural network Clustering
  • 相关文献

参考文献4

二级参考文献51

  • 1邢春晓,高凤荣,战思南,周立柱.适应用户兴趣变化的协同过滤推荐算法[J].计算机研究与发展,2007,44(2):296-301. 被引量:147
  • 2Baeza-Yates R,Ribeiro-Neto B.Modern Information Retrieval[M].New York:ACM press,1999.
  • 3Manning C D,Schütze H.Foundations of Statistical NaturalLanguage Processing [M].Cambridge:MIT press,1999.
  • 4Hwang M,Choi C,Youn B,et al.Word Sense Disambiguation Based on Relation Structure[C]∥International Conference on Advanced Language Processing and Web Information Technology.2008:15-20.
  • 5Wang X,Mccallum A,Wei X.Topical N-Grams:Phrase andTopic Discovery,with an Application to Information Retrieval [C]∥IEEE International Conference on Data Mining.IEEE Computer Society,2007:697-702.
  • 6Haruechaiyasak C,Jitkrittum W,Sangkeettrakarn C,et al.Im-plementing News Article Category Browsing Based on Text Categorization Technique [C]∥2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.IEEE Computer Society,2008:143-146.
  • 7Mikolov T,Sutskever I,Chen K,et al.Distributed Representations of Words and Phrases and their Compositionality [J].Advances in Neural Information Processing Systems,2013,26:3111-3119.
  • 8Mikolov T,Chen K,Corrado G,et al.Efficient Estimation of Word Representations in Vector Space [C]∥ICLR 2013.2013.
  • 9Joachims T.A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization [M].Springer US,1997:143-151.
  • 10Hinton G E.Learning distributed representations of concepts[C]∥Proceedings of CogSci.1986:1-12.

共引文献566

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部