摘要
针对目前主流的推荐算法中获取的用户信息不完整以及推荐时间过长的问题,本文提出一种基于用户信息向量聚类和改进SAMME的推荐算法,该算法通过分析用户基本信息(地域、时间、兴趣、标签等),找出用户信息关键词;对不同用户信息关键词基于TF-IDF方法进行加权构建用户信息向量;接着使用K-means算法进行用户聚类分析,将用户聚类结果作为改进SAMME训练样本集;最后通过改进SAMME算法将预测结果对用户进行好友推荐,并在训练过程中保存模型,大大减少推荐时间。最终将本文算法在真实的微博用户数据集上进行实验,并与其他主流算法进行对比,结果显示本文算法在准确率、召回率、F值上都取得了不错的效果。
Aiming at the problem of imperfect user information acquisition and long recommendation time in the current mainstream recommendation algorithms,this paper proposes a recommendation algorithm based on user information vector clustering and improved SAMME.The algorithm analyzes basic user information(region,time,interest,tags,etc)to find user information keywords;weights different user information keywords based on the TF-IDF method to construct user information vectors;then uses the K-means algorithm to perform user clustering analysis,and uses the user clustering results as improved SAMME training sample set;finally,the prediction results are recommended to the user by the improved SAMME algorithm,and the model is saved during the training process,which greatly reduces the recommendation time.Finally,the algorithm of this paper is tested on the real Weibo user data set and compared with other mainstream algorithms.The results show that the algorithm of this paper achieves good results in accuracy,recall and F-value.
作者
王杉文
欧鸥
马万民
陈建林
WANG Shan-wen;OU Ou;MA Wan-min;CHEN Jian-lin(College of Information Science and Technology(College of Internet Security),Chengdu University of Technology,Chengdu 610051,China)
出处
《计算机与现代化》
2021年第7期23-28,94,共7页
Computer and Modernization
基金
国家重点研发计划资助项目(2018YFF01013304)。
关键词
推荐系统
SAMME算法
用户信息
聚类分析
recommendation system
SAMME algorithm
user information
cluster analysis