摘要
用户特征可以通过在线用户的点赞信息进行奇异值分解和Logistic回归有效预测,然而对新用户的特征预测却难以实现。为了解决该问题,提出了一种基于LDA主题模型的在线用户特征预测方法。首先使用LDA模型提取微博用户的点赞文本主题,然后基于主题对新用户的特征进行预测,最后与基于奇异值分解的传统方法比较预测结果。实验结果表明其F1值最高提升0.15,且计算时间平均缩短了69.09%。研究改进了点赞信息固有标签不能准确反映用户偏好的缺陷,避免了传统方法预测过程中仍需对新用户及其点赞信息重新计算的繁琐弊端,为用户特征分析提供了另一条可行途径。
User traits can be effectively predicted by singular value decomposition and Logistic Regression through online user’s‘Like’information.However,this method cannot predict new users’traits.To slove the problem,this paper proposes an online user traits predicting method based on LDA topic model.Firstly,the method extracted the Weibo user’s‘Like’text topic through LDA model.Then it predicted new user traits based on topic.Finally,the result is compared to the traditional method based on singular value decomposition.The results showed that the F1 value of this method was up to 0.15,and the calculation time was shortened by 69.09%in average.Research inproves the defect that the inherent tags of the‘Like’informations cannot accurately reflect user preference,avoiding the disadvantage of recalculating new users and their‘like’information in the predicting process of traditional methods,providing another feasible way for user traits analysis.
作者
王雅静
郭强
邓春燕
林青轩
刘建国
WANG Yajing;GUO Qiang;DENG Chunyan;LIN Qingxuan;LIU Jianguo(Research Center for Complex Systems Science,University of Shanghai for Science&Technology,Shanghai 200093,China;Institute of Accounting and Finance,Shanghai University of Finance and Economics,Shanghai 200433,China;Institute of Sina WRD Big Data,Shanghai 210204,China)
出处
《复杂系统与复杂性科学》
EI
CSCD
2020年第4期9-15,共7页
Complex Systems and Complexity Science
基金
国家自然科学基金(61773248,71771152)
国家社科重大项目(18ZDA088,20ZDA060)。