Abstract
User evaluations of products and information services comprise reviews and scores, and carry rich behavioral information such as users' interests, opinions, and preferences. To reflect users' degree of preference for products faithfully and quantitatively, this paper starts from massive user rating data and defines user preference based on the idea of marginal utility. It then uses D-S evidence theory to describe the uncertainty of the factors that influence user preference and the relationships among those factors. Taking each word in a review, reviews containing positive/negative words, and the score as "evidence" of a user's preference for a product, the paper gives a joint operator that combines all of the influencing factors, together with a MapReduce-based computation method and a mechanism for discovering user preferences. Experiments on correctness, execution time, speedup, and parallel efficiency verify the effectiveness of the proposed method.
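As an illustration of how independent pieces of evidence can be fused under D-S evidence theory, the following minimal sketch implements the standard Dempster rule of combination. It is not the joint operator defined in the paper, which the abstract does not spell out. The hypothesis labels (`like`, `dislike`), the function name `combine`, and the mass values are all hypothetical and chosen only for illustration.

```python
from itertools import product


def combine(m1, m2):
    """Standard Dempster rule of combination for two mass functions.

    Each mass function maps a frozenset of hypotheses to a mass in [0, 1],
    and the masses of each function are assumed to sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            # Agreeing evidence supports the intersection of the two sets.
            combined[common] = combined.get(common, 0.0) + mass_a * mass_b
        else:
            # Disjoint sets contribute to the conflict mass.
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("evidence is in total conflict; rule is undefined")
    norm = 1.0 - conflict
    return {k: v / norm for k, v in combined.items()}


# Hypothetical frame of discernment and mass assignments (illustrative only).
LIKE = frozenset({"like"})
DISLIKE = frozenset({"dislike"})
EITHER = LIKE | DISLIKE  # mass left uncommitted

review_evidence = {LIKE: 0.6, DISLIKE: 0.1, EITHER: 0.3}
score_evidence = {LIKE: 0.7, DISLIKE: 0.2, EITHER: 0.1}

result = combine(review_evidence, score_evidence)
for hypothesis, mass in sorted(result.items(), key=lambda kv: -kv[1]):
    print(sorted(hypothesis), round(mass, 4))
```

Because the rule is commutative and associative, partial combinations can be computed in any order, which is one reason this kind of operator is a plausible fit for a MapReduce-style reduce step; whether the paper exploits this property in the same way is an assumption here.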
Authors
郭心宇
岳昆
李劲
武浩
张彬彬
GUO Xinyu; YUE Kun; LI Jin; WU Hao; ZHANG Binbin (School of Information Science and Engineering, Yunnan University, Kunming 650504, China; School of Software, Yunnan University, Kunming 650504, China)
Source
《计算机科学与探索》
CSCD
Peking University Core Journal
2017, No. 2, pp. 231-241 (11 pages)
Journal of Frontiers of Computer Science and Technology
Funding
National Natural Science Foundation of China, Nos. 61472345, 61402398, 61562090, 61562091
Applied Basic Research Program of Yunnan Province, Nos. 2014FA023, 2016FB110
Second Batch of the "Yunling Scholar" Training Program, No. C6153001
Yunnan University Young Talents Training Program, No. XT412003