期刊文献+

融合信息增益和梯度下降算法的在线评论有用程度预测模型 被引量:2

Helpfulness Degree Prediction Model of Online Reviews Fusing Information Gain and Gradient Decline Algorithms
下载PDF
导出
摘要 由于无法预知产品在线评论的文本内容是否对浏览者有用,大量的无用评论增加了潜在消费者的信息搜索成本,甚至降低了潜在消费者购买产品的可能性。为提高电子商务平台的有用在线评论率,为撰写评论者提供测试功能,建立在线评论有用程度预测模型。根据在线评论的文本特征,所提模型选择在线评论的词语数量、词语的有用值、产品特征数量等3个特征,构建一个预测在线评论有用程度的模型,其中词语的有用值是词语区分在线评论有用程度的信息增益量,然后根据大量在线评论数据利用梯度下降算法解出模型参数。实验结果显示,随着词语数量、词语有用值、产品特征数量的增长,评论有用程度不断提高。实验中把在线评论分为一般、有用、非常有用3个程度,对于一般的在线评论,预测精确率为92.96%;对于“有用”在线评论,预测精确率为94.83%;对于“非常有用”在线评论,预测精确率为67.63%。实验对模型性能进行测试,得到平均精确率为85.05%,召回率为82.81%,F1值为83.72%,该结果验证了所提模型预测在线评论有用程度的可行性。 Because it is impossible to predict whether the text content of online product reviews is helpful for viewers,many reviewers write a large number of unhelpful reviews,which increases the cost of information search for potential consumers,and even reduces the possibility of potential consumers buying products.In order to improve the helpful online reviews rate of e-commerce platform and provide test function for reviewers,a prediction model of online reviews helpfulness is established.According to the text characteristics of online reviews,the model chooses three features of online reviews:the number of words,the helpful value of words,and the number of product features,to construct a model for predicting the helpfulness of online reviews.The helpful value is the information gain of words to distinguish the helpfulness of online reviews.And then,according to a large number of online reviews,by using the gradient descent algorithm,the model parameters are solved.The experimental results show that with the increase of the number of words,helpful value of words and the number of product features,the helpfulness of reviews increases continuously.The online reviews are divided into three levels:general,helpful and very helpful.The general predicted accuracy of online reviews is 92.96%,helpful accuracy is 94.83%,and very helpful accuracy is 67.63%.The average accuracy,recall and F1 of the model are 85.05%,82.81%and 83.72%,respectively.The results verify the feasibility of the model to predict the helpfulness of online reviews.
作者 冯进展 蔡淑琴 FENG Jin-zhan;CAI Shu-qin(School of Management,Huazhong University of Science and Technology,Wuhan 430074,China)
出处 《计算机科学》 CSCD 北大核心 2020年第10期69-74,共6页 Computer Science
基金 国家自然科学基金(71371081) 教育部博士点基金(20130142110044)。
关键词 在线评论 有用程度 信息增益 梯度下降法 Online reviews Helpfulness degree Information gain Gradient descent algorithm
  • 相关文献

参考文献7

二级参考文献34

  • 1杨芙清.软件工程技术发展思索[J].软件学报,2005,16(1):1-7. 被引量:266
  • 2姜维,王晓龙,关毅,徐志明.应用粗糙集理论提取特征的词性标注模型[J].高技术通讯,2006,16(10):996-1000. 被引量:3
  • 3Pang Bo, Lee L, Vaithyanathan S. Thumbs Up: Sentiment Classi- fication Using Machine Learning Techniques[C]//Proc. of Association for Computational Linguistics Conference on Em- pirical Methods in Natural Language Processing. Stroudsburg, USA: [s. n.]. 2002.
  • 4Pang Bo, Lee L. Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales[C]//Proc. of the 43rd Annual Meeting on Association for Computational Linguistics. Ann Arbor, Michigan, USA: [s. n.], 2005.
  • 5Jindal N, Liu Bing. Identifying Comparative Sentences in Text Doeuments[C]//Proe. of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Seattle, USA: ACM Press, 2006.
  • 6Tumey P D. Thumbs Up or Thumbs Down: Semantic Orientation Applied to Unsupervised Classification of Reviews[C]//Proc. of the 40th Annual Meeting on Association for Computational Linguistics. Philadelphia, Pennsylvania, USA: [s. n.], 2002.
  • 7Gamon M, Aue A. Automatic Identification of Sentiment Vocabu- lary: Exploiting Low Association with Known Sentiment Terms[C]//Proc. of ACL Workshop on Feature Engineering for Machine Learning in Natural Language Processing. Ann Arbor, Michigan, USA: [s. n.], 2005.
  • 8Zhao Yan, Wang Xiaolong, Liu Bingquan, et al. Applying Class Triggers in Chinese POS Tagging Based on Maximum Entropy Model[C]//Proc. of International Conference on Machine Learning and Cybernetics. [S. 1.]: IEEE Press, 2004.
  • 9Wojciech Z. Variable Precision Rough Sets Model[J]. Journal of Computer and System Sciences, 1993, 46(1): 39-59.
  • 10Gim6nez J, M~rquez L. SVMTooI: A General POS Tagger Generator Based on Support Vector Machines[C]//Proc. of the 4th International Conference on Language Resources and Evaluation. Lisbon, Portugal: [s. n.], 2004.

共引文献99

同被引文献32

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部