期刊文献+

产品评论特征及观点抽取研究 被引量:11

Extracting Product Features and Opinions from Product Reviews
下载PDF
导出
摘要 随着电子商务的飞速发展,电子商务网站上各种产品的评论数量也在飞速地增长。如何从Web中大量存在的产品评论中挖掘出对消费者和生产厂商都有价值的信息,已经成为一个非常重要的研究领域。产品特征及观点的抽取是产品评论挖掘中的基本工作,其质量的好坏直接决定着后续工作的效果。双向传播算法能有效地抽取产品评论中的特征及观点,但对中文产品评论仍存在一些不足。本文对双向传播算法做了进一步的改进,提高了在中文产品评论中特征及观点抽取的准确率和召回率。首先,增加了两种产品特征和观点的间接句法依存关系模式,并引入了动词产品特征以增加召回率;其次,将产品特征和观点之间的句法依存关系模式作为HUB节点,利用HITS算法对候选产品特征和观点排序,从而提高准确率;最后,提出了模式相关性对最终抽取的产品特征进行优化,进一步提高了产品特征抽取的准确率。实验结果表明,本文的算法在不同产品评论的特征及观点抽取中都取得了较好的效果。 With the great development of e-commerce, the number of product reviews grows rapidly on the e- commerce websites. Review mining has recently received a lot of attention, which aims to discover the valuable information from the massive product reviews. Extraction of product features and opinions are the basic tasks of product review mining. Its effectiveness can influence significantly the performance of subsequent jobs. Double Propagation is a state-of-the-art technique in product features and opinions extraction, but there are some shortcomings when processing Chinese reviews. In this paper, we apply the Double Propagation to the product features and opinions exaction from Chinese product reviews and adopt some techniques to improve the precision and recall. First, indirect relations and verb product features are introduced to increase the recall. Second, the dependency relation patterns between product features and opinion are employed as hubs, and HITS is applied to rank ranking candidate product features and opinions for improving the precision. Finally, the Normalized Pattern Relevance is employed to filter the exacted product features. Experiments on diverse real-life datasets show promising results.
作者 郗亚辉
出处 《情报学报》 CSSCI 北大核心 2014年第3期326-336,共11页 Journal of the China Society for Scientific and Technical Information
基金 国家自然科学基金资助项目(61170039)
关键词 产品评论挖掘 产品特征和观点抽取 双向传播 HITS算法 模式相关性 product review mining, product features and opinions exaction, double propagation, HITS, normalized pattern relevance
  • 相关文献

参考文献23

  • 1Hu M, Liu B. Mining and summarizing customer reviews[ C ]//Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York: ACM Press, 2004:168-177.
  • 2Popescu A M, Etzioni O. Extracting product features and opinions from review [ C ]//Proceedings of the Human Language Technology Conference and the Conference on Empirical Methods in Natural Language Processing, Stroudsburg, USA : Association for Computational Linguistics, 2005:339-346.
  • 3黄晓斌,周珍妮.观点挖掘在竞争对手分析中的应用[J].情报资料工作,2010,31(5):89-93. 被引量:15
  • 4周珍妮,黄晓斌.网络用户评论在企业竞争情报研究中的应用[J].情报理论与实践,2012,35(5):15-20. 被引量:13
  • 5Zhang L, Liu B, Lira S H, et al. Extracting and ranking product features in opinion documents [ C ]//Proceedings of the 23rd International Conference on Computational Linguistics, Stroudsburg, USA : Association for Computational Linguistics, 2010 : 1462-1470.
  • 6Wang B, Wang H. Bootstrapping both product properties and opinion words from Chinese reviews with cross- training [ C ]//Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence,Washington: IEEE Computer Society , 2007:259-262.
  • 7Somprasertsri G, Lalitrojwong P. A maximum entropy model for product feature extraction in online customer reviews [ C ]// Proceedings of the Third IEEE International Conference on Cybernetics and IntelligentSystems, Washington: IEEE Computer Society , 2008: 575-580.
  • 8徐冰,赵铁军,王山雨,郑德权.基于浅层句法特征的评价对象抽取研究[J].自动化学报,2011,37(10):1241-1247. 被引量:48
  • 9Li F, Han C, Huang M, et al. Structure-aware review mining and summarization[ C l// Proceedings of the 23rd International Conference on Computational Linguistics, Stroudsburg, USA: Association for Computational Linguistics, 2010:653-661.
  • 10Yi J ,Nasukawa T,Bunescur R, et al. Sentiment Analyzer : Extracting Sentiments about a Given Topic Using Natural Language Processing Techniques [ C ]/ Proceedings of the 3rd IEEE International Conference on Data Mining, Washington: IEEE Computer Society, 2003:427-434.

二级参考文献22

共引文献193

同被引文献130

引证文献11

二级引证文献92

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部