期刊文献+

基于互自扩展模式的中文产品属性提取算法 被引量:3

Feature extraction method based on mutual self-expanding mode
下载PDF
导出
摘要 针对中文在线评论中产品属性词的提取,提出了一种基于互自扩展模式的半监督学习方法。利用较少的人工参与,通过FP-Growth算法挖掘频繁项集获得种子属性词,通过增量迭代发现新的属性词;在每一轮迭代中,通过计算提取词与提取模式的置信度确保了算法的准确性,同时避免了主题偏移。最后通过相似提取模式获得复合提取词,大大减少了因分词及词性标注错误所导致的属性词挖掘错误,以牺牲较少准确率的代价换取了较高的召回率。实验结果表明,该算法对产品属性提取的F值可以达到78.97%,结果优于其他类似的提取算法。 This paper proposed a feature extraction method based on mutual self-expanding in Chinese product comment. With little manual work, this method found seed features by FP-Growth, then found the other new features by an incremental iterative procedure. During the iteration, the confidence coefficient of the extracted-word and the extracted-mode insured a high precision, avoided deviating theme at the same time. At last, this method found combination extracted-word by similarity ex- tracted-mode. It could reduce many feature extraction mistakes caused by word segmentation technology and part-of-speech tagging technology, and got a high precision with reducing little recall rate. The experimental results indicate that the F-score of the proposed method for product feature extraction can be 78.97%, is better than the other method of the literatures of this paper.
出处 《计算机应用研究》 CSCD 北大核心 2017年第4期977-980,共4页 Application Research of Computers
基金 国家自然科学基金面上项目(61471083) 国家教育部人文社科研究规划基金资助项目(14YJA630044)
关键词 在线评论 产品属性提取 互自扩展 FP-GROWTH算法 置信度 online comment product features extraction mutual sel-expanding FP-Growth method confidence coefficient
  • 相关文献

参考文献3

二级参考文献34

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2姚天昉,聂青阳,李建超,李林琳,陈柯,付宁.一个用于汉语汽车评论的意见挖掘系统[C]//中文信息处理前沿进展-中国中文信息学会二十五周年学术会议论文集.北京:清华大学出版社,2006:260-281.
  • 3赵军,许洪波,黄萱菁,谭松波,刘康,张奇.中文倾向性分析评测技术报告[C]//第一届中文倾向性分析评测会议(The First Chinese Opinion Analysis Evaluation).COAE,2008.
  • 4Hong Yu, Vasileios Hatzivassiloglou. Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences [C]//Proceedings of EMNLP 2003,2003: 129-136.
  • 5Ellen Riloff, Janyce Wiebe, William Phillips. Exploiting subjectivity classification to improve information extraction [ C ]//Proceedings of AAAI-2005, 2005: 1106-1111.
  • 6Minqing Hu,Bing Liu. Mining opinion features in customer reviews[C]//Proceedings of AAAI-2004,2004: 755-760.
  • 7倪茂树,林鸿飞.基于关联规则和极性分析的商品评论挖掘[C]//第三届全国信息检索与内容安全学术会议,2007:635-642.
  • 8Soo-Min Kim,Eduard Hovy. Automatic detection of opinion bearing words and sentences[C]//Proceedings of IJCNLP-2005,2005 : 61-66.
  • 9Jun Zhao,Kang Liu,GenWang. Adding redundant features for crfs based sentence sentiment classification [C]//Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008: 117-126.
  • 10Minqing Hu, Bing Liu. Mining and summarizing customer reviews [C]//Proceedings of KDD-2004, 2004 : 168-177.

共引文献173

同被引文献23

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部