期刊文献+

面向特定领域的产品评价对象自动识别研究 被引量:34

Research on Comment Target Recognition for Specific Domain Products
下载PDF
导出
摘要 产品评价对象的自动识别是文本观点信息抽取和倾向性分析中的重要研究课题之一。该文针对汽车评论,提出了一种不依赖外部资源的无指导评价对象自动识别方法。该方法首先综合使用词形模板和词性模板,采用模糊匹配方法和剪枝法抽取候选评价对象。然后,从候选对象集中,采用双向Bootstrapping方法识别出产品评价对象。最后,通过采用K均值聚类方法对产品评价对象进行聚类,实现从评价对象中自动抽取产品名称和产品属性。实验结果表明,该方法对产品评价对象识别的F值达到58.5%,产品名称识别的F值达到69.48%。 The comment target recognition for the products is one of the important topics in text opinion information extraction and the sentiment analysis. For car product reviews, this paper proposes an unsupervised method to recognize comment targets without relying upon additional resources. In this method, we employ the fuzzy match technique for the word templates and part of speech templates and the pruning technique to extract candidate evaluated objects. Then the bidirectional Bootstrapping approach is used to recognize the comment targets from the candidate set. Lastly, the comment targets of the products are clustered by the K means method to recognize the product name and the product attributes. The experimental results indicate that the F-value of the recognition of the comment targets and the product names can achieve 58.5% and 69.48% respectively.
出处 《中文信息学报》 CSCD 北大核心 2010年第1期89-93,共5页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60875040 60970014) 教育部高等学校博士点基金资助项目(200801080006) 山西省自然科学基金资助项目(2007011042) 教育部科学技术研究重点基金资助项目(2007018) 山西省重点实验室开放基金资助项目(2007031017) 太原市科技局明星专项(09121001)
关键词 计算机应用 中文信息处理 产品评价对象 产品名称 产品属性 模板 K均值聚类 双向Bootstrapping方法 computer application Chinese information processing comment target of product product name prod uct attribute template K-means clustering bidirectional Bootstrapping
  • 相关文献

参考文献15

  • 1刘非凡,赵军,吕碧波,徐波,于浩,夏迎炬.面向商务信息抽取的产品命名实体识别研究[J].中文信息学报,2006,20(1):7-13. 被引量:47
  • 2Hongye Tan,Tieiun Zhao,Jianmin Yao. A Study on Pattern Generalization in Extended Named Entity Recognition[J]. Chinese Journal of Electronic, 2007, 16 (4):675-678.
  • 3Cheng Niu,Wei Li,Jihong Ding, etc. A Bootstrapping Approach to Named Entity Classification Using Successive Learners[C]// Proceedings of the 41st ACL, Sapporo, Japan, 2003 : 335-342.
  • 4赵军,许洪波,黄萱菁,谭松波,刘康,张奇.中文倾向性分析评测技术报告[C]//第一届中文倾向性分析评测会议(The First Chinese Opinion Analysis Evaluation).COAE,2008.
  • 5何慧,李思,肖芬,等.PRIS中文情感倾向性分析技术报告[C]//Proceedings of the COAE2008,Harbin,2008:46-55.
  • 6张姝,贾文杰,夏迎炬,等.基于CRF的评价对象抽取技术研究[C]//Proceedings of the COAE2008,Harbin,2008:32-37.
  • 7王俞霖,孙乐.中国科学院软件研究所COAE2008报告[C]//Proceedings of the COAE2008,Harbin,2008:1-20.
  • 8赵妍妍,刘鸿宇,秦兵,等.HIT_IR_OMS:情感分析系统[C]//Proceedings of the COAE2008,Harbin,2008:81-88.
  • 9Mingqing Hu and Bing Liu. Mining and Summarizing Customer Reviews[C]//Proceedings of the tenth ACM SIGKDD. 2004 : 168-177.
  • 10O. Etzioni,M. Cafarella,D. Downey,etc. Unsupervised Named-Entity Extraction from the Web: An Experimental Study[J]. Artificial Intelligence, 2005, 165(1) :91-134.

二级参考文献14

  • 1John M.Pierre. Mining Knowledge from Text Collections Using Automatically Generated Metadata [A]. In: Proceedings of Fourth International Conference on Practical Aspects of Knowledge Management [C].London, UK: Springer-Verlag, 2002, 537- 548.
  • 2Bick, Eekhard. A Named Entity Recognizer for Danish[A]. In:IAno et al. (eds.), Proc. of 4th International Conf.on Language Resources and Evaluation(LRE2004)[C], Lisbon, 2004, 305-308.
  • 3Jian Sun, Jianfeng Gao, Lei Zhang, Ming Zhou, Changning Huang. Chinese Named Entity Identification Using Class-based Language Model [A]. In:Proceedings of the 19th international conference on Computational Linguistics[C]. Morristown, NJ, USA, Association for Computational Linguistics, 2002, 1 - 7.
  • 4Huaping Zhang, et al. Chinese NER Using Role Model [J]. Special Issue of the International Journal of Computational Linguistics and Chinese Language Processing, 2O03, 8(2):29 - 60.
  • 5Guohong Fu and Kang-Kwong Lake. Chinese Unknown Word Identification Using Clags-based LM[A]. In:Proceedings of the First International JointConference on Natural Language Processing (IJCNLP- 04) [C]. Hainan, China,2004, 262-269.
  • 6Tzong-Han Tsai, et al. Mencius: A Chinese Named Entity Recognizer Using the Maximum Entropy-based Hybrid Model [J]. International Journal of Computational Linguistics & Chinese Language Processing, 2004, 9(1):62- 82.
  • 7Cheng Niu, Wei Li, Jihong Ding and Rohini K. Srihari. A Bootstrapping Approach to Named Entity Classification Using Successive Learners [A]. In: Proceedings of the 41st ACL [C], Sappom, Japan, 2003, 335- 342.
  • 8Shai Fine, Yoram Singer, Naftali Tishby. (1998) The Hierarchical Hidden Markov Model: Analysis and Applications[J]. btachine Learning. 1998, 32(1): 41-62.
  • 9Y. Z. Wu, J. Zhao, B. Xu. Chinese Named Entity Recognition Combining Statistical Model with Human Knowledge[A]. Workshop of 41st ACL: nuhilingual and Mix-language NER[C], Sapporo, Japan, 2003, 65 - 72.
  • 10Hatzivassiloglou V, Gravano L and Maganti A. An Investigation of Linguistic Features and Clustering Algorithms for Topical Document Clustering [A]. In:Proceedings of the 23rd ACM SIGIR Conference, Athens [C]. 2000. 224-231.

共引文献84

同被引文献412

引证文献34

二级引证文献262

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部