期刊文献+

基于泛化和繁殖的自举式意见目标抽取方法

Bootstrapping opinion target extraction based on generalization and propagation
原文传递
导出
摘要 意见目标抽取是自然语言处理领域中意见挖掘研究的重要环节。该文提出了一种基于泛化、繁殖和自举的意见目标抽取方法,在泛化过程中提炼原子意见目标和意见目标模式,在繁殖过程中对复合意见目标进行扩展,并采取自举机制实现了意见目标的递增学习。实验结果显示,经过第一轮自举过程后,该方法的F-1 score指标超出基线方法0.078;自举过程完成后,F-1 score指标提高了0.112。这说明,泛化处理对意见目标充分繁殖意义重大,自举过程则有助于充分发挥泛化能力和繁殖能力。 Opinion target extraction is a key step in opinion mining.A method was developed for opinion target extraction based on generalization,propagation and bootstrapping.The generalization module extracts atom opinion targets and opinion target patterns from the compound opinion targets,the propagation module synthesizes compound opinion targets with a reasoning mechanism,and the bootstrapping module provides multi-cycle incremental learning.Tests show that the F-1 score for this method outperforms the baseline by 0.078 in the first cycle and by 0.112 in the last cycle.Thus,generalization improves the propagation and the bootstrapping helps to maximize the contributions of the generalization and propagation.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2009年第S1期1333-1338,共6页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金项目(60703051)
关键词 自然语言处理 意见挖掘 意见目标抽取 文本挖掘 natural language processing opinion mining opinion target extraction text mining
  • 相关文献

参考文献10

  • 1姚天昉,娄德成.汉语语句主题语义倾向分析方法的研究[J].中文信息学报,2007,21(5):73-79. 被引量:77
  • 2YI J,Niblack W.Sentiment Mining in WebFountain. Proc Int Conf on Data Eng . 2005
  • 3HU Minqing,LIU Bing.Mining and summarizing customerreviews. Proc Int Conf on Knowledge Discovery andData . 2004
  • 4Popescu A M,Etzioni O.Extracting product features andopinions from reviews. Proc Human LanguageTechnology Conf and Conf on Empirical Methods in NaturalLanguage Processing . 2005
  • 5Sproat R,Shih C.A statistical method for finding wordboundaries in Chinese text. J Computer Processing ofChinese and Oriental Languages . 1990
  • 6GAO Jianfeng,LI Mu,WU Andi,et al.Chinese wordsegmentation and named entity recognition:A pragmaticapproach. Computational Linguistics . 2005
  • 7XU Ruifeng,XIA Yunqing,WONG Kam Fai,et al.Opinionannotation in on-line Chinese product reviews. Proc IntConf on Language Resources and Evaluation . 2008
  • 8MA Jinshan,ZHANG Yu,LIU Ting,et al.A statisticaldependency parser of Chinese under small training data. Proc Int Joint Conf on Natural Language Processing . 2004
  • 9ZHOU Qiang,YU Hang.Integrate statistical model andlexical knowledge for Chinese multiword chunking. ProcInt Conf on Natural Language Processing and KnowledgeEng . 2008
  • 10Feng Haodi,Chen Kang,Deng Xiaotie,et al.Accessor va- riety criteria for Chinese word extraction. Computa- tional Linguistics . 2004

二级参考文献17

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 3姚天昉,等.一个用于汉语汽车评论的意见挖掘系统[A].中文信息处理前沿进展-中国中文信息学会二十五周年学术会议论文集[C].北京:清华大学出版社,2006,260-281.
  • 4哈尔滨工业大学信息检索研究室.中文依存句法分析概况介绍[EB/OL].http://ir.hit.edu.cn/phpwebsite/index.php?module=pagemaster&PAGE user op=view page&PAGE id=147&MMN position=52:48,2006.
  • 5P.J.Stone,D.C.Dunphy,M.S.Smith,and D.M.Ogilvie.The General Inquirer:A Computer Approach to Content Analysis[M].Cambridge,MA,USA:MIT Press.1966.
  • 6Z.Dong and Q.Dong.HowNet[EB/OL].http://www.keenage.com/zhiwang/e zhiwang.html,2003.
  • 7S.-M.Kim and E.Hovy.Determining the Sentiment of Opinions[A].In:Proceedings of COLING-04,the Conference on Computational Linguistics (COLING-2004)[C].Geneva,Switzerland:2004.1367-1373.
  • 8J.Yi,T.Nasukawa,R.Bunescu,and W.Niblack.Sentiment Analyzer:Extracting Sentiments about a Given Topic Using Natural Language Processing Techniques[A].In:Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM-2003)[C].Melbourne,USA:2003.427-434.
  • 9M.Hu and B.Liu.Mining Opinion Features in Customer Reviews[A].In:Proceedings of Nineteeth National Conference on Artificial Intellgience (AAAI-2004)[C].San Jose,USA:2004.755-760.
  • 10A.-M.Popescu and O.Etzioni.Extracting Product Features and Opinions from Reviews[A].In:Proceedings of the Human Language Technology Conference/Conference on Empirical Methods in Natural Language Processing (HLT-EMNLP-05)[C].Vancouver,Canada:2005.339-346.

共引文献76

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部