期刊文献+

用于细颗粒度挖掘的产品评论语料库构建技术 被引量:1

Research of product review corpus constructing technology for fine-granularity mining
下载PDF
导出
摘要 为了辅助进行产品评论中特征-观点对识别的挖掘工作,对细颗粒度产品评论语料库的构建技术进行了研究.介绍了用于产品评论细颗粒度挖掘的语料库构建方法,以及目前初步进行的语料标注工作.标注数据可以数据库形式存储,从而实现了无结构化到结构化的转变,为自动查询等处理提供了极大方便.实验结果表明:虽然文中的标注方法以手机产品为例,但具有良好的移植性,可以应用到其他产品评论的细颗粒度语料库构建.相应的语料库构建对于高性能机器学习方法的应用、特征-观点对识别算法的性能提高以及自动评价等具有重要意义. Quantitative analysis and mining of product reviews posted by users are helpful for both manufacturers and consumers.During the work of fine-granularity product review mining,extracting feature-opinion pair is one of the core works.The corresponding corpus construction is of great significance for the application of high performance machine learning methods,improving the performance of feature-opinion extraction algorithm and automatic evaluation.This article introduces corpus constructing technology for fine-granularity product review mining and the initial corpus labeling work,thus realizing non-structured to structural changes.The corpus can be stored in database and thus provide great convenience for automatic query processing.Although current labeling work was performed in mobile phone products,it can be applied also to other product types for fine granularity corpus construction.So our work has good transplantation ability.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2012年第3期64-68,共5页 Journal of Harbin Institute of Technology
基金 教育部人文社会科学研究青年基金资助项目(10YJCZH099) 中央高校基本科研业务费专项资金资助项目(HIT.NSRIF.2009065) 语言语音教育部-微软重点实验室开放基金资助项目(HIT.KLOF.2009022)
关键词 产品意见挖掘 细颗粒度语料库构建 语料标注 product review mining fine-granularity corpus construction corpus annotation
  • 相关文献

参考文献12

  • 1LIU J, WU G, YAO J. Opinion searching in multi- product reviews [ C ]//Proceedings of the Sixth IEEE In-ternational Conference Ion Computer And Information Technology. Washington, DC: IEEE Computer Society, 2006 : 25 - 31.
  • 2DAVE K, LAWRENCE S, PENNOCK D. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews [ C ]//Proceedings of the 12th International Conference on World Wide Web. New York, NY: ACM, 2003:519-528.
  • 3ACIAR S, ZHANG D, SIMOFF S, et aL Informed recom- mender: Basing recommendations on consumer product reviews[J]. Intelligent System, 21XI7, 22(3) : 39 -47.
  • 4HU Minqing, LIU Bing. Mining opinion features in customer reviews [ C]//Proceedings of the 19'h National Conference on Artificial Intelligence. San Jose: AAAI press, 2004 : 755 - 760.
  • 5LIU Bing, HU Minqing, CHENG Junsheng. Opinion observer: Analyzing and comparing opinions on the Web [ C ]//Proceedings of the 14th International Conference on World Wide Web. New York, NY: ACM, 2005: 342 - 351.
  • 6HU N, PAVLOU P A, ZHANG J. Can online reviews reveal a product's true quality? Empirical findings and analytical modeling of online word-of-mouth communication[ C]//Proceedings of the 7th ACM Conference on Electronic Commerce. New York, NY: ACM, 2006: 324 - 330.
  • 7GHOSE A, IPEIROTIS P G. Designing novel review ranking systems: Predicting the usefulness and impact of reviews [ C ]//Proceedings of the Ninth International Conference on Electronic Commerce. New York, NY: ACM, 2007:303-310.
  • 8JOHANSSON S, ATWELL E, GARSIDE R, et al. The tagged LOB corpus user's manual[J]. Norwegian Computing Centre for the Humanities, 1986 : 10 - 12.
  • 9PANG Bo, LEE Lillian. Thumbs up? sentiment classication using machine learning techniques [ C ]//Proceedings of the ACL - 02 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA: Association for Computational Linguistics, 2002: 79 - 86.
  • 10WIEBE J, WILSON T, CARDIE C. Annotating expressions of opinions and emotions in language [ J ]. Language Resources and Evaluation, 2005, 39(2/3) :165 -210.

二级参考文献13

  • 1刘飞,黎建辉,阎保平.XML Schema在科学数据库元数据互操作中的应用[J].计算机应用研究,2005,22(5):199-201. 被引量:5
  • 2曾一,许娴,张元平.一种基于Schema的XML索引结构[J].计算机工程,2006,32(18):64-66. 被引量:8
  • 3Chan Lois Mai, Zeng Marcia Lei. Metadata interoperability and standardization-a study of methodology part I, achieving interoperability at the schema level. D-Lib Magazine,2006,12(6) :121-123.
  • 4Furo,H.2001.Turn-taking in English and Japanese.New York/London:Routledge.
  • 5Granger,S.1998.The Computer Learner Corpus:A versatile new source of data for SLA research.In S.Granger,ed.,Learner English on Computer.London:Longman.Pp.3-18.
  • 6Pravec,N.A.2002.Survey of learner corpora.ICAME Journal 26,81-114.
  • 7Sinclair,J.1991.Corpus,Concordance,Collocation.Oxford:Oxford University Press.
  • 8Sinclair,J.2002.Corpus Linguistics at the Millennium.In Yang Huizhong (杨惠中),ed.,An Introduction to Corpus Linguistics.Shanghai:Shanghai Foreign Language Education Press.Pp.310-30.
  • 9Sinclair,J.and M.Coulthard.1975.Towards an Analysis of Discourse.London:Oxford University Press.
  • 10Tsui,A.B.M (徐碧美).2000[1994].English Conversation.Shanghai:Shanghai Foreign Language Education Press.

共引文献35

同被引文献92

引证文献1

二级引证文献48

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部