
Research on Parallelized Sentiment Classification Algorithms (cited by: 4)
Abstract: When sentiment classification is run on massive data sets, the limited scalability of traditional single-machine sentiment classification algorithms becomes the system bottleneck. We implemented the feature extraction, feature vector weighting, and classification algorithms involved in the sentiment classification task as MapReduce jobs on the Hadoop cloud computing platform. On a sentiment corpus, we compared the precision of sentiment classification under each combination of sub-steps, along with the time cost of each algorithm. The experimental results confirm the effectiveness of the parallelized sentiment classification algorithms and give users a useful reference for choosing a suitable algorithm for their requirements.
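The abstract does not say which feature extraction or weighting schemes were parallelized, so the following is only an illustrative sketch of the map/shuffle/reduce pattern it refers to, not the authors' implementation. It simulates the pattern in plain Python (no Hadoop involved) to compute document frequency, a statistic commonly used in feature selection and weighting. The function names and the toy corpus are hypothetical.

```python
from collections import defaultdict
from itertools import chain


def map_doc(doc_id, text):
    """Map step: emit (term, 1) once per distinct term in a document."""
    return [(term, 1) for term in sorted(set(text.lower().split()))]


def shuffle(pairs):
    """Shuffle step: group emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_df(term, counts):
    """Reduce step: sum the counts to get the term's document frequency."""
    return term, sum(counts)


def document_frequency(docs):
    """Run map, shuffle, and reduce in sequence over a dict of documents."""
    mapped = chain.from_iterable(map_doc(i, t) for i, t in docs.items())
    return dict(reduce_df(k, v) for k, v in shuffle(mapped).items())


# Toy corpus (hypothetical, not taken from the paper).
docs = {
    "d1": "good phone good screen",
    "d2": "bad battery",
    "d3": "good battery",
}
df = document_frequency(docs)
```

On a real cluster the map and reduce functions would run as separate tasks over partitions of the corpus, and the grouping done here by `shuffle` would be handled by the framework.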
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal list, 2013, No. 6, pp. 206-210 (5 pages)
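Funding: Supported by the National Natural Science Foundation of China (61035003), the International Science and Technology Cooperation Program of the Ministry of Science and Technology (2010DFA11030), and the Natural Science Foundation of Jiangsu Province (BK2010054).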
Keywords: sentiment classification, Hadoop, cloud computing, MapReduce

