摘要
面对某些热点事件,微博评论者经常使用反讽来表达对于该事件的看法,以往的情感分析任务往往忽略这一语言现象。为了提高微博情感分析的准确率,该文对反讽识别开展了研究。通过分析中文文本的语言现象和社交网络的特性,归纳了中文微博反讽的语言特征,提出了一种融合语言特征的卷积神经网络(CNN)的反讽识别方法。该方法将反讽特征和句子分别采用Word Embedding作为输入,再卷积、池化后,将其全连接融合,构建了新的卷积神经网络模型。实验结果表明,该方法在反讽识别的性能上优于传统的基于机器学习的方法。
Irony is popular in Weibo comments, which is less addressed in past sentiment analysis community. To improve the accuracy of Weibo sentiment analysis,we study irony recognition in this paper. By analyzing the characteristics of Chinese language and social networks, irony linguistic features are summarized. Combining these linguistic features with convolutional neural networks (CNN),a novel method is proposed for recognizing irony. The proposed method combines irony features representation and sentences representation using word embedding as the input of a convolution network. The experimental results indicate that the proposed method is superior to classical machine learning methods for irony recognition.
作者
卢欣
李旸
王素格
LU Xin;lI Yang;WANG Suge(School of Computer and Information Technology,Shanxi University,Taiyuan ,Shanxi 030006 ,China)
出处
《中文信息学报》
CSCD
北大核心
2019年第5期31-38,共8页
Journal of Chinese Information Processing
基金
国家自然科学基金(61632011
61672331
61573231
61432011
61603229)
山西省重点研发计划(201803D421024)
关键词
微博
反讽识别
卷积神经网络
语言特征
Weibo
irony recognition
convolutional neural networks
linguistic features