Keyword extraction algorithms for emotion recognition from Uyghur text
(Original title: 基于不同关键词提取算法的维吾尔文本情感辨识)


Abstract: This paper studies emotion recognition for common emotion types such as anger and happiness in Uyghur text, building on a comparison of different keyword extraction methods. Taking into account how emotion is expressed in Uyghur sentences, representative keyword sets are extracted with the TextRank, sparse discriminant analysis (SDA), and sparse support vector machine (Sparse SVM) methods, and these keyword sets are then used for feature extraction and for building the emotion models. Sentences expressing anger or happiness were selected from Uyghur-language film and television dialogue, novels, and similar texts to build an experimental data set and validate the proposed sentiment analysis method. The experiments show that the keyword sets extracted by each of the methods classify the emotion of Uyghur sentences effectively. In particular, the keyword extraction method based on the sparsity analysis of the Sparse SVM achieves relatively high classification accuracy with only a small set of keywords.
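As an illustration of the TextRank step named in the abstract, the following is a minimal sketch of graph-based keyword ranking in plain Python. This record does not give the paper's preprocessing, window size, or graph weighting, so the function name `textrank_keywords`, its parameters, and the toy English tokens are all assumptions made for illustration, not the authors' implementation.

```python
from collections import defaultdict


def textrank_keywords(tokens, window=2, damping=0.85, iterations=50, top_k=5):
    """Rank words by TextRank over an undirected co-occurrence graph."""
    # Link every pair of distinct words that occur within `window` positions.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)

    # Fixed number of PageRank-style updates on the unweighted graph.
    # Each word shares its current score equally among its neighbors.
    scores = {word: 1.0 for word in neighbors}
    for _ in range(iterations):
        scores = {
            word: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[word])
            for word in neighbors
        }

    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


# Hypothetical toy input (English placeholders, not Uyghur data).
tokens = "happy day happy song angry word happy friend".split()
print(textrank_keywords(tokens, window=2, top_k=3))
```

In a setting like the one the abstract describes, one plausible use would be to run such a ranking separately over the sentences of each emotion class and keep the top-ranked words as that class's keyword set. Whether the paper does exactly this is not stated in this record.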

Authors: IMAM Seyyare (赛牙热·依马木), PARHAT Rayilam (热依莱木·帕尔哈提), HAMDULLA Askar (艾斯卡尔·艾木都拉), LI Zhijun (李志军) (Key Laboratory of Signal and Information Processing, Xinjiang University, Urumqi 830046, China)

Affiliation: Key Laboratory of Signal and Information Processing, Xinjiang University

Source: Journal of Tsinghua University (Science and Technology) (《清华大学学报（自然科学版）》), 2017, No. 3, pp. 270-273 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

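Funding: National Social Science Foundation of China (13BYY062); National Natural Science Foundation of China (61163033, 61065005); Program for New Century Excellent Talents in University, Ministry of Education (NCET-10-0969); Xinjiang Uyghur Autonomous Region High-Tech Development Research Program (201312103)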

Keywords: TextRank; sparse discriminant analysis (SDA); sparse support vector machine (Sparse SVM); emotion recognition; Uyghur

Classification number: TP391.1 [Automation and computer technology: computer application technology]


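References (7)

1. Yu Long, Tian Shengwei, Feng Guanjun. Automatic recognition of Uyghur emotion words[J]. Computer Engineering, 2011, 37(7): 213-215.
2. Rayilam Parhat, Meng Xiangtao, Askar Hamdulla. Uyghur text sentiment classification based on a discriminative keyword model[J]. Computer Engineering, 2014, 40(10): 132-136.
3. Huang Jun, Tian Shengwei, Yu Long, Feng Guanjun. Sentence sentiment analysis based on Uyghur emotion words[J]. Computer Engineering, 2012, 38(9): 183-185.
4. Zhang Jing, Jin Hao. Research on automatic determination of the sentiment orientation of Chinese words[J]. Computer Engineering, 2010, 36(23): 194-196.
5. Chen Xiaodong, Lin Huanxiang. Sparse discriminant analysis[J]. Journal of Computer Applications, 2012, 32(4): 1017-1021.
6. Yang Ding, Yang Aimin. A Chinese text sentiment classification method based on a sentiment lexicon and naive Bayes[J]. Application Research of Computers, 2010, 27(10): 3737-3739.
7. Li Juanzi, Fan Qi'na, Zhang Kuo. Keyword Extraction Based on tf/idf for Chinese News Document[J]. Wuhan University Journal of Natural Sciences, 2007, 12(5): 917-921.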
