期刊文献+

基于双层树状支持向量机的观点挖掘与倾向分析 被引量:3

View mining and trend analysis based on double-layer tree Support Vector Machine
下载PDF
导出
摘要 本文通过进行大量预处理工作,将经过词袋模型和Word2Vec两种不同向量化方法处理后的文本数据分别输入到SVM和LSTM模型中,训练出可以识别文本情感倾向的模型。进而对新产生的评论进行分类。根据实际数据量的倾斜状况,基于传统机器学习算法支持向量机(SVM),本文提出双层支持向量机,采用2种不同的方法分别训练模型并预测。最后再使用深度学习算法长短时记忆模型(LSTM)再次训练并预测,并对这3种方法做出比较和总结。结果显示,双层SVM比单层SVM的准确度提高了8个百分点;而LSTM比单层SVM低了2个百分点,比双层SVM低了接近10个百分点。 In this paper,a large amount of preprocessing work is carried out,and the text data processed by the following two different vectorization methods as the word bag model and Word2 Vec are input into the SVM and LSTM models,respectively to train a model that can recognize the emotional tendency of the text.Further the newly generated comments are classified.According to the tilt of the actual data volume,based on support vector machine(SVM)that is the traditional machine learning algorithm,this paper proposes a two-layer support vector machine,using two different methods to train the model and predict.Thus,the deep learning algorithm long-term memory model(LSTM)is used to train and predict again,and the three methods are compared and summarized.The results show that the accuracy of the two-layer SVM is 8 percentage points higher than that of the single-layer SVM;while the LSTM is two percentage points lower than the single-layer SVM,which is nearly 10 percentage points lower than the double-layer SVM.
作者 孙红 黎铨祺 赵娜 SUN Hong;LI Quanqi;ZHAO Na(School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China;Shanghai Key Lab of Modern Optical System(University of Shanghai for Science and Technology),Shanghai 200093,China)
出处 《智能计算机与应用》 2021年第3期44-47,共4页 Intelligent Computer and Applications
关键词 商品评论 网络爬虫 SVM LSTM 情感分类 数据挖掘 product reviews Web crawler SVM LSTM emotion classification data mining
  • 相关文献

参考文献10

二级参考文献91

  • 1王兴玲,李占斌.基于网格搜索的支持向量机核函数参数的确定[J].中国海洋大学学报(自然科学版),2005,35(5):859-862. 被引量:123
  • 2徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 3Fung B C M,Wang K,Ester M.Hierarchical document clustering//Wang John ed.The Encyclopedia of Data Warehousing and Mining,idea Group.2005:970-975.
  • 4Salton G.The SMART Retrieval System-Experiments in Automatic Document Processing.Englewood Cliffs,New Jersey:Prentice Hall Inc,1971.
  • 5Wang Y,Julia H.Document clustering with semantic analysis//Proceedings of the 39th Hawaii International Conferences on System Sciences.Hawaii,US,2006:54-63.
  • 6Hotho A,Staab S,Stumme G.Wordnet improves text document clustering//Proceedings of the Semantic Web Workshop at SIGIR-2003,26th Annual International ACM SIGIR Conference.Toronto,Canada,2003:541-550.
  • 7Hall P,Dowling G.Approximate string matching.Computing Survey,1980,12(4):381-402.
  • 8Coelho T,Calado P,Souza L,Ribeiro-Neto B,Muntz R.Image retrieval using multiple evidence ranking.IEEETransactions on Knowledge and Data Engineering,2004,16(4):408-417.
  • 9Ko Y,Park J,Seo J.Improving text categorization using the importance of sentences.lnformation Processing and Management,2004,40(1):65-79.
  • 10Erkan G,Radev D.Lexrank:Graph-based lexical centrality as salience in text summarization.Journal of Artificial Intelligence Research,2004,22(7):457-479.

共引文献408

同被引文献25

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部