期刊文献+

基于混合特征的文本分类研究 被引量:2

Research on text classification based on mixed features
下载PDF
导出
摘要 文本分类技术作为文本数据处理的一种重要手段,如何提高文本分类的效率具有重大的意义。基于传统的文本分类技术采用TFIDF响了文本分类效果。本文通过对TFIDF对比实验,提出了一种基于混合特征的分类方法。实验表明该方法在文本分类效果F著提升,证明了本文改进方法的有效性。 Text classification technology is an important method for text data processing,how to improve the efficiency of text classification has great significance.TFIDF algorithm is applied to calculate the weight of traditional text classification technology without considering the distribution of feature items among categories,which affects the effect of text classification. In this paper,an improved TFIDF is proposed and Labeled-LDA model is integrated. Combined with text classification comparison experiment,a classification method based on mixed characteristics is proposed. The experiment shows that this method has significantly improved the F value of text classification effect,which proves the effectiveness of the improved method in this paper.
作者 黄珊珊 廖闻剑 HUANG Shan-shan;LIAO Wen-jian(Wuhan Researtch Institute of Posts and Telecommunications,Wuhan 430070,China;Nanjing fiberhome starrySky Co.Ltd,Nanjing 210019,China)
出处 《电子设计工程》 2019年第7期61-65,共5页 Electronic Design Engineering
关键词 文本分类 TFIDF Labeled-LDA 混合特征 text classification TFIDF Labeled-LDA mixed features
  • 相关文献

参考文献5

二级参考文献53

共引文献176

同被引文献33

引证文献2

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部