Abstract
Text classification is an important technique for processing text data, and improving its efficiency is of great significance. Traditional text classification computes feature weights with the TFIDF algorithm, which does not consider how feature items are distributed among categories, and this degrades classification performance. This paper proposes an improved TFIDF and integrates the Labeled-LDA model. Based on comparative text classification experiments, it proposes a classification method based on mixed features. The experiments show that the method significantly improves the F value of text classification, which demonstrates the effectiveness of the improved method.
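The limitation named in the abstract, that TFIDF weights ignore how a term is distributed among categories, can be illustrated with a minimal sketch. It implements only the textbook TF-IDF formula, tf(t, d) * log(N / df(t)), and is not the improved TFIDF or the Labeled-LDA integration from the paper. The toy corpus, the labels, and the function name `tfidf` are hypothetical and chosen purely for illustration.

```python
import math
from collections import Counter

def tfidf(docs):
    """Textbook TF-IDF: tf(t, d) * log(N / df(t)). Class labels are never used."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights

# Hypothetical toy corpus: four tokenized documents in two classes.
docs = [
    ["goal", "match", "news"],    # sports
    ["goal", "team", "report"],   # sports
    ["stock", "market", "news"],  # finance
    ["stock", "bank", "report"],  # finance
]
labels = ["sports", "sports", "finance", "finance"]  # not consulted by tfidf()

weights = tfidf(docs)

# "goal" occurs only in sports documents, while "news" is split evenly
# between the two classes. Both have document frequency 2, so TF-IDF
# assigns them the same weight in the first document.
print(weights[0]["goal"], weights[0]["news"])
```

In this sketch "goal" is clearly more useful for telling the classes apart than "news", yet the two receive identical weights, which is the kind of gap a category-aware weighting scheme is meant to close.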
Authors
HUANG Shan-shan (黄珊珊)
LIAO Wen-jian (廖闻剑)
Wuhan Research Institute of Posts and Telecommunications, Wuhan 430070, China; Nanjing FiberHome StarrySky Co., Ltd., Nanjing 210019, China
Source
Electronic Design Engineering (《电子设计工程》)
2019, No. 7, pp. 61-65 (5 pages)