期刊文献+

基于LDA的主题分类系统研究

Study on the topic classification system based on LDA
下载PDF
导出
摘要 当前人类处于信息爆炸的时代,对于海量的文本数据,可以利用人工智能的工具来提高数据分析处理的效率,来挖掘海量数据的宝藏。文章主要对文本的主题分类算法进行研究,通过改进分类方法并提出可视化方案,使主题分类具有更好的应用价值。首先通过利用LDA主题分类算法进行处理,并提出了一些改进方法使分类效果更优,并最终生成可视化的主题分类结果,进而用于推荐系统、数据挖掘、数据分析等领域。 At present, human beings are in the era of information explosion. For massive text data, artificial intelligence tools can be used to improve the efficiency of data analysis and processing, and to excavate the treasure of massive data. This paper mainly focuses on the research of text topic classification algorithm. By improving the classification method and putting forward the visualization scheme, the topic classification has better application value. First, LDA theme classification algorithm is applied to deal with it, and some improvement methods are put forward to make the classification effect better, and finally generate visual subject classification results, which is further applied to the fields of recommender system, data mining, data analysis and so on.
作者 郭英杰 千博
出处 《无线互联科技》 2018年第3期61-62,共2页 Wireless Internet Technology
关键词 自然语言处理 主题分类 数据可视化 natural language processing topic classification data visualization
  • 相关文献

参考文献3

二级参考文献25

  • 1韩维良.汉语自动分词系统中切分歧义与未登录词的处理策略[J].青海师范大学学报(自然科学版),2004,20(2):31-34. 被引量:3
  • 2YU Fei,SHEN Yue,AN Ji-yao,ZHANG Ling-fen,ZHU Miao-liang.Information Audit Based on Image Content Filtering[J].Wuhan University Journal of Natural Sciences,2006,11(1):234-238. 被引量:3
  • 3吴慧玲,沈建京,贺广生.基于不良文本信息过滤预处理方法的研究[J].网络安全技术与应用,2006(11):61-63. 被引量:2
  • 4Wise J A, Pennock K, Lantrip D, et al. Visualizing the Non - visual: Spatial Analysis and Interaction with Information from Text Documents [ C ]. Proceedings on Information Visualization 1995.
  • 5Mladenic M G D. Visualization of News Articles [ EB/OL]. [2008 -06 - 12 ]. http://eprints, pascal - network, org/archive/ 00000742/01/GrobelnikMladenic - Contexter. pdf.
  • 6Hearst M A. TileBars : Visualization of Term Distribution Information in Full Text Information Access [ C ]. In:Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 1995 : 59 - 66.
  • 7TileBars Examples [ EB/OL ]. [ 2008 - 06 - 12 ]. http ://people. ischool, berkeley, edu/-hearst/images/tb - example, html.
  • 8Weber W. Text Visualization - What Colors Tell About a Text[ C ]. In : Proceedings of the 11 th International Conference Information Visualization, 2007 : 354 - 362.
  • 9Leskovec J, Grobelnik M, Milic - Frayling N. Learning Sub - structures of Document Semantic Graphs for Document Summarization [ C ]. LinkKDD. 2004.
  • 10Paley W B. TextAre:Showing Word Frequency and Distribution in Text[ C]. IEEE Symposium on Information Visualization. 2002.

共引文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部