期刊文献+

基于多策略改进蝙蝠算法的文本特征选择

Text Feature Selection Based on Multi-strategy Improved Bat Algorithm
下载PDF
导出
摘要 特征选择是文本分类过程的重要处理步骤,在其他分类预处理环节和分类算法确定的条件下,通过传统特征选择方法很难大幅度提高文本分类的准确率。针对此问题,介绍了一个基于改进蝙蝠优化的新的文本特征选择方法,即利用传统的特征选择方法对原始特征进行预选,在此基础上使用高斯局部扰动和自适应调节权重机制改进传统蝙蝠群算法,并以二进制编码形式对预选特征进行优选,分类准确率作为个体的适应度,提出了多策略改进蝙蝠算法的文本特征选择算法MS-BA,实现对文本特征选择优化模型的高效求解。结果表明,采用MS-BA进行特征优选后,其分类准确率得到有效提高。 Feature selection is an important processing step of the text classification process.It is difficult to greatly improve the accuracy of text classification by traditional feature selection methods , when other classification processing and algorithms are set.Therefore , a new text feature selection method based on improved bat optimization is introduced.It uses traditional feature selection method to pre - select the original features , based on which Gaussian local perturbation and adaptive adjustment weights are used to improve the traditional bat group algorithm.The preference and classification accuracy of pre - selected features is used as the fitness of the individual in binary coding.The multi - strategy improved bat algorithm text feature selection algorithm MS - BA is proposed to realize the efficient solution of text feature selection optimization model.The results show that the classification accuracy of MS - BA is improved.
作者 侯乔 陈宏伟 HOU Qiao;CHEN Hongwei(School of Computer Science, Hubei Univ, of Tech., Wuhan 430068, China)
出处 《湖北工业大学学报》 2019年第5期64-66,71,共4页 Journal of Hubei University of Technology
基金 国家自然科学基金(61772180) 湖北省自然科学基金(2013CFB020)
关键词 特征选择 蝙蝠算法 文本分类 多策略改进 feature selection bat algorithm text classification multi - strategy improvement
  • 相关文献

参考文献7

二级参考文献96

  • 1刘涛,吴功宜,陈正.一种高效的用于文本聚类的无监督特征选择算法[J].计算机研究与发展,2005,42(3):381-386. 被引量:37
  • 2赵世奇,张宇,刘挺,陈毅恒,黄永光,李生.基于类别特征域的文本分类特征选择方法[J].中文信息学报,2005,19(6):21-27. 被引量:21
  • 3苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:383
  • 4黄昌宁 等.对自动分词的反思[A]..语言计算与基于内容的文本处理[C].北京:清华大学出版社,2003,7.26-38.
  • 5ASEERVATHAM S, BENNANI Y. Semi-structured document categorization with a semantic kernel [ J ]. Pattern Recognition, 2009, 42 (9) : 2067-2076.
  • 6LUO P, LIN F, XIONG Y, et al. Towards combining Web classification and Web information extraction: a case study [ C ] //Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2009: 1235-1244.
  • 7EICKHOFF C, SERDYUKOV P, DE VRIES A. Web page classification on child suitability [ C ] //Proceedings of the 19th ACM International Conference on Information and Knowledge Management, 2010: 1425-1428.
  • 8BOLLEN J, PEPE A, MAO H. Modeling public mood and emotion : twitter sentiment and soeio-economic phenomena [ C ] //Proceeding of the 19th International Conference on World Wide Web, 2010.
  • 9THELWALL M, BUCKLEY K, PALTOGLOU G. Sentiment in Twitter events [ J]. Journal of the American Society for Information Science and Technology, 2011, 62 (2) : 406-418.
  • 10PHAN X, NGUYEN L, HORIGUCHI S. Learning to classify short and sparse text & Web with hidden topics from large-scale data collections [ C ] //Proceeding of the 17th International Conference on World Wide Web, 2008: 91-100.

共引文献467

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部