期刊文献+

融入混沌与对立学习机制的二进制粒子群特征选择算法 被引量:2

A FEATURE SELECTION ALGORITHM OF BINARY PARTICLE SWARM OPTIMIZATION BASED ON CHAOS AND OPPOSITION-LEARNING MECHANISM
下载PDF
导出
摘要 为了实现特征空间降维,提高文本聚类准确性,提出一种融入混沌与对立学习的二进制粒子群优化特征选择算法。设计了新的词条权重计算方法,将文本数据表达为矢量空间模型;提出改进二进制粒子群算法求解特征选择问题,引入混沌系统和对立学习机制对粒子随机搜索方向和初始种群分布分别进行优化;在评估粒子适应度中引入词条方差和平均中位数两种方法对特征子集评估,并设计特征合并和交叉机制融合两种适应度的优势,生成最优特征子集;利用K均值算法对特征选择的文本进行聚类。结果表明,该算法在特征降维、聚类准确率、F度量值上均优于同类算法,可以有效实现特征空间降维并提升文本聚类性能。 In order to achieve the feature space dimension reduction and improve the accuracy of the text clustering,a feature selection algorithm of binary particle swarm optimization based on chaos and opposite-learning is proposed.We designed a new term weight calculation method and expressed the text datasets as the vector space model.We presented an improved binary particle swarm optimization to solve the problem of feature selection,and chaotic system and opposition-learning mechanism were introduced to optimize the random search direction and initial population distribution of particles respectively.In the evaluation of particle fitness,the term variance and mean median were introduced to evaluate feature subset,and we designed characteristics of the merger and crossover mechanism combined with the advantage of two kinds of fitness to generate the optimal feature subset.The k-means algorithm was used to carried on the text clustering analysis for feature selection.The results show that our algorithm performs better than the similar algorithms on the feature dimension reduction,clustering accuracy and F measurements,which can effectively achieve feature space dimensionality reduction and promote text clustering performance.
作者 袁明锋 步中华 王强 Yuan Mingfeng;Bu Zhonghua;Wang Qiang(Department of Big Data and Information Industry,Chongqing Vocational College of Light Industry,Chongqing 401329,China;Qingdao Full Big Technology Co.,Ltd.,Qingdao 266580,Shandong,China;School of Information,Qingdao University of Science and Technology,Qingdao 266580,Shandong,China)
出处 《计算机应用与软件》 北大核心 2022年第10期274-284,306,共12页 Computer Applications and Software
基金 山东省自然科学基金项目(2018080712) 教育部产学合作项目(2017HX00223)。
关键词 特征选择 二进制粒子群优化 混沌映射 对立学习 文本聚类 Feature selection Binary particle swarm optimization Chaos mapping Opposition-learning Text clustering
  • 相关文献

参考文献3

二级参考文献34

共引文献78

同被引文献46

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部