期刊文献+

基于聚类和二进制PSO的特征选择 被引量:6

A Novel Algorithm Based on K-Means Clustering and Binary Particle Swarm Optimization
下载PDF
导出
摘要 特征选择是模式识别及数据挖掘等领域的重要问题之一。特征选择不但可以提高分类精度和效率,也可以找出富含信息的特征子集。针对此问题,在分析了常用的一些特征选择算法之后,文中提出一种基于聚类和二进制PSO算法的特征选择方法,首先基于特征之间的相关性聚类来进行特征分组及筛选,然后针对经过筛选而精简的特征子集采用二进制粒子群算法进行随机搜索。实验结果表明,该算法可有效地找出具有较好的线性可分离性的特征子集,具有特征精简幅度较大、运行效率较高等优点。 Feature selection is one of the important problems in the pattern recognition and data mining areas.For high dimensional data feature selection not only can improve the accuracy and efficiency of classification,but also can discover informative feature subset.The new feature selection method combining k-means and PSO was proposed in this paper,which first filters feature by k-means,and realized the near optimal feature subset search on the compact feature subset by PSO algorithm.The experiments show that the proposed algorithm can get a good compact feature subset and run more efficiently.
出处 《计算机技术与发展》 2010年第6期25-28,共4页 Computer Technology and Development
基金 国家自然科学基金资助项目(60773013)
关键词 特征选择 K均值算法 相关性 粒子群优化算法 feature selection k-means algorithm relevance PSO
  • 相关文献

参考文献12

二级参考文献35

  • 1张铃,张钹,吴福朝.神经网络的规划学习算法[J].计算机学报,1994,17(9):669-675. 被引量:13
  • 2张燕平,张铃,吴涛.机器学习中的多侧面递进算法MIDA[J].电子学报,2005,33(2):327-331. 被引量:26
  • 3乔立岩,彭喜元,马云彤.基于遗传算法和支持向量机的特征子集选择方法[J].电子测量与仪器学报,2006,20(1):1-5. 被引量:23
  • 4陈彬,洪家荣,王亚东.最优特征子集选择问题[J].计算机学报,1997,20(2):133-138. 被引量:96
  • 5张铃,张钹.多层反馈神经网络的FP学习和综合算法[J].软件学报,1997,8(4):252-258. 被引量:24
  • 6Yang Yirning, Pederson J O. A Comparative Study on Feature Selection in Text Categorization[C]//Proceedings of the 14th International Conferenee on Machine learning. Nashville: Morgan Kaufmann, 1997:412 - 420.
  • 7Ding C, Peng Hanchuan. Minimum redundancy feature selection from microarray gelle expression data[C]//Proceeding of Second IEEE Computational Systems Bioinformaties Conference.LosA Lamitos, USA: IEEE Computer Society Press, 2003: 523 - 528.
  • 8Peng Hanchuan,Long Fuhui,Ding C. Feature .Selection Based on Mutual Information Criteria of Max - Dependency Max - Relevance and Min-Redundancy[J ]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005,27 (8):1226 - 1238.
  • 9Frakes W B. Steaming Algofithms[C]//Frakes W B,Baeze - Yates B. In Information Retrieval:Data Structure & Algorithms. [ s. l. ]:T P R Prentice Hall, 1992:131 - 160.
  • 10Salton G, Wong A, Yang C S. On the specification of term values in automatic Indexing[J]. Journal of Documentation, 1973,29(4) :351 - 372.

共引文献66

同被引文献51

  • 1李颖新,阮晓钢.基于支持向量机的肿瘤分类特征基因选取[J].计算机研究与发展,2005,42(10):1796-1801. 被引量:51
  • 2段中兴,张德运.基于误码率的模糊加权无线网络公平调度算法[J].西安交通大学学报,2005,39(12):1303-1306. 被引量:1
  • 3常建龙,曹锋,周傲英+.基于滑动窗口的进化数据流聚类[J].软件学报,2007,18(4):905-918. 被引量:60
  • 4Zadeh L A. Fuzzy sets[ J]. Information and Control, 1965, 8:338-353.
  • 5秦克云 徐扬.L型直觉模糊集.兰州大学学报,1996,32:352-355.
  • 6Li Deng-Feng. Some measures of dissimilarity in intuitionistic fuzzy structures[ J]. Journal of Computer and System Sciences. 2004(1) :115-122.
  • 7Atanassov K T. New operations defined over the intuitionistic fuzzy sets[J]. Fuzzy Sets and Systems, 1994,61:137- 142.
  • 8McQueen J. Some Methods for Classification and Analysis of Multivariate Observations [ C ]//In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probabili- ty. [s. l. ]:[s. n. ] ,1967:281-297.
  • 9Guha S,Rastogi R,Shim K. ROCK:A Roust Clustering-A Fil- ter Solution [ C ]//In Proceeding of the 2nd IEEE International Conference on Data Mining ( ICDM' 02 ). Maebashi City, Ja- pan : [ s. n.] ,2002 : 15-122.
  • 10Aggarwal C, Hart J, Wang J, et al. A framework for cluste- ring evolving data streams [ C ]//In:VLDB 2003. [ s. l. ] :[ s. n. ] ,2003:81-92.

引证文献6

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部