基于粗糙集和蚁群优化算法的特征选择方法被引量：19

A method for feature selection based on rough sets and ant colonyoptimization algorithm

下载PDF

导出

摘要特征选择在许多领域特具有重要的作用.本文将粗糙集方法和蚁群优化算法相结合,提出一种基于粗糙集蚁群优化方法的特征选择的算法.该算法以属性依赖度和属性重要度作为启发因子应用于转移规则中,用粗糙集方法的分类质量和特征子集的长度构建信息素更新策略.通过对数据集的测试,结果表明所提出的方法是可行的. Feature selection has become the focus of research in the field of data mining,machine learning,pattern recognition and so on.Feature selection uses a more stable set and appropriate precision characteristics to describe the original feature set.Feature selection research has focused on two aspects： one is for the search strategy of the subset and the other is the performance evaluation feature subset.Therefore,the research on more effective feature selection algorithm,to obtain the better feature subset,to reduce the time complexity of the algorithm,and to find the fast feature selection algorithm,is still the focus of the study of feature selection.According to the defects and deficiencies of the current algorithm,by analyzing the advantages and disadvantages of the existing algorithms,the current shortcomings and deficiencies of methods have been found to propose a new method for feature selection which combined the rough set method and ant colony optimization algorithm.To improve the algorithm＇s performance,the core attribute as the start of the feature selection.In the transfer rules and the pheromone update strategy,this algorithm uses rough set dependency and attributes significance to guide the ants search process to improve the performance of the algorithm.In addition,the quality of classification based on rough set method and the length of the feature subset are used to measure the strengths and weaknesses of feature subset.By choosing a data set with certain number of data and attributes the proposed method is tested to compare with the feature selection method based on rough set and the feature selection method based on ant colony optimization.Testing and comparison results show that the proposed method is feasible and this method has obvious advantages in the indicators feature subset length and accuracy when the data set have core attributes.Finally,the given example and testing in real datasets show that the proposed method is effective.

作者王璐邱桃荣何妞刘萍

机构地区南昌大学计算机系

出处《南京大学学报（自然科学版）》 CAS CSCD 北大核心 2010年第5期487-493,共7页 Journal of Nanjing University（Natural Science）

基金国家自然科学基金(50863003 61070139) 江西省教育厅科技资助项目(赣教技字[GJJ08042]号)

关键词粗糙集特征选择蚁群算法 rough sets feature selection ant colony algorithm

分类号 TP39 [自动化与计算机技术—计算机应用技术] TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献19

1Liu H, Motoda H. Feature selection for knowledge discovery and data mining. Kluwer: Academic Publishers, 1998, 214.
2Guyon I, Elisseeff A. An introduction to variable and feature selection. Journal of Machine Learning Research, 2003, 3:1157-1182.
3Kudo M, Sklansky J. Comparison of algorithms that select features for pattern classifiers. Pattern Recognition, 2000, 33 (1): 25-41.
4Sun Z H, Bebis G, Miller R. Obieet detection using feature subset selection. Pattern Reeognition, 2004, 37(11):2165-2176.
5Jain A K, Duin R D W, Mao J C. Statistical pattern recognition: A review. Institute of Electrieal and Electronics Engineers Transaction Pattern Analysis and Machine Intelligence, 2000, 22(1): 4-37.
6Kudo M, Sklansky J. Comparison of algorithms that select features for pattern classifiers. Pattern Recognition, 2000,33 (1) : 25-41.
7Chen X W. An improved branch and bound al gorithm for feature selection. Pattern Recognition Letters, 2003, 24(12):1925-1933.
8王凌．智能优化算法及其应用．北京：清华大学出版社，2004
9Wu B L, Abbott T, Fishman D, et al. Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data. Bioin for Maties, 2003,19(13) :1636-1643.
10Swiniarski R W, Skowron A. Rough set methods in feature selection and recognition. Pattern Recognition Letters, 2003, 24(6): 833-849.

二级参考文献36

1杨建林.基于文献集相似度的分类方法[J].情报学报,1999,18(S1):92-94. 被引量：5
2林春燕,朱东华.科学文献的模糊聚类算法[J].计算机应用,2004,24(11):66-67. 被引量：9
3Casillas A,Gonzdlez de Lena M T,Martínez R.Document clustering into an unknown number of clusters using a genetic algorithm.International Conference on Text Speech and Dialogue,2003,43-49.
4Selim S Z,Ismail M A.K-means-type algorithms:a generalized convergence theorem characterization of local optimality.IEEE Transactions Pattern Analysis and Machine Intelligence,1984,6(1):81-87.
5Bradley P S,Fayyad U M.Refining initial points for K-means clustering.Advance in Knowledge Discovery and Data Mining.Cambridge:MIT Press,1996.
6Raymond T N,Han J W.Efficient and effective clustering methods for spatial data mining.Proceeding of the 20th VLDB Conference Santiago,Chile,1994,144-155.
7Shi Z.Efficient online spherical K-means lustering.Proceedings of the 2005 IEEE International Joint Conference on Neural Networks.Montreal,IEEE Press,2005,3180-3185.
8Gareth J,Alexander M R,Chawchat S,et al.Non-hierarchic document clustering using a genetic algorithm.Information Research,1995,1(1).
9Pearson R, Coney G, Shwaber J. Imbalanced clustering for microarray time-series. Proceedings of the ICML' 03 Workshop on Learning from Imbalaneed Data Sets. Washington, DC,2003.
10Wu G, Chang E Y. Class-boundary alignment for imbalanced dataset learning. Proceedings of the ICML' 03 Workshop on Learning from Imbalanced Data Sets. Washington, DC, 2003.

共引文献272

1朱益江.自适应蚁群算法在Flow Shop调度问题上的应用研究[J].常州工学院学报,2007,20(6):42-45.
2陈莉,陈晓云,胡山立.基于蚁群算法的组合拍卖胜者决定问题求解[J].计算机研究与发展,2006,43(z1):69-73. 被引量：4
3武晶杰,潘丽君,张峰刚.虚拟战场环境中武装直升机航路规划算法研究[J].系统仿真学报,2006,18(z2):721-723. 被引量：3
4石为人,余兵,张星.单机作业下的提前/脱期问题的蚁群调度优化算法[J].仪器仪表学报,2003,24(z2):690-692. 被引量：1
5刘延明,方崇,陆克芬,周欢.蚁群投影寻踪回归在农田灌溉水质评价中的应用[J].贵州农业科学,2009,37(9):61-64. 被引量：5
6李志宇,史浩山.基于最小Steiner树的无线传感器网络数据融合算法[J].西北工业大学学报,2009,27(4):558-564. 被引量：6
7孙宏,詹士昌,金柏林.自适应进化的蚁群算法及其仿真研究[J].杭州师范学院学报（自然科学版）,2003,2(5):31-34. 被引量：4
8叶志伟,周欣,夏彬.蚁群算法研究应用现状与展望[J].吉首大学学报（自然科学版）,2010,31(1):35-39. 被引量：2
9胡启国,胡小华,吴泳龙.改进蚁群算法在系统可靠度最优冗余分配的应用[J].重庆交通大学学报（自然科学版）,2013,32(3):543-546. 被引量：8
10林舒杨,李翠华,江弋,林琛,邹权.不平衡数据的降采样方法研究[J].计算机研究与发展,2011,48(S3):47-53. 被引量：31

同被引文献214

1刘启和,李凡,闵帆,叶茂,杨国纬.一种基于新的条件信息熵的高效知识约简算法[J].控制与决策,2005,20(8):878-882. 被引量：31
2刘素华,侯惠芳,李小霞.基于遗传算法和模拟退火算法的特征选择方法[J].计算机工程,2005,31(16):157-159. 被引量：14
3张静,王建民,何华灿.基于属性相关性的属性约简新方法[J].计算机工程与应用,2005,41(28):55-57. 被引量：18
4韩松来,张辉,周华平.基于关联度函数的决策树分类算法[J].计算机应用,2005,25(11):2655-2657. 被引量：36
5高海昌,冯博琴,朱利b.智能优化算法求解TSP问题[J].控制与决策,2006,21(3):241-247. 被引量：120
6徐章艳,刘作鹏,杨炳儒,宋威.一个复杂度为max（O（｜C｜｜U｜），O（｜C^2｜U／C｜））的快速属性约简算法[J].计算机学报,2006,29(3):391-399. 被引量：234
7刘胥影,吴建鑫,周志华.一种基于级联模型的类别不平衡数据分类方法[J].南京大学学报（自然科学版）,2006,42(2):148-155. 被引量：23
8任永功,王杨,闫德勤.基于遗传算法的粗糙集属性约简算法[J].小型微型计算机系统,2006,27(5):862-865. 被引量：32
9陈思睿,张永,杨志勇.基于粗糙集的特征选择方法的研究[J].计算机工程与应用,2006,42(21):159-161. 被引量：7
10周江卫,冯博琴,刘洋.粗糙集高效遗传约简算法[J].西安交通大学学报,2007,41(4):444-447. 被引量：8

引证文献19

1陈玉明,吴克寿,孙金华.基于幂树的决策表最小属性约简[J].南京大学学报（自然科学版）,2012,48(2):164-171. 被引量：5
2黄宇达,王迤冉.基于朴素贝叶斯与ID3算法的决策树分类[J].计算机工程,2012,38(14):41-43. 被引量：19
3黄宇达,范太华.决策树ID3算法的分析与优化[J].计算机工程与设计,2012,33(8):3089-3093. 被引量：16
4于洪,姚园,赵军.一种有效的基于风险最小化的属性约简算法[J].南京大学学报（自然科学版）,2013,49(2):210-216. 被引量：6
5顾沈明,叶晓敏,吴伟志.多标记粒度不完备信息系统的粗糙近似[J].南京大学学报（自然科学版）,2013,49(2):250-257. 被引量：4
6孙佳瑶,詹永照,毛启容,王敏超.基于遗传算法的交通视频事件多特征选择方法[J].微电子学与计算机,2013,30(7):42-46.
7王忠民,曹栋.基于蚁群算法的行为识别特征优选方法[J].西安邮电大学学报,2014,19(1):73-77. 被引量：21
8刘萍,邱桃荣,段文影.基于粗糙集的数据发布多约束匿名保护方法[J].计算机工程与设计,2014,35(8):2769-2772. 被引量：1
9戴志聪,吴伟志.不完备多粒度序信息系统的粗糙近似[J].南京大学学报（自然科学版）,2015,51(2):361-367. 被引量：11
10朱凯峰.电力企业物资管理存在的问题与对策[J].科技与企业,2015(15):43-43. 被引量：2

二级引证文献112

1沈夏炯,薛钰,韩道军,张磊.访问控制系统中客体粒度决策方法研究[J].河南大学学报（自然科学版）,2020,0(1):63-69. 被引量：1
2程玉胜,江效尧,胡林生.基于粗糙集理论的协调集及其决策树构造[J].南京大学学报（自然科学版）,2012,48(6):790-796. 被引量：3
3池静,杨振宇,张婷.基于贝叶斯和决策树的入侵检测方法[J].工矿自动化,2013,39(2):62-65. 被引量：3
4姚双良.基于主题的Deep Web聚焦爬虫研究与设计[J].西北师范大学学报（自然科学版）,2013,49(2):40-43. 被引量：2
5于洪,姚园,赵军.一种有效的基于风险最小化的属性约简算法[J].南京大学学报（自然科学版）,2013,49(2):210-216. 被引量：6
6顾沈明,叶晓敏,吴伟志.多标记粒度不完备信息系统的粗糙近似[J].南京大学学报（自然科学版）,2013,49(2):250-257. 被引量：4
7张建华.知识管理自学习案例的自组织机制与检索算法研究[J].情报杂志,2013,32(12):194-199. 被引量：3
8徐宁,魏晓,章云.基于分类知识结构的最小约简算法[J].计算机应用与软件,2014,31(2):271-274.
9赵刚,王碰,王鑫,金文斌,吴晓婷.基于决策树的二维码恶意网址检测方法[J].信息安全与技术,2014,5(2):36-39. 被引量：3
10徐健锋,张远健,Zhou Duanning,Li Dan,李宇.基于粒计算的不确定性时间序列建模及其聚类[J].南京大学学报（自然科学版）,2014,50(1):87-94. 被引量：7

1宋雪梅,李兵.蚁群优化算法的理论、改进及应用[J].唐山学院学报,2006,19(1):87-88.
2艾凌云.基于蚁群算法和粗糙集方法的图像聚类分析研究[J].西北大学学报（自然科学版）,2011,41(5):808-812. 被引量：2
3于洪,杨大春.基于蚁群优化的多个属性约简的求解方法[J].模式识别与人工智能,2011,24(2):176-184. 被引量：9
4蒋志年.数字图像散斑相关技术的蚁群优化方法[J].应用光学,2012,33(3):527-531. 被引量：2
5路秀英,崔兴凯,霍新丽.求解多目标资源分配问题的改进蚁群优化算法[J].微电子学与计算机,2011,28(10):87-90. 被引量：4
6张钰,袁同山,王哲,江静.基于组件式GIS及领域本体的计算机辅助消防救援系统[J].计算机测量与控制,2010,18(1):214-216. 被引量：3
7黄风立,顾金梅,张礼兵,徐春光,王海燕.基于禁忌制造特征动态调整的STEP-NC工艺路线蚁群优化方法[J].中国机械工程,2016,27(5):596-602. 被引量：5
8张艳霞,宋俊辉,王丹宁.一种工程混合免疫计算的多峰值函数优化方法[J].信阳师范学院学报（自然科学版）,2013,26(1):140-142.
9陈泽恩.基于ACO-LSSVM的城市火灾预测模型仿真[J].计算机仿真,2014,31(1):348-351. 被引量：3
10王小明,安小明.具有能量和位置意识基于ACO的WSN路由算法[J].电子学报,2010,38(8):1763-1769. 被引量：14

南京大学学报（自然科学版）

2010年第5期

浏览历史

内容加载中请稍等...

基于粗糙集和蚁群优化算法的特征选择方法被引量：19

参考文献19

二级参考文献36

共引文献272

同被引文献214

引证文献19

二级引证文献112

相关作者

相关机构

相关主题

浏览历史

基于粗糙集和蚁群优化算法的特征选择方法 被引量：19

参考文献19

二级参考文献36

共引文献272

同被引文献214

引证文献19

二级引证文献112

相关作者

相关机构

相关主题

浏览历史

基于粗糙集和蚁群优化算法的特征选择方法被引量：19