分治策略下的代价敏感属性选择回溯算法被引量：1

Backtracking Algorithm for Cost-Sensitive Feature Selection Based on Divide and Conquer Strategy

下载PDF

导出

摘要代价敏感属性选择是数据挖掘的一个重要研究领域,其目的在于通过权衡测试代价和误分类代价,获得总代价最小的属性子集。针对经典回溯算法运行时间较长的缺点,结合分治思想,提出了一种改进的回溯算法。改进算法引入了两个相关参数,根据数据集规模自适应调整参数,并按参数大小拆分数据集,降低问题规模,以提高经典回溯算法的执行效率。针对较大规模数据集的实验结果表明,与经典的回溯算法相比,改进算法在保证效果的同时至少提高20%的运算效率;与启发式算法相比,改进算法在保证效率的同时取得了具有更小总代价的属性集合,可应用于实际问题。 Cost-sensitive feature selection is an important research field in the process of data mining. It aims at obtaining an attribute subset of the lowest total cost, through balancing test cost and misclassification cost. According to the shortcoming of the classical backtracking algorithm with longer running time, combining divide and conquer thought,this paper proposes an improved backtracking algorithm. Introducing two related parameters, this algorithm computes adaptively parameters according to the dataset scale, and splits the dataset with these parameters. It can enhance the efficiency of the classical backtracking algorithm by reducing the problem size. The experiments on the datasets with large scale show that this improved algorithm is effective and meets the need of practical problems. At the same time guaranteeing the effect, this improved algorithm promotes the efficiency of 20% at least than the classical backtracking algorithm. Compared with heuristic algorithm, this improved algorithm obtains an attribute set with a smaller total cost and ensures the efficiency.

作者黄伟婷赵红祝峰

机构地区闽南师范大学计算机学院闽南师范大学粒计算及其应用重点实验室

出处《计算机科学与探索》 CSCD 北大核心 2016年第10期1451-1458,共8页 Journal of Frontiers of Computer Science and Technology

基金国家自然科学基金 Nos.61379049 61379089 61170128 福建省科技计划重点项目 No.2012H0043 漳州市自然科学基金 No.ZZ2016J35~~

关键词粗糙集粒计算代价敏感属性选择自适应分治 rough sets granular computing cost-sensitive feature selection adaptive divide and conquer

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1林姿琼,赵红.代价敏感最优误差边界选择[J].计算机科学与探索,2013,7(12):1146-1152. 被引量：2
2李华雄,周献中,黄兵,赵佳宝.决策粗糙集与代价敏感分类[J].计算机科学与探索,2013,7(2):126-135. 被引量：11

二级参考文献16

1Lin T Y. Granular computing on binary relations I: data mining and neighborhood systems[J]. Rough Sets in Knowledge Discovery, 1998, 1: 107-121.
2Domingos P. MetaCost: a general method for making classi- tiers cost-sensitive[C]//Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '99). New York, NY, USA: ACM, 1999: 155-164.
3Du Yong, Hu Qinghua, Zhu Pengfei, et al. Rule learning fl classification based on neighborhood covering reduction[Jt Information Sciences, 2011, 181 (24): 5457-5467. |.
4Fan Wei, Stolfo S, Zhang Junxin, et al. AdaCost: misclassifi- cation cost-sensitive boosting[C]//Proceedings of the 16th Intemational Conference on Machine Learning (ICML '99). San Francisco, CA, USA: Morgan Kaufmann Publishers Inc, 1999: 97-105.
5Kryszkiewicz M. Comparative studies of alternative type of knowledge reduction in inconsistent systems[J]. International Journal of Intelligent Systems, 2001, 16(1): 105-120.
6[ Zhao Hong, Min Fan, Zhu W. Test-cost-sensitive attribute reduction based on neighborhood rough set[C]//Proceedingsof the 2011 IEEE Intemational Conference on Granular Computing (GrC' 11), Nov 8-10, 2011: 802-806.
7Tumey P D. Cost-sensitive classification: empirical evaluation of a hybrid genetic decision tree induction algorithm[J]. Journal of Artificial Intelligence Research, 1995, 2(1): 369-409.
8Yao Yiyu, Wong S K M. A decision theoretic framework for approximating concepts[J]. International Joumal of Man- Machine Studies, 1992, 37(6): 793-809.
9Yao Yiyu, Zhao Yan. Attribute reduction in decision-theoretic rough set models[J]. Information Sciences, 2008, 178(17): 3356-3373.
10Min Fan, Liu Qihe. A hierarchical model for test-cost-sensitive decision systems[J]. Information Sciences, 2009, 179(14): 2442-2452.

共引文献11

1张里博,李华雄,周献中,黄兵.人脸识别中的多粒度代价敏感三支决策[J].山东大学学报（理学版）,2014,49(8):48-57. 被引量：16
2杜丽娜,徐久成,刘洋洋,孙林.基于三支决策风险最小化的风险投资评估应用研究[J].山东大学学报（理学版）,2014,49(8):66-72. 被引量：9
3岑巍.基于动态代价敏感的数据挖掘算法探讨[J].信息安全与技术,2014,5(11):26-28.
4张燕平,邹慧锦,赵姝.基于CCA的代价敏感三支决策模型[J].南京大学学报（自然科学版）,2015,51(2):447-452. 被引量：10
5周步芳,祝峰.基于非负矩阵分解的代价敏感特征选择[J].烟台大学学报（自然科学与工程版）,2017,30(4):341-347.
6刘偲,秦亮曦.测试代价敏感的决策粗糙集正域约简[J].计算机科学与探索,2017,11(6):1014-1020. 被引量：4
7陈婉清,秦亮曦.基于代价敏感和近似分类质量的决策粗糙集属性约简研究[J].计算机应用研究,2019,36(4):1022-1025. 被引量：2
8徐怡,王旭生.多类分类模型和多层次增量算法[J].计算机科学与探索,2019,13(8):1431-1440. 被引量：2
9任杰,闵帆,汪敏.基于最远总距离采样的代价敏感主动学习[J].计算机应用,2019,39(9):2499-2504. 被引量：1
10刘偲,秦亮曦.模糊决策粗糙集代价敏感属性约简研究[J].计算机科学,2016,43(S2):67-72. 被引量：3

同被引文献9

1白鹤翔,王健,李德玉,陈千.基于粗糙集的非监督快速属性选择算法[J].计算机应用,2015,35(8):2355-2359. 被引量：3
2朱付保,徐显景,白庆春,朱颢东.基于数据集对象平均离群因子的离群点选择算法[J].微电子学与计算机,2016,33(1):131-134. 被引量：2
3刘海涛,魏汝祥,袁昊劼.基于互信息的混合属性数据特征选择方法[J].海军工程大学学报,2016,28(4):78-84. 被引量：5
4马远佳.网络动态通信中的节点选择方法仿真分析[J].计算机仿真,2016,33(9):204-207. 被引量：3
5李敬明,倪志伟,许莹,张琛.基于二进制萤火虫算法的属性选择方法研究[J].系统科学与数学,2017,37(2):407-424. 被引量：6
6杨艳.大数据环境下海量多媒体信息过滤技术改进[J].西安工程大学学报,2017,31(4):569-575. 被引量：10
7胡荣耀,刘星毅,程德波,何威,罗噭.鲁棒自表达的低秩属性选择算法[J].计算机工程,2017,43(9):43-50. 被引量：3
8钟智,何威,程德波,胡荣耀,刘星毅.基于子空间学习的图稀疏属性选择算法[J].计算机应用研究,2016,33(9):2679-2682. 被引量：3
9王宏杰,师彦文,王轩.基于互信息分组的名词型数据特征选择方法[J].数码设计,2017,6(3):10-14. 被引量：2

引证文献1

1寇峰,包萍.关于数据库中用户需求信息准确选择仿真[J].计算机仿真,2018,35(11):370-374.

1吴昊,倪志伟,王会颖.基于MapReduce的蚁群算法[J].计算机集成制造系统,2012,18(7):1503-1509. 被引量：22
2刘铭.大数据管理面临的挑战及技术新趋势[J].信息安全与通信保密,2014,0(10):42-43. 被引量：1
3李中,李晓.一种性能优化的防火墙规则匹配算法[J].计算机应用研究,2013,30(4):1205-1207. 被引量：3
4周培德,王文明.确定两个任意多边形的并的算法[J].北京理工大学学报,1998,18(1):87-91. 被引量：2
5黄伟婷,赵红,祝峰.代价敏感属性约简的自适应分治算法[J].山东大学学报（理学版）,2016,51(8):98-104.
6杨智明,李艳.动态规划与贪心法的对比分析[J].保山学院学报,2016,35(5):73-76. 被引量：1
7金汉均,曾婷.基于GPU的视频序列中运动目标轮廓提取[J].电子测量技术,2016,39(11):85-88. 被引量：3
8何美霞,周箩鱼,杨友平.DMC算法在电加热炉时滞系统中的仿真研究[J].长江大学学报（自科版）（上旬）,2016,13(8):23-28. 被引量：2
9陆国栋,黄长林,彭群生.基于分治思想的尺寸自动标注方法的研究与实现[J].计算机辅助设计与图形学学报,2001,13(6):521-526. 被引量：23
10陆国栋,叶金荣,彭群生.基于分治思想的工程图样智能理解方法研究[J].计算机集成制造系统-CIMS,2001,7(5):63-67. 被引量：1

计算机科学与探索

2016年第10期

浏览历史

内容加载中请稍等...

分治策略下的代价敏感属性选择回溯算法被引量：1

参考文献2

二级参考文献16

共引文献11

同被引文献9

引证文献1

相关作者

相关机构

相关主题

浏览历史

分治策略下的代价敏感属性选择回溯算法 被引量：1

参考文献2

二级参考文献16

共引文献11

同被引文献9

引证文献1

相关作者

相关机构

相关主题

浏览历史

分治策略下的代价敏感属性选择回溯算法被引量：1