
Backtracking Algorithm for Cost-Sensitive Feature Selection Based on Divide and Conquer Strategy (original title: 分治策略下的代价敏感属性选择回溯算法; cited by: 1)
Abstract Cost-sensitive feature selection is an important research area in data mining. Its goal is to find the attribute subset with the lowest total cost by balancing test cost against misclassification cost. To address the long running time of the classical backtracking algorithm, this paper proposes an improved backtracking algorithm that incorporates a divide-and-conquer strategy. The improved algorithm introduces two related parameters, adjusts them adaptively according to the size of the dataset, and splits the dataset according to the parameter values, which reduces the problem size and improves the efficiency of classical backtracking. Experiments on relatively large datasets show that, compared with the classical backtracking algorithm, the improved algorithm is at least 20% more efficient while preserving result quality. Compared with heuristic algorithms, it obtains attribute subsets with a smaller total cost while maintaining efficiency, so it is applicable to practical problems.
Source Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), indexed in CSCD and the Peking University Core Journals list, 2016, No. 10, pp. 1451-1458 (8 pages)
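Funding National Natural Science Foundation of China (Nos. 61379049, 61379089, 61170128); Key Project of the Fujian Province Science and Technology Plan (No. 2012H0043); Zhangzhou Natural Science Foundation (No. ZZ2016J35)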
Keywords rough sets; granular computing; cost-sensitive; feature selection; adaptive divide and conquer