期刊文献+

基于分解策略的多目标演化子集选择算法 被引量:3

Decomposition-based Pareto optimization for subset selection
原文传递
导出
摘要 在许多现实的机器学习任务中,经常遇到从一组变量中挑选一个子集的问题,即子集选择问题.对于这类问题的求解是NP难的.最近,一种基于多目标演化算法的子集选择算法POSS被提出;无论是在理论上还是在实验上,POSS方法均获得了目前的最佳性能.然而,当问题规模很大的时候,POSS方法的运行时间变得难以令人满意,这阻碍了其在大规模实际问题中的应用.提出了一种基于分解策略的多目标演化子集选择算法DPOSS.DPOSS方法将整个子集空间分解成多个子空间,并依次调用POSS方法来求解.在理论上,DPOSS方法在获得和POSS方法相同近似性能下界的同时,运行时间随着分解个数的增加超线性下降.实验结果验证了这一理论,并显示出,DPOSS方法的实际性能随着分解个数的增加略有下降,但依然优于以往的贪婪算法. In many machine-learning tasks, subset selection, which selects a few variables from a large set, is a fundamental problem; it is, however, NP-hard. The recently emerged Pareto Optimization for Subset Selection(POSS) method is a powerful approximation solver for this problem. However, the POSS running time can be unsatisfactory when the problem size is large, restricting its large-scale applications. In this paper, we propose the DPOSS method, which uses a decomposition strategy. DPOSS decomposes the entire subset space into several subspaces, and then sequentially applies the POSS method. Our theoretical analysis shows that DPOSS can achieve the same approximation guarantee as POSS, while superlinearly reducing its running time with respect to the number of decompositions. Empirical studies show that DPOSS's actual running time decreases superlinearly,and the quality of the produced solution has a little loss. However, it is still better than the greedy algorithm,the previous algorithm with the best known theoretical guarantee.
作者 钱超 周志华
出处 《中国科学:信息科学》 CSCD 北大核心 2016年第9期1276-1287,共12页 Scientia Sinica(Informationis)
基金 国家自然科学基金(批准号:61333014 61321491)资助项目
关键词 机器学习 子集选择 多目标优化 多目标演化算法 分解策略 machine learning subset selection multi-objective optimization multi-objective evolutionary algorithm decomposition
  • 相关文献

参考文献2

二级参考文献41

  • 1Bo L F, Ren X F, Fox D. Kernel descriptor for visual recognition. In: Proceedings of the Annual Conference on Neural Information Process- ing Systems. 2010, 244-252.
  • 2Bosch A, Mun6z X, Marti R. Which is the best way to organize/classify images by content? Image and Vision Computing, 2007, 25(6): 778- 791.
  • 3Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91-110.
  • 4Dalal N, Triggs B. Histograms of oriented gradients for human detec- tion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2005, 886-893.
  • 5Vogel J, Schiele B. Semantic modeling of natural scenes for content- based image retrieval. International Journal of Computer Vision, 2007, 72(2): 133-157.
  • 6Li F E Perona E A bayesian hierarchical model for learning natural scene categories. In: Proceedings of the IEEE Conference on Com- puter Vision and Pattern Recognition. 2005, 524-531.
  • 7Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyra- mid matching for recognizing natural scene categories. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2006, 2169-2178.
  • 8Bo L F, Sminchisescu C. Efficient match kernel between sets of fea- tures for visual recognition. In: Proceedings of the Annual Conference on Neural Information Processing Systems. 2009, 135-143.
  • 9SchSlkopf B, Smola A, Mtiller K. Nonlinear component analysis as a kernel eigenvalue problem. Neurocomputing, 1998, 10(5): 1299-1319.
  • 10Xie B J, Liu Y, Zhang H, Yu J. Efficient kernel descriptor for image categorization via pivots selection. In: Proceedings of the IEEE Inter- national Conference on Image Processing. 2013, 3479-3483.

共引文献7

同被引文献31

引证文献3

二级引证文献133

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部