Dropout training for SVMs with data augmentation 被引量：1

Dropout training for SVMs with data augmentation

导出

摘要 Dropout and other feature noising schemes have shown promise in controlling over-fitting by artificially corrupting the training data. Though extensive studies have been performed for generalized linear models, little has been done for support vector machines （SVMs）, one of the most successful approaches for supervised learning. This paper presents dropout training for both linear SVMs and the nonlinear extension with latent representation learning. For linear SVMs, to deal with the intractable expectation of the non-smooth hinge loss under corrupting distributions, we develop an iteratively re-weighted least square （IRLS） algorithm by exploring data augmentation techniques. Our algorithm iteratively minimizes the expectation of a re- weighted least square problem, where the re-weights are analytically updated. For nonlinear latent SVMs, we con- sider learning one layer of latent representations in SVMs and extend the data augmentation technique in conjunction with first-order Taylor-expansion to deal with the intractable expected hinge loss and the nonlinearity of latent representa- tions. Finally, we apply the similar data augmentation ideas to develop a new IRLS algorithm for the expected logistic loss under corrupting distributions, and we further develop a non-linear extension of logistic regression by incorporating one layer of latent representations. Our algorithms offer insights on the connection and difference between the hinge loss and logistic loss in dropout training. Empirical results on several real datasets demonstrate the effectiveness of dropout training on significantly boosting the classification accuracy of both linear and nonlinear SVMs. Dropout and other feature noising schemes have shown promise in controlling over-fitting by artificially corrupting the training data. Though extensive studies have been performed for generalized linear models, little has been done for support vector machines （SVMs）, one of the most successful approaches for supervised learning. This paper presents dropout training for both linear SVMs and the nonlinear extension with latent representation learning. For linear SVMs, to deal with the intractable expectation of the non-smooth hinge loss under corrupting distributions, we develop an iteratively re-weighted least square （IRLS） algorithm by exploring data augmentation techniques. Our algorithm iteratively minimizes the expectation of a re- weighted least square problem, where the re-weights are analytically updated. For nonlinear latent SVMs, we con- sider learning one layer of latent representations in SVMs and extend the data augmentation technique in conjunction with first-order Taylor-expansion to deal with the intractable expected hinge loss and the nonlinearity of latent representa- tions. Finally, we apply the similar data augmentation ideas to develop a new IRLS algorithm for the expected logistic loss under corrupting distributions, and we further develop a non-linear extension of logistic regression by incorporating one layer of latent representations. Our algorithms offer insights on the connection and difference between the hinge loss and logistic loss in dropout training. Empirical results on several real datasets demonstrate the effectiveness of dropout training on significantly boosting the classification accuracy of both linear and nonlinear SVMs.

作者 Ning CHEN Jun ZHU Jianfei CHEN Ting CHEN

机构地区 MOE Key lab of Bioinformatics State Key Lab of Intelligent Technology and Systems

出处《Frontiers of Computer Science》 SCIE EI CSCD 2018年第4期694-713,共20页 中国计算机科学前沿（英文版）

关键词 DROPOUT SVMS logistic regression data aug- mentation iteratively reweighted least square dropout SVMs logistic regression data aug- mentation iteratively reweighted least square

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论] TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献2

1高上凯.浅谈脑—机接口的发展现状与挑战[J].中国生物医学工程学报,2007,26(6):801-803. 被引量：70
2王行愚,金晶,张宇,王蓓.脑控:基于脑-机接口的人机融合控制[J].自动化学报,2013,39(3):208-221. 被引量：97

引证文献1

1陈娇.基于深度卷积网络的脑电运动想象分类方法[J].中国医疗设备,2019,34(8):37-41. 被引量：3

二级引证文献3

1鲁杰,杨晓栋,彭靖宇,刘鹏伟.基于LSTM的运动想象脑电信号分类方法[J].电子设计工程,2021,29(4):88-92. 被引量：5
2王景凯,周书锋.fNIRS脑功能数据分析与行为分类[J].山东师范大学学报（自然科学版）,2022,37(3):283-291.
3梁国富,黄子君,李运德.基于ELM-SVM的脑电专注度分类器关键技术研究[J].桂林航天工业学院学报,2024,29(3):351-359.

1彭刚,唐松平,张作刚,彭杰,张彦斌.基于改进多分类概率SVM模型的变压器故障诊断[J].机械与电子,2018,36(4):42-47.
2银温社,胡杨升,董青青,易三莉,贺建峰.基于深度学习的细胞癌恶化程度预测方法研究[J].软件导刊,2018,17(3):11-14. 被引量：2

Frontiers of Computer Science

2018年第4期

浏览历史

内容加载中请稍等...

Dropout training for SVMs with data augmentation 被引量：1

同被引文献2

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史