期刊文献+

一种改进的降噪自编码神经网络不平衡数据分类算法 被引量:16

Imbalanced data classification algorithm of improved de-noising auto-encoder neural network
下载PDF
导出
摘要 针对少数类样本合成过采样技术(synthetic minority over-sampling technique,SMOTE)在合成少数类新样本时会带来噪声问题,提出了一种改进降噪自编码神经网络不平衡数据分类算法(SMOTE-SDAE)。该算法通过SMOTE方法合成少数类新样本以均衡原始数据集,考虑到合成样本过程中会产生噪声的影响,利用降噪自编码神经网络算法的逐层无监督降噪学习和有监督微调过程,有效实现对过采样数据集的降噪处理与数据分类。在UCI不平衡数据集上实验结果表明,相比传统SVM算法,该算法显著提高了不平衡数据集中少数类的分类精度。 Aiming at the noise problems of SMOTE algorithm when synthesizing new minority class posed a stacked de-noising auto-encoder neural net-work algorithm based on SMOTE, SMOTE-SDAE. The propobalanced the original data sets by using SMOTE to synthesize new minority class samples, and then effectclassifies the oversampling data sets through the layer-by-layer unsupervised de-noise learning and supervised fine-tuning process of de-noising auto-encoder neural network given the impact of noise produced in Experimental results on UCI imbalanced data sets indicate that compared with traditional SVM algorithms, SMOTE-SDAE algorithm significantly improves the minority class classification accuracy of the imbalanced data sets.
出处 《计算机应用研究》 CSCD 北大核心 2017年第5期1329-1332,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(61672301 61662057) 内蒙古自然科学基金资助项目(2016MS0336) 内蒙古民族大学科学研究资助项目(NMDYB1731) 内蒙古自治区"草原英才工程"基金资助项目(2013) 内蒙古自治区"青年科技领军人才"基金资助项目(NJYT-14-A09) 内蒙古自治区"321人才工程"二层次人选基金资助项目(2010)
关键词 神经网络 过采样 不平衡数据 分类 neural network over-sampling imbalanced data classification
  • 相关文献

参考文献6

二级参考文献91

  • 1韩敏,林丽玉.基于神经网络集成的蛋白质二级结构预测模型[J].计算机与应用化学,2006,23(10):959-962. 被引量:11
  • 2孙啸,陆祖宏,谢建明.生物信息学基础[M].北京:清华大学出版社,2006:249-281.
  • 3Deleage G,Roux 'B.An hlgorithm for protein secondary structure based on class predietion[J].Protein Eng, 1987,1:289-294.
  • 4Chou P Y,Fasman G D.Prediction of protein conformation [J]. Biochem, 1974,13:222.
  • 5Gamier J,Osguthorpe D J,Robson B.Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins[J].J Mol Bio,1978,120:97.
  • 6Rost B,Sander C.Prediction of protein secondary structure at better than 70% accuracy[J]j Mol Biol, 1993,232:584-599.
  • 7Chen Yue-hui,Zhang Xue-qin,Yang M Q,et al.Ensemble of probabilistic neural networks for protein fold recognition[C]//IEEE 7th International Symposium on Bioinformatics and BioEngineering, 2007.
  • 8Cuff J A,Barton G J.Evaluation and improvement of multiple sequence methods for protein secondary structure prediction[J]. PROTEIN:Structure,Function,and Genetics, 1999,34:508-519.
  • 9Jones D T.Protein secondary structure prediction based on position-specific scoring matrices[J].J Mol Biol, 1999,292:195-202.
  • 10Pagni M,Cerutti L, Bordoli LAn introduction to patterns,profiles[C]// HMMs and PSI-BLAST,2003.

共引文献655

同被引文献152

引证文献16

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部