期刊文献+

基于鲸鱼优化和深度学习的不平衡大数据分类算法 被引量:8

Unbalanced Big Data Classification Algorithm Based on Whale Optimization and Deep Learning
下载PDF
导出
摘要 针对当前不平衡数据分类算法中存在的分类精度低和容易陷入局部最优状态的问题,提出一种基于鲸鱼优化和深度学习的不平衡大数据分类算法.该算法由特征选择、预处理和分类3个阶段组成:①为了提高分类精度,使用鲸鱼优化算法(whale optimization algorithm,WOA)在不平衡数据中寻找最优特征子集,消除不相关和多余的特征;②采用局部敏感哈希的合成少数类过采样技术(locality sensitive hashing synthetic minority oversampling technique,LSH-SMOT)对数据集进行预处理,解决类不平衡问题;③使用基于WOA算法优化的双向递归神经网络(bidirectional recurrent neural networks,BRNN)对预处理后的数据集进行分类.实验结果表明:本文算法能够有效解决不平衡数据集的分类问题,相比于其他算法,本文算法在分类精度和局部最优避免率方面具有明显优势. Aiming at the problems of low classification accuracy,which is easy to fall into local optimal state in the current unbalanced data classification algorithm,an unbalanced big data classification algorithm based on whale optimization and deep learning has been proposed.The algorithm consists of three parts:feature selection,preprocessing and classification.Firstly,in order to improve the classification accuracy,the whale optimization algorithm(WOA)has been used to find the optimal feature subset in the unbalanced data to eliminate the irrelevant and redundant features.Secondly,the locality sensitive hashing synthetic minority oversampling technique(LSH-SMOTE)has been used to preprocess the dataset to solve the class imbalance problem.And,finally,the bidirectional recurrent neural networks(BRNN)optimized based on WOA algorithm been used to classify the preprocessed dataset.The experimental results show that the proposed algorithm can effectively solve the classification problem of unbalanced data sets.Compared with other algorithms,it has obvious advantages in classification accuracy and local optimal avoidance rate.
作者 孙二华 胡云冰 SUN Er-hua;HU Yun-bing(School of Information Engineering, Chongqing Real Estate College, Chongqing 401331, China;College of Information Science and Technology, Xiamen University, Xiamen Fujian 361005, China)
出处 《西南师范大学学报(自然科学版)》 CAS 2021年第5期127-133,共7页 Journal of Southwest China Normal University(Natural Science Edition)
基金 重庆市教委高职教育双基地建设项目(20180310).
关键词 不平衡大数据分类 鲸鱼优化算法 深度学习 合成少数类过采样技术 unbalanced big data classification whale optimization algorithm deep learning synthetic minority oversampling technique
  • 相关文献

参考文献1

二级参考文献3

共引文献25

同被引文献102

引证文献8

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部