基于M-estimator函数的加权深度随机配置网络

Weighted Deep Stochastic Configuration Networks Based on M-estimator Functions

下载PDF

导出

摘要深度随机配置网络(Deep Stochastic Configuration Network,DSCN)是一种增量式随机化学习模型,具有人为干预程度低、学习效率高和泛化能力强等优点.但是,面向噪声数据回归与分析时,传统的DSCN易受到异常值影响,从而降低了模型的泛化性.因此,为提高噪声数据回归的精度和鲁棒性,提出了基于M-estimator函数的加权深度随机配置网络(Weighted Deep Stochastic Configuration Networks,WDSCN).首先,选取Huber和Bisquare 2个常用的M-estimator函数计算样本权重,利用加权最小二乘法和L2正则化策略替代最小二乘来更新WDSCN输出权重,以降低异常值对WDSCN的负面影响;其次,为提高WDSCN模型表征能力,设计了一种随机配置稀疏自编码器(Stochastic Configuration Sparse Autoencoder,SC-SAE),SC-SAE基于DSCN其独有的监督机制随机分配输入参数,采用基于L1正则化的目标函数,并利用交替方向乘子法(Alternating Direction Method of Multipliers,ADMM)计算SC-SAE输出权重;然后,为获取有效的特征表示,利用SC-SAE生成特征的随机性和多样性,采用多个SC-SAE进行特征学习并融合,用于WDSCN模型训练;最后,在真实数据集上的实验结果表明,WDSCN-Huber、WDSCN-Bisquare相比于DSCN、SCN以及RSC-KDE、RSC-Huber、RSC-IQR、RSCN-KDE、WBLS-KDE和RBLS-Huber等加权模型具有更高的泛化性能和回归精度. Deep stochastic configuration network(DSCN)is an randomized incremental learning model,it can start from a small structure,increase the nodes and hidden layers gradually.As the input weights and biases of nodes are assigned according to supervisory mechanism,meantime,all the nodes in hidden layer are fully connected to the outputs,the output weights of DSCN are determined through the least square method.Therefore,DSCN has the advantages of less manual intervention,high learning efficiency,strong generalization ability.However,although the randomized feedforward learning process of DSCN has faster efficiency,the feature learning ability is still insufficient.In the meantime,with the increase of nodes and hidden layers,it is easy to lead to overfitting phenomenon.When solving regression problems with noise,the performance of original DSCN is easily affected by outliers,which reduces the generalization ability of the model.Therefore,to improve the regression performance and robustness of DSCN,weighted deep sto-chastic configuration networks(WDSCN)based on M-Estimator functions are proposed.First of all,we adopt two common M-estimator functions(i.e.,Huber and Bisquare)to acquire the sample weights for re-ducing the negative impact of outliers.When the sample has a smaller training error,give this sample a lar-ger weight,while when the training error of sample is larger,it is determined to be outlier data and give this sample a smaller weight.The sample weight decreases monotonically with the increase of the absolute value of the error,thus reducing the influence of noisy data onto the model and improving the generaliza-tion of the algorithm.Meanwhile,the weighted least square method and L2 regularization strategy are in-troduced to calculate output weight vector replace the least square method.It can not only solve the noisy data regression problems and avoid over-fitting problem of DSCN.In the second place,the model based on L1 regularization is helpful to extract sparse features and improve the accuracy of supervised learning,for further improve the representation ability of WDSCN,a stochastic configuration sparse autoencoder(SC-SAE)is designed,SC-SAE use the supervision mechanism of DSCN to assign input parameters,at the same time,we adopt the L1 regularization technique to objective function for getting sparse features,alter-nating direction method of multipliers(ADMM)approach is utilized to solve the objective function for de-termining the output weights of SC-SAE.And then,as the randomness encoding process of SC-SAE,we can obtain the diversity of features of different SC-SAE models,consequently effective feature representa-tion can be acquired through fusion features from multiple SC-SAE for the training of WDSCN.Finally,experimental results on real-world datasets show that the proposed WDSCN-Huber and WDSCN-Bisquare have higher generalization performances and regression accuracies than DSCN,SCN,and other weighted models(e.g.,RSC-KDE,RSC-Huber,RSC-IQR,RDSCN-KDE,WBLS-KDE and RBLS-Huber).But in the meantime,the results of ablation experiment show that WDSCN with fusion sparse features which exacted from multiple different SC-SAE models are superior to those models with fusion sparse feature.Therefore,it is verified that SC-SAE can extract effective sparse features and improve the learning ability of weighted models.

作者丁世飞张成龙郭丽丽张健丁玲 DING Shi-Fei;ZHANG Cheng-Long;GUO Li-Li;ZHANG Jian;DING Ling(School of Computer Science and Technology,China University of Mining and Technology,Xuzhou,Jiangsu 221116;Mine Digitization Engineering Research Center of Ministry of Education(China University of Mining and Technology),Xuzhou,Jiangsu 221116;College of Intelligence and Computing,Tianjin University,Tianjin 300350)

机构地区中国矿业大学计算机科学与技术学院矿山数字化教育部工程研究中心(中国矿业大学) 天津大学智能与计算学部

出处《计算机学报》 EI CAS CSCD 北大核心 2023年第11期2476-2487,共12页 Chinese Journal of Computers

基金国家自然科学基金(62276265,61976216,62206297,61672522)资助。

关键词深度随机配置网络异常数据鲁棒性回归随机神经网络 deep stochastic configuration network noisy data robustness regression random neural network

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献3

1赵立杰,邹世达,郭烁,黄明忠.基于正则化随机配置网络的球磨机工况识别[J].控制工程,2020,27(1):1-7. 被引量：14
2王前进,杨春雨,马小平,张春富,彭思敏.基于随机配置网络的井下供给风量建模[J].自动化学报,2021,47(8):1963-1975. 被引量：15
3郭威,徐涛.基于M-estimator的鲁棒宽度学习系统[J].控制与决策,2023,38(4):1039-1046. 被引量：6

二级参考文献9

1韩敏,李德才.基于替代函数及贝叶斯框架的1范数ELM算法[J].自动化学报,2011,37(11):1344-1350. 被引量：19
2王前进,马小平,张守田.PLC软冗余在通风机监控系统中的应用[J].工矿自动化,2014,40(1):93-96. 被引量：18
3任子晖,王翠,倪婷婷,成江洋.基于神经网络不停风倒机风量变化的研究[J].煤炭技术,2016,35(11):229-231. 被引量：5
4赵立杰,李彬,汪滢,陈斌,王魏.磨机负荷参数快速去相关神经网络集成模型[J].控制工程,2017,24(9):1952-1957. 被引量：5
5胡显能,蔡改贫,罗小燕,宗路.基于CEEMDAN和多尺度排列熵的球磨机负荷识别方法[J].噪声与振动控制,2018,38(3):146-151. 被引量：16
6徐鹏飞,王敏,刘金平,唐朝晖,马天雨.基于数据分布特性的代价敏感宽度学习系统[J].控制与决策,2021,36(7):1686-1692. 被引量：4
7陈晓云,廖梦真.基于稀疏和近邻保持的极限学习机降维[J].自动化学报,2019,45(2):325-333. 被引量：11
8邹伟东,夏元清.基于压缩动量项的增量型ELM虚拟机能耗预测[J].自动化学报,2019,45(7):1290-1297. 被引量：4
9王前进,代伟,杨春雨,马小平.煤矿主通风机切换系统建模与分析[J].煤炭学报,2018,43(S2):606-614. 被引量：8

共引文献27

1李康,王魏,王奕鹏.集成随机配置网络在养殖水质监测中的应用[J].农业工程学报,2020,36(4):220-226. 被引量：4
2潘承燕,徐进学,翁永鹏.一种基于流形正则化随机配置网络的化工过程故障识别方法[J].仪器仪表学报,2021,42(5):219-226. 被引量：3
3代伟,李德鹏,杨春雨,马小平.一种随机配置网络的模型与数据混合并行学习方法[J].自动化学报,2021,47(10):2427-2437. 被引量：14
4赵立杰,王月,郭烁.基于AdaBoost.RT的污水水质随机配置网络集成模型[J].沈阳大学学报（自然科学版）,2022,34(3):189-196. 被引量：3
5宋旭东,朱大杰,杨杰,丛郁洋.一种基于L2正则化迁移学习的变负载工况条件下故障诊断方法[J].大连交通大学学报,2022,43(2):106-109. 被引量：2
6罗小燕,黄耀锋,李波波,刘吉顺.基于PSO-LSSVM球磨机负荷参数预测及监测系统开发[J].噪声与振动控制,2022,42(4):144-151. 被引量：4
7单显明,刘业峰,那崇正,靳新.基于ASOS-ELM的球磨机负荷参数软测量系统设计[J].计算机测量与控制,2022,30(10):70-75. 被引量：1
8王前进,代伟,陆群,辅小荣,马小平.一种随机配置网络软测量模型的稀疏学习方法[J].控制与决策,2022,37(12):3171-3182. 被引量：6
9孙宝华,程兴,杨刚.一种基于SCN集成学习的稀土元素组分含量软测量方法[J].电子技术与软件工程,2022(22):160-164.
10邓真平.随机配置网络在短时电力负荷曲线预测中的应用[J].西北民族大学学报（自然科学版）,2023,44(1):71-78. 被引量：1

1乐星宇,王玲凤,叶佳濛,孙璐懿,董思越.幼儿园教师的工作拖延与不合规任务的关系[J].中国心理卫生杂志,2023,37(12):1071-1077.
2周玉陶,张正华,朱尔立,金志琦,戚义盛,苏权.基于ADMM优化的停车位分配模型与求解[J].无线电工程,2023,53(12):2783-2790.
3李力,董密,宋冬然,杨建,王其兵.分布式的温控负荷集群负荷跟随控制[J].中国电机工程学报,2023,43(21):8270-8281. 被引量：6
4Han Wang,Mengge Shi,Peng Xie,Chun Sing Lai,Kang Li,Youwei Jia.Electric Vehicle Charging Scheduling Strategy for Supporting Load Flattening Under Uncertain Electric Vehicle Departures[J].Journal of Modern Power Systems and Clean Energy,2023,11(5):1634-1645.
5褚丹娜.数字能力和语言能力关系的新进展[J].汉字文化,2023(15):178-180.
6Ying Wang,Rui Xu,Shiqi Song,Xiaoyang Ma,Huaying Zhang,Xian Wu.Harmonic Data Recovery Method Based on Multivariate Norm Matrix[J].Journal of Modern Power Systems and Clean Energy,2023,11(5):1659-1672.
7杨玉凤.水质氨氮、水质总磷校准曲线加权问题探讨[J].云南化工,2023,50(11):104-109.
8戴云清,李征,安帅,曹光磊.应用3D打印辅助胫骨高位截骨技术对下肢力线矫正的疗效[J].骨科临床与研究杂志,2023,8(6):336-340. 被引量：4
9郝昭昕,张启明,孙进平,王彦平.基于ADMM算法的机载太赫兹SAR成像高频振动补偿[J].信号处理,2023,39(11):1933-1942. 被引量：1
10柳智.基于零模正则的神经网络剪枝方法[J].运筹与管理,2023,32(10):102-107.

计算机学报

2023年第11期

浏览历史

内容加载中请稍等...

基于M-estimator函数的加权深度随机配置网络

参考文献3

二级参考文献9

共引文献27

相关作者

相关机构

相关主题

浏览历史