An ensemble learning strategy for multi-source hydrogen embrittlement data by introducing missing information


Abstract: Accurately and quickly predicting hydrogen embrittlement performance is critical for the service of metal materials. However, because existing hydrogen embrittlement data come from multiple heterogeneous sources, they contain missing values, which makes it impractical to train reliable machine learning models directly. In this study, we proposed an ensemble learning training strategy for missing data based on the AdaBoost algorithm. The method introduces a mask matrix that records the missing data, so that each round of training generates sub-datasets that take the missing-value information into account. The strategy first trains on a subset of features, based on the existing dataset and a selected method, and then iteratively focuses training on the combination of features with the highest error; the mask matrix of the missing data is used as the input to a neural network that fits the weight of each base learner. Compared with modeling directly on the highly sparse data, the predictive ability of this strategy was significantly improved, by approximately 20%. In addition, in tests on new samples, the mean absolute error of the new model's predictions was reduced from 0.2 to 0.09. The strategy offers good adaptability to the hydrogen embrittlement sensitivity of different sizes and avoids the distortion of feature importance that filling in missing data can cause.
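The mask-matrix idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the least-squares base learner, and the rule for combining learners are all assumptions made for illustration. In particular, the AdaBoost-style focus on the highest-error feature combination is not reproduced, and the neural network that maps the mask to base-learner weights is replaced by a much simpler stand-in that averages only those learners whose features are all observed for a given sample.

```python
import numpy as np

def missing_mask(X):
    """Mask matrix: 1 where a value is observed, 0 where it is missing (NaN)."""
    return (~np.isnan(X)).astype(int)

def complete_rows(mask, features):
    """Indices of samples for which every feature in `features` is observed."""
    return np.flatnonzero(mask[:, features].all(axis=1))

def fit_linear(X, y):
    """Least-squares base learner with intercept (hypothetical stand-in regressor)."""
    A = np.column_stack([X, np.ones(len(X))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_linear(coef, X):
    """Apply a fitted linear base learner."""
    return np.column_stack([X, np.ones(len(X))]) @ coef

def fit_ensemble(X, y, subsets):
    """Train one base learner per feature subset on its complete sub-dataset."""
    mask = missing_mask(X)
    learners = []
    for feats in subsets:
        rows = complete_rows(mask, feats)
        learners.append((feats, fit_linear(X[np.ix_(rows, feats)], y[rows])))
    return learners

def predict_ensemble(learners, X):
    """Mask-gated average: a learner votes only where all its features are observed.

    Assumption: this equal-weight gating stands in for the learned weighting
    described in the abstract. Samples no learner can handle get NaN.
    """
    mask = missing_mask(X)
    total = np.zeros(len(X))
    count = np.zeros(len(X))
    for feats, coef in learners:
        usable = np.flatnonzero(mask[:, feats].all(axis=1))
        if usable.size:
            total[usable] += predict_linear(coef, X[np.ix_(usable, feats)])
            count[usable] += 1
    return np.where(count > 0, total / np.maximum(count, 1), np.nan)
```

Because each base learner only ever sees rows that are complete for its own feature subset, no missing value has to be filled in. That is the property the abstract credits with avoiding interference in feature importance.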

Authors: Xujie Gong, Ruichao Lei, Ruize Sun, Xue Jiang, Yanjing Su, Yu Yan

Affiliation: Beijing Advanced Innovation Center for Materials Genome Engineering

Source: Materials Genome Engineering Advances (材料基因工程前沿, English-language journal), 2024, Issue 2, pp. 145-157 (13 pages)

Funding: National Key Research and Development Program of China (2022YFB3707500, 2021YFB3802101).

Keywords: ensemble learning; hydrogen embrittlement; machine learning; missing data

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]

