Magnesium(Mg)based materials hold immense potential for various applications due to their lightweight and high strength-to-weight ratio.However,to fully harness the potential of Mg alloys,structured analytics are esse...Magnesium(Mg)based materials hold immense potential for various applications due to their lightweight and high strength-to-weight ratio.However,to fully harness the potential of Mg alloys,structured analytics are essential to gain valuable insights from centuries of accumulated knowledge.Efficient information extraction from the vast corpus of scientific literature is crucial for this purpose.In this work,we introduce MagBERT,a BERT-based language model specifically trained for Mg-based materials.Utilizing a dataset of approximately 370,000 abstracts focused on Mg and its alloys,MagBERT is designed to understand the intricate details and specialized terminology of this domain.Through rigorous evaluation,we demonstrate the effectiveness of MagBERT for information extraction using a fine-tuned named entity recognition(NER)model,named MagNER.This NER model can extract mechanical,microstructural,and processing properties related to Mg alloys.For instance,we have created an Mg alloy dataset that includes properties such as ductility,yield strength,and ultimate tensile strength(UTS),along with standard alloy names.The introduction of MagBERT is a novel advancement in the development of Mg-specific language models,marking a significant milestone in the discovery of Mg alloys and textual information extraction.By making the pre-trained weights of MagBERT publicly accessible,we aim to accelerate research and innovation in the field of Mg-based materials through efficient information extraction and knowledge discovery.展开更多
This study presents a transfer learning approach for discovering potential Mg-based superconductors utilizing a comprehensive target dataset.Initially,a large source dataset(Bandgap dataset)comprising approximately∼7...This study presents a transfer learning approach for discovering potential Mg-based superconductors utilizing a comprehensive target dataset.Initially,a large source dataset(Bandgap dataset)comprising approximately∼75k compounds is utilized for pretraining,followed by fine-tuning with a smaller Critical Temperature(T_(c))dataset containing∼300 compounds.Comparatively,there is a significant improvement in the performance of the transfer learning model over the traditional deep learning(DL)model in predicting Tc.Subsequently,the transfer learning model is applied to predict the properties of approximately 150k compounds.Predictions are validated computationally using density functional theory(DFT)calculations based on lattice dynamics-related theory.Moreover,to demonstrate the extended predictive capability of the transfer learning model for new materials,a pool of virtual compounds derived from prototype crystal structures from the Materials Project(MP)database is generated.T_(c) predictions are obtained for∼3600 virtual compounds,which underwent screening for electroneutrality and thermodynamic stability.An Extra Trees-based model is trained to utilize E_(hull)values to obtain thermodynamically stable materials,employing a dataset containing Ehull values for approximately 150k materials for training.Materials with Ehull values exceeding 5 meV/atom were filtered out,resulting in a refined list of potential Mg-based superconductors.This study showcases the effectiveness of transfer learning in predicting superconducting properties and highlights its potential for accelerating the discovery of Mg-based materials in the field of superconductivity.展开更多
In the present work,we have employed machine learning(ML)techniques to evaluate ductile-brittle(DB)behaviors in intermetallic compounds(IMCs)which can form magnesium(Mg)alloys.This procedure was mainly conducted by a ...In the present work,we have employed machine learning(ML)techniques to evaluate ductile-brittle(DB)behaviors in intermetallic compounds(IMCs)which can form magnesium(Mg)alloys.This procedure was mainly conducted by a proxy-based method,where the ratio of shear(G)/bulk(B)moduli was used as a proxy to identify whether the compound is ductile or brittle.Starting from compounds information(composition and crystal structure)and their moduli,as found in open databases(AFLOW),ML-based models were built,and those models were used to predict the moduli in other compounds,and accordingly,to foresee the ductile-brittle behaviors of these new compounds.The results reached in the present work showed that the built models can effectively catch the elastic moduli of new compounds.This was confirmed through moduli calculations done by density functional theory(DFT)on some compounds,where the DFT calculations were consistent with the ML prediction.A further confirmation on the reliability of the built ML models was considered through relating between the DB behavior in MgBe_(13) and MgPd_(2),as evaluated by the ML-predicted moduli,and the nature of chemical bonding in these two compounds,which in turn,was investigated by the charge density distribution(CDD)and electron localization function(ELF)obtained by DFT methodology.The ML-evaluated DB behaviors of the two compounds was also consistent with the DFT calculations of CDD and ELF.These findings and confirmations gave legitimacy to the built model to be employed in further prediction processes.Indeed,as examples,the DB characteristics were investigated in IMCs that might from in three Mg alloy series,involving AZ,ZX and WE.展开更多
In this work,a machine learning(ML)model was created to predict intrinsic hardness of various compounds using their crystal chemistry.For this purpose,an initial dataset,containing the hardness values of 270 compounds...In this work,a machine learning(ML)model was created to predict intrinsic hardness of various compounds using their crystal chemistry.For this purpose,an initial dataset,containing the hardness values of 270 compounds and counterpart applied loads,was employed in the learning process.Based on various features generated using crystal information,an ML model,with a high accuracy(R^(2)=0.942),was built using extreme gradient boosting(XGB)algorithm.Experimental validations conducted by hardness measurements of various compounds,including MSi_(2)(M=Nb,Ce,V,and Ta),Al_(2)O_(3),and FeB_(4),showed that the XGB model was able to reproduce load-dependent hardness behaviors of these compounds.In addition,this model was also used to predict the behavior based on prototype crystal structures that are randomly substituted with elements.展开更多
基金supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.RS-2023-00221186).
文摘Magnesium(Mg)based materials hold immense potential for various applications due to their lightweight and high strength-to-weight ratio.However,to fully harness the potential of Mg alloys,structured analytics are essential to gain valuable insights from centuries of accumulated knowledge.Efficient information extraction from the vast corpus of scientific literature is crucial for this purpose.In this work,we introduce MagBERT,a BERT-based language model specifically trained for Mg-based materials.Utilizing a dataset of approximately 370,000 abstracts focused on Mg and its alloys,MagBERT is designed to understand the intricate details and specialized terminology of this domain.Through rigorous evaluation,we demonstrate the effectiveness of MagBERT for information extraction using a fine-tuned named entity recognition(NER)model,named MagNER.This NER model can extract mechanical,microstructural,and processing properties related to Mg alloys.For instance,we have created an Mg alloy dataset that includes properties such as ductility,yield strength,and ultimate tensile strength(UTS),along with standard alloy names.The introduction of MagBERT is a novel advancement in the development of Mg-specific language models,marking a significant milestone in the discovery of Mg alloys and textual information extraction.By making the pre-trained weights of MagBERT publicly accessible,we aim to accelerate research and innovation in the field of Mg-based materials through efficient information extraction and knowledge discovery.
文摘This study presents a transfer learning approach for discovering potential Mg-based superconductors utilizing a comprehensive target dataset.Initially,a large source dataset(Bandgap dataset)comprising approximately∼75k compounds is utilized for pretraining,followed by fine-tuning with a smaller Critical Temperature(T_(c))dataset containing∼300 compounds.Comparatively,there is a significant improvement in the performance of the transfer learning model over the traditional deep learning(DL)model in predicting Tc.Subsequently,the transfer learning model is applied to predict the properties of approximately 150k compounds.Predictions are validated computationally using density functional theory(DFT)calculations based on lattice dynamics-related theory.Moreover,to demonstrate the extended predictive capability of the transfer learning model for new materials,a pool of virtual compounds derived from prototype crystal structures from the Materials Project(MP)database is generated.T_(c) predictions are obtained for∼3600 virtual compounds,which underwent screening for electroneutrality and thermodynamic stability.An Extra Trees-based model is trained to utilize E_(hull)values to obtain thermodynamically stable materials,employing a dataset containing Ehull values for approximately 150k materials for training.Materials with Ehull values exceeding 5 meV/atom were filtered out,resulting in a refined list of potential Mg-based superconductors.This study showcases the effectiveness of transfer learning in predicting superconducting properties and highlights its potential for accelerating the discovery of Mg-based materials in the field of superconductivity.
基金supported by National Research Foundation(NRF)of South Korea(2020R1A2C1004720)。
文摘In the present work,we have employed machine learning(ML)techniques to evaluate ductile-brittle(DB)behaviors in intermetallic compounds(IMCs)which can form magnesium(Mg)alloys.This procedure was mainly conducted by a proxy-based method,where the ratio of shear(G)/bulk(B)moduli was used as a proxy to identify whether the compound is ductile or brittle.Starting from compounds information(composition and crystal structure)and their moduli,as found in open databases(AFLOW),ML-based models were built,and those models were used to predict the moduli in other compounds,and accordingly,to foresee the ductile-brittle behaviors of these new compounds.The results reached in the present work showed that the built models can effectively catch the elastic moduli of new compounds.This was confirmed through moduli calculations done by density functional theory(DFT)on some compounds,where the DFT calculations were consistent with the ML prediction.A further confirmation on the reliability of the built ML models was considered through relating between the DB behavior in MgBe_(13) and MgPd_(2),as evaluated by the ML-predicted moduli,and the nature of chemical bonding in these two compounds,which in turn,was investigated by the charge density distribution(CDD)and electron localization function(ELF)obtained by DFT methodology.The ML-evaluated DB behaviors of the two compounds was also consistent with the DFT calculations of CDD and ELF.These findings and confirmations gave legitimacy to the built model to be employed in further prediction processes.Indeed,as examples,the DB characteristics were investigated in IMCs that might from in three Mg alloy series,involving AZ,ZX and WE.
基金This research was supported by National Research Foundation(NRF)of South Korea(2020R1A2C1004720).
文摘In this work,a machine learning(ML)model was created to predict intrinsic hardness of various compounds using their crystal chemistry.For this purpose,an initial dataset,containing the hardness values of 270 compounds and counterpart applied loads,was employed in the learning process.Based on various features generated using crystal information,an ML model,with a high accuracy(R^(2)=0.942),was built using extreme gradient boosting(XGB)algorithm.Experimental validations conducted by hardness measurements of various compounds,including MSi_(2)(M=Nb,Ce,V,and Ta),Al_(2)O_(3),and FeB_(4),showed that the XGB model was able to reproduce load-dependent hardness behaviors of these compounds.In addition,this model was also used to predict the behavior based on prototype crystal structures that are randomly substituted with elements.