One of the main challenges in materials discovery is efficiently exploring the vast search space for targeted properties as approaches that rely on trial-and-error are impractical.We review how methods from the inform...One of the main challenges in materials discovery is efficiently exploring the vast search space for targeted properties as approaches that rely on trial-and-error are impractical.We review how methods from the information sciences enable us to accelerate the search and discovery of new materials.In particular,active learning allows us to effectively navigate the search space iteratively to identify promising candidates for guiding experiments and computations.The approach relies on the use of uncertainties and making predictions from a surrogate model together with a utility function that prioritizes the decision making process on unexplored data.We discuss several utility functions and demonstrate their use in materials science applications,impacting both experimental and computational research.We summarize by indicating generalizations to multiple properties and multifidelity data,and identify challenges,future directions and opportunities in the emerging field of materials informatics.展开更多
There is intense interest in uncovering design rules that govern the formation of various structural phases as a function of chemical composition in multi-principal element alloys (MPEAs).In this paper,we develop a ma...There is intense interest in uncovering design rules that govern the formation of various structural phases as a function of chemical composition in multi-principal element alloys (MPEAs).In this paper,we develop a machine learning (ML) approach built on the foundations of ensemble learning,post hoc model interpretability of black-box models,and clustering analysis to establish a quantitative relationship between the chemical composition and experimentally observed phases of MPEAs.The originality of our work stems from performing instance-level (or local) variable attribution analysis of ML predictions based on the breakdown method,and then identifying similar instances based on k-means clustering analysis of the breakdown results.We also complement the breakdown analysis with Ceteris Paribus profiles that showcase how the model response changes as a function of a single variable,when the values of all other variables are fixed.Results from local model interpretability analysis uncover key insights into variables that govern the formation of each phase.Our developed approach is generic,model-agnostic,and valuable to explain the insights learned by the black-box models.An interactive web application is developed to facilitate model sharing and accelerate the design of MPEAs with targeted properties.展开更多
基金We are grateful to the Laboratory Directed Research and Development(LDRD)program(project#20180660ER)the Center for Nonlinear Studies at Los Alamos National Laboratory for support.
文摘One of the main challenges in materials discovery is efficiently exploring the vast search space for targeted properties as approaches that rely on trial-and-error are impractical.We review how methods from the information sciences enable us to accelerate the search and discovery of new materials.In particular,active learning allows us to effectively navigate the search space iteratively to identify promising candidates for guiding experiments and computations.The approach relies on the use of uncertainties and making predictions from a surrogate model together with a utility function that prioritizes the decision making process on unexplored data.We discuss several utility functions and demonstrate their use in materials science applications,impacting both experimental and computational research.We summarize by indicating generalizations to multiple properties and multifidelity data,and identify challenges,future directions and opportunities in the emerging field of materials informatics.
基金Research was sponsored by the Defense Advanced Research Project Agency (DARPA) and The Army Research Office and was accomplished under Grant Number W911NF-20-1-0289.
文摘There is intense interest in uncovering design rules that govern the formation of various structural phases as a function of chemical composition in multi-principal element alloys (MPEAs).In this paper,we develop a machine learning (ML) approach built on the foundations of ensemble learning,post hoc model interpretability of black-box models,and clustering analysis to establish a quantitative relationship between the chemical composition and experimentally observed phases of MPEAs.The originality of our work stems from performing instance-level (or local) variable attribution analysis of ML predictions based on the breakdown method,and then identifying similar instances based on k-means clustering analysis of the breakdown results.We also complement the breakdown analysis with Ceteris Paribus profiles that showcase how the model response changes as a function of a single variable,when the values of all other variables are fixed.Results from local model interpretability analysis uncover key insights into variables that govern the formation of each phase.Our developed approach is generic,model-agnostic,and valuable to explain the insights learned by the black-box models.An interactive web application is developed to facilitate model sharing and accelerate the design of MPEAs with targeted properties.