The history and current status of materials data activities from handbook to database are reviewed, with introduction to some important products. Through an example of prediction of interfacial thermal resistance base...The history and current status of materials data activities from handbook to database are reviewed, with introduction to some important products. Through an example of prediction of interfacial thermal resistance based on data and data science methods, we show the advantages and potential of material informatics to study material issues which are too complicated or time consuming for conventional theoretical and experimental methods. Materials big data is the fundamental of material informatics. The challenges and strategy to construct materials big data are discussed, and some solutions are proposed as the results of our experiences to construct National Institute for Materials Science(NIMS) materials databases.展开更多
Symbolic regression(SR),exploring mathematical expressions from a given data set to construct an interpretable model,emerges as a powerful computational technique with the potential to transform the“black box”machin...Symbolic regression(SR),exploring mathematical expressions from a given data set to construct an interpretable model,emerges as a powerful computational technique with the potential to transform the“black box”machining learning methods into physical and chemistry interpretable expressions in material science research.In this review,the current advancements in SR are investigated,focusing on the underlying theories,fundamental flowcharts,various techniques,implemented codes,and application fields.More predominantly,the challenging issues and future opportunities in SR that should be overcome to unlock the full potential of SR in material design and research,including graphics processing unit accelera-tion and transfer learning algorithms,the trade-off between expression accuracy and complexity,physical or chemistry interpretable SR with generative large language models,and multimodal SR methods,are discussed.展开更多
Discovering new materials with excellent performance is a hot issue in the materials genome initiative.Traditional experiments and calculations often waste large amounts of time and money and are also limited by vario...Discovering new materials with excellent performance is a hot issue in the materials genome initiative.Traditional experiments and calculations often waste large amounts of time and money and are also limited by various conditions. Therefore, it is imperative to develop a new method to accelerate the discovery and design of new materials. In recent years, material discovery and design methods using machine learning have attracted much attention from material experts and have made some progress. This review first outlines available materials database and material data analytics tools and then elaborates on the machine learning algorithms used in materials science. Next, the field of application of machine learning in materials science is summarized, focusing on the aspects of structure determination, performance prediction, fingerprint prediction, and new material discovery. Finally, the review points out the problems of data and machine learning in materials science and points to future research. Using machine learning algorithms, the authors hope to achieve amazing results in material discovery and design.展开更多
The introduction of density functional theory(DFT)and electronic structure has brought computational methods into the field of materials science.In these theoretical calculations,quantum mechanics is predominantly use...The introduction of density functional theory(DFT)and electronic structure has brought computational methods into the field of materials science.In these theoretical calculations,quantum mechanics is predominantly used.Machine learning(ML)and high-throughput computing share some inherent similarities,as both can extract valuable information from massive datasets and possess parallelism and scalability.ML techniques simulate human thought processes,with algorithms that make decisions and have good scalability and strong generalization abilities.The combination of high-throughput and ML technologies leverages the advantages of high-throughput technology standardization and high capacity,addressing the challenges faced by ML at the front end.This complementary combination is expected to further enhance the efficiency of material screening and development.In data mining,using ML methods on various databases,the interrelationships between molecular structures and properties are discovered from large amounts of data.Mapping,current utilization of DFT,materials genomics,and high-throughput computing have generated a substantial amount of data.This review provides new insights into the development of electrochemistry.展开更多
基金Project supported by “Materials Research by Information Integration” Initiative(MI2I) project of the Support Program for Starting Up Innovation Hub from Japan Science and Technology Agency(JST)
文摘The history and current status of materials data activities from handbook to database are reviewed, with introduction to some important products. Through an example of prediction of interfacial thermal resistance based on data and data science methods, we show the advantages and potential of material informatics to study material issues which are too complicated or time consuming for conventional theoretical and experimental methods. Materials big data is the fundamental of material informatics. The challenges and strategy to construct materials big data are discussed, and some solutions are proposed as the results of our experiences to construct National Institute for Materials Science(NIMS) materials databases.
基金National Natural Science Foundation of China,Grant/Award Number:52332005National Key Research and Development Program of China,Grant/Award Number:2022YFB3807200China Postdoctoral Science Foundation,Grant/Award Number:2022TQ0019。
文摘Symbolic regression(SR),exploring mathematical expressions from a given data set to construct an interpretable model,emerges as a powerful computational technique with the potential to transform the“black box”machining learning methods into physical and chemistry interpretable expressions in material science research.In this review,the current advancements in SR are investigated,focusing on the underlying theories,fundamental flowcharts,various techniques,implemented codes,and application fields.More predominantly,the challenging issues and future opportunities in SR that should be overcome to unlock the full potential of SR in material design and research,including graphics processing unit accelera-tion and transfer learning algorithms,the trade-off between expression accuracy and complexity,physical or chemistry interpretable SR with generative large language models,and multimodal SR methods,are discussed.
基金financially supported by the National Natural Science Foundation of China (Nos. 61971208, 61671225 and 51864027)the Yunnan Applied Basic Research Projects (No. 2018FA034)+2 种基金the Yunnan Reserve Talents of Young and Middleaged Academic and Technical Leaders (Shen Tao, 2018)the Yunnan Young Top Talents of Ten Thousands Plan (Shen Tao, Zhu Yan, Yunren Social Development No. 2018 73)the Scientific Research Foundation of Kunming University of Science and Technology (No. KKSY201703016)。
文摘Discovering new materials with excellent performance is a hot issue in the materials genome initiative.Traditional experiments and calculations often waste large amounts of time and money and are also limited by various conditions. Therefore, it is imperative to develop a new method to accelerate the discovery and design of new materials. In recent years, material discovery and design methods using machine learning have attracted much attention from material experts and have made some progress. This review first outlines available materials database and material data analytics tools and then elaborates on the machine learning algorithms used in materials science. Next, the field of application of machine learning in materials science is summarized, focusing on the aspects of structure determination, performance prediction, fingerprint prediction, and new material discovery. Finally, the review points out the problems of data and machine learning in materials science and points to future research. Using machine learning algorithms, the authors hope to achieve amazing results in material discovery and design.
基金financially supported by the National Key Research and Development Program of China(2021YFB3601502)the Key Research Program of Frontier Sciences,CAS(ZDBS-LY-SLH035)+6 种基金the National Natural Science Foundation of China(22193044,61835014,51972336)the West Light Foundation of CAS(2019-YDYLTD-002)the Natural Science Foundation of Xinjiang(2021D01E05)the CAS Project for Young Scientists in Basic Research(YSBR-024)Xinjiang Major Science and Technology Project(2021A01001)the CAS President’s International Fellowship Initiative(PIFI,2020PM0046)Tianshan Basic Research Talents(2022TSYCJU0001)。
基金supported by the National Natural Science Foundation of China(grant no.U1904215)the Natural Science Foundation of Jiangsu Province,China(grant no.BK20200044)the Changjiang Scholars Program of the Ministry of Education,China(grant no.Q2018270).
文摘The introduction of density functional theory(DFT)and electronic structure has brought computational methods into the field of materials science.In these theoretical calculations,quantum mechanics is predominantly used.Machine learning(ML)and high-throughput computing share some inherent similarities,as both can extract valuable information from massive datasets and possess parallelism and scalability.ML techniques simulate human thought processes,with algorithms that make decisions and have good scalability and strong generalization abilities.The combination of high-throughput and ML technologies leverages the advantages of high-throughput technology standardization and high capacity,addressing the challenges faced by ML at the front end.This complementary combination is expected to further enhance the efficiency of material screening and development.In data mining,using ML methods on various databases,the interrelationships between molecular structures and properties are discovered from large amounts of data.Mapping,current utilization of DFT,materials genomics,and high-throughput computing have generated a substantial amount of data.This review provides new insights into the development of electrochemistry.