期刊文献+

Semi-supervised machine-learning classification of materials synthesis procedures 被引量:5

原文传递
导出
摘要 Digitizing large collections of scientific literature can enable new informatics approaches for scientific analysis and meta-analysis.However,most content in the scientific literature is locked-up in written natural language,which is difficult to parse into databases using explicitly hard-coded classification rules.In this work,we demonstrate a semi-supervised machine-learning method to classify inorganic materials synthesis procedures from written natural language.Without any human input,latent Dirichlet allocation can cluster keywords into topics corresponding to specific experimental materials synthesis steps,such as“grinding”and“heating”,“dissolving”and“centrifuging”,etc.Guided by a modest amount of annotation,a random forest classifier can then associate these steps with different categories of materials synthesis,such as solid-state or hydrothermal synthesis.Finally,we show that a Markov chain representation of the order of experimental steps accurately reconstructs a flowchart of possible synthesis procedures.Our machine-learning approach enables a scalable approach to unlock the large amount of inorganic materials synthesis information from the literature and to process it into a standardized,machine-readable database.
出处 《npj Computational Materials》 SCIE EI CSCD 2019年第1期562-568,共7页 计算材料学(英文)
基金 Funding to support this work was provided by the Energy&Biosciences Institute through the EBI-Shell program,Office of Naval Research(ONR)Award #N00014-14-1-0444 the National Science Foundation under Grant No 5710003959.
  • 相关文献

参考文献1

二级参考文献2

共引文献16

同被引文献18

引证文献5

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部