Recent progress in material data mining has been driven by high-capacity models trained on large datasets.However,collecting experimental data(real data)has been extremely costly owing to the amount of human effort an...Recent progress in material data mining has been driven by high-capacity models trained on large datasets.However,collecting experimental data(real data)has been extremely costly owing to the amount of human effort and expertise required.Here,we develop a novel transfer learning strategy to address problems of small or insufficient data.This strategy realizes the fusion of real and simulated data and the augmentation of training data in a data mining procedure.For a specific task of grain instance image segmentation,this strategy aims to generate synthetic data by fusing the images obtained from simulating the physical mechanism of grain formation and the“image style”information in real images.The results show that the model trained with the acquired synthetic data and only 35%of the real data can already achieve competitive segmentation performance of a model trained on all of the real data.Because the time required to perform grain simulation and to generate synthetic data are almost negligible as compared to the effort for obtaining real data,our proposed strategy is able to exploit the strong prediction power of deep learning without significantly increasing the experimental burden of training data preparation.展开更多
The original version of this article omitted the following from the Acknowledgements:“This work was supported by Beijing Top Discipline for Artificial Intelligent Science and Engineering,University of Science and Tec...The original version of this article omitted the following from the Acknowledgements:“This work was supported by Beijing Top Discipline for Artificial Intelligent Science and Engineering,University of Science and Technology Beijing”.This has now been corrected in both the PDF and HTML versions of the article.展开更多
基金The authors acknowledge financial support from the National Key Research and Development Program of China(No.2016YFB0700500)the National Science Foundation of China(No.51574027,No.61572075,No.6170203,No.61873299)+1 种基金the Finance science and technology project of Hainan province(No.ZDYF2019009)the Fundamental Research Funds for the University of Science and Technology Beijing(No.FRF-BD-19-012A,No.FRF-TP-19-043A2).
文摘Recent progress in material data mining has been driven by high-capacity models trained on large datasets.However,collecting experimental data(real data)has been extremely costly owing to the amount of human effort and expertise required.Here,we develop a novel transfer learning strategy to address problems of small or insufficient data.This strategy realizes the fusion of real and simulated data and the augmentation of training data in a data mining procedure.For a specific task of grain instance image segmentation,this strategy aims to generate synthetic data by fusing the images obtained from simulating the physical mechanism of grain formation and the“image style”information in real images.The results show that the model trained with the acquired synthetic data and only 35%of the real data can already achieve competitive segmentation performance of a model trained on all of the real data.Because the time required to perform grain simulation and to generate synthetic data are almost negligible as compared to the effort for obtaining real data,our proposed strategy is able to exploit the strong prediction power of deep learning without significantly increasing the experimental burden of training data preparation.
文摘The original version of this article omitted the following from the Acknowledgements:“This work was supported by Beijing Top Discipline for Artificial Intelligent Science and Engineering,University of Science and Technology Beijing”.This has now been corrected in both the PDF and HTML versions of the article.