期刊文献+

Machine Learning Based Virtual Screening for Biodegradable Polyesters

Machine Learning Based Virtual Screening for Biodegradable Polyesters
下载PDF
导出
摘要 Current biodegradation timelines show that polyesters take over 200 years to break down. A crucial component of several industries, polyesters are relied upon for materials development and thus require sustainable alternatives. Recent works in generative modeling have made it possible to produce large sets of chemical structures, but current molecular screening methods are expensive, not scalable, and are oversimplified. This work evaluates whether a molecule’s biodegradability potential can be accurately predicted by training a model on recent experimental data. Additionally, three chemical descriptors were evaluated on the final molecules for their effects on biodegradability: molecular structure, bond types, and solubility. A Gradient Boosted Machine was trained on a dataset of 600 molecules and their binary labels on biodegradability. The classification model effectively captured the biodegradability property, yielding an Area Under the Receiver Operating Characteristics, AUROC, of 84% and an Area Under the Precision Recall Curve, or AUPRC, of 87%. Additionally, an existing amortized synthetic tree generation model, SynNet, validated each molecule by showing chemical synthesizability and producing simple and interpretable synthesis pathways. This approach of filtering by prediction and chemical rule interpretation is inexpensive, highly scalable and can capture the necessary complexity. Using this method, novel polyester candidates can be polymerized and produced into sustainable fabrics, reducing environmental stress from textile-reliant industries. Current biodegradation timelines show that polyesters take over 200 years to break down. A crucial component of several industries, polyesters are relied upon for materials development and thus require sustainable alternatives. Recent works in generative modeling have made it possible to produce large sets of chemical structures, but current molecular screening methods are expensive, not scalable, and are oversimplified. This work evaluates whether a molecule’s biodegradability potential can be accurately predicted by training a model on recent experimental data. Additionally, three chemical descriptors were evaluated on the final molecules for their effects on biodegradability: molecular structure, bond types, and solubility. A Gradient Boosted Machine was trained on a dataset of 600 molecules and their binary labels on biodegradability. The classification model effectively captured the biodegradability property, yielding an Area Under the Receiver Operating Characteristics, AUROC, of 84% and an Area Under the Precision Recall Curve, or AUPRC, of 87%. Additionally, an existing amortized synthetic tree generation model, SynNet, validated each molecule by showing chemical synthesizability and producing simple and interpretable synthesis pathways. This approach of filtering by prediction and chemical rule interpretation is inexpensive, highly scalable and can capture the necessary complexity. Using this method, novel polyester candidates can be polymerized and produced into sustainable fabrics, reducing environmental stress from textile-reliant industries.
作者 Navya Nori Navya Nori(Milton High School, Milton, USA)
机构地区 Milton High School
出处 《Journal of Materials Science and Chemical Engineering》 2024年第8期1-11,共11页 材料科学与化学工程(英文)
关键词 BIODEGRADABILITY Molecular Generation Virtual Screening Biodegradability Molecular Generation Virtual Screening
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部