期刊文献+

Machine learning models for predicting non-alcoholic fatty liver disease in the general United States population:NHANES database 被引量:2

下载PDF
导出
摘要 BACKGROUND Non-alcoholic fatty liver disease(NAFLD)is the most common chronic liver disease,affecting over 30% of the United States population.Early patient identification using a simple method is highly desirable.AIM To create machine learning models for predicting NAFLD in the general United States population.METHODS Using the NHANES 1988-1994.Thirty NAFLD-related factors were included.The dataset was divided into the training(70%)and testing(30%)datasets.Twentyfour machine learning algorithms were applied to the training dataset.The bestperforming models and another interpretable model(i.e.,coarse trees)were tested using the testing dataset.RESULTS There were 3235 participants(n=3235)that met the inclusion criteria.In the training phase,the ensemble of random undersampling(RUS)boosted trees had the highest F1(0.53).In the testing phase,we compared selective machine learning models and NAFLD indices.Based on F1,the ensemble of RUS boosted trees remained the top performer(accuracy 71.1%and F10.56)followed by the fatty liver index(accuracy 68.8% and F10.52).A simple model(coarse trees)had an accuracy of 74.9% and an F1 of 0.33.CONCLUSION Not every machine learning model is complex.Using a simpler model such as coarse trees,we can create an interpretable model for predicting NAFLD with only two predictors:fasting C-peptide and waist circumference.Although the simpler model does not have the best performance,its simplicity is useful in clinical practice.
出处 《World Journal of Hepatology》 2021年第10期1417-1427,共11页 世界肝病学杂志(英文版)(电子版)
  • 相关文献

参考文献1

二级参考文献1

共引文献7

同被引文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部