Abstract
BACKGROUND The development of precision medicine is essential for personalized treatment and improved clinical outcomes, and biomarkers are critical to the success of precision therapies.
AIM To investigate whether iCEMIGE (integration of CEll-morphometrics, MIcro-biome, and GEne biomarker signatures) improves risk stratification of breast cancer (BC) patients.
METHODS We used our recently developed machine learning technique to identify cellular morphometric biomarkers (CMBs) from whole-slide histological images in The Cancer Genome Atlas (TCGA) breast cancer (TCGA-BRCA) cohort. Multivariate Cox regression was used to assess whether the cell-morphometrics prognosis score (CMPS), along with our previously reported 12-gene expression prognosis score (GEPS) and 15-microbe abundance prognosis score (MAPS), were independent prognostic factors. iCEMIGE was built upon the sparse representation learning technique. The performance of the iCEMIGE scoring model was measured by the area under the receiver operating characteristic curve and compared with that of CMPS, GEPS, or MAPS alone. Nomogram models were created to predict overall survival (OS) and progression-free survival (PFS) rates at 5 and 10 years in the TCGA-BRCA cohort.
RESULTS We identified 39 CMBs that were used to create a CMPS system in BC. CMPS, GEPS, and MAPS were each significantly and independently associated with OS. We then established an iCEMIGE scoring system for risk stratification of BC patients. The iCEMIGE score had significant prognostic value for OS and PFS independent of clinical factors (age, stage, and estrogen and progesterone receptor status) and PAM50-based molecular subtype. Importantly, the iCEMIGE score significantly increased the power to predict OS and PFS compared with CMPS, GEPS, or MAPS alone.
CONCLUSION Our study demonstrates a novel and generic artificial intelligence framework for multimodal data integration toward improving prognostic risk stratification of BC patients, which can be extended to other types of cancer.
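As an illustration of the evaluation metric named in METHODS, the following minimal sketch computes the area under the receiver operating characteristic curve for a risk score against a binary outcome. It is not the study's code: the function name `auc`, the toy values, and the naive summed score are all hypothetical, and the sketch assumes a fixed-horizon binary event with no censoring, which is simpler than a survival analysis would require. The summed score only stands in for "a combined score" and is not the sparse-representation-based iCEMIGE model.

```python
from itertools import product


def auc(scores, events):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen patient with the event
    has a higher risk score than one without it (ties count as 0.5).
    """
    pos = [s for s, e in zip(scores, events) if e == 1]
    neg = [s for s, e in zip(scores, events) if e == 0]
    if not pos or not neg:
        raise ValueError("need at least one event and one non-event")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p, n in product(pos, neg))
    return wins / (len(pos) * len(neg))


# Hypothetical toy values, not data from the study.
events = [0, 0, 1, 1]            # 1 = event within the chosen horizon
score_a = [0.1, 0.4, 0.35, 0.8]  # stand-in for one single-modality score
score_b = [0.2, 0.3, 0.6, 0.5]   # stand-in for another single-modality score
combined = [a + b for a, b in zip(score_a, score_b)]  # naive sum, NOT iCEMIGE

for name, s in [("A", score_a), ("B", score_b), ("A+B", combined)]:
    print(name, auc(s, events))
```

Comparing the value returned for a combined score with the values for each single-modality score mirrors, in a very reduced form, the comparison of iCEMIGE against CMPS, GEPS, or MAPS alone.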
Funding
This work was supported by the Department of Defense (DoD) BCRP, No. BC190820;
the National Cancer Institute (NCI) at the National Institutes of Health (NIH), No. R01CA184476;
MCIN/AEI/10.13039/501100011039, No. PID2020-118527RB-I00 and No. PDC2021-121735-I00;
the "European Union Next Generation EU/PRTR"; and the Regional Government of Castile and León, No. CSI144P20. Lawrence Berkeley National Laboratory (LBNL) is a multi-program national laboratory operated by the University of California for the DOE under contract DE-AC02-05CH11231.