摘要
Targeted protein degradation(TPD)has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery.However,the susceptibility of proteins for targeting by TPD approaches,termed“degradability”,is largely unknown.Here,we developed a machine learning model,model-free analysis of protein degradability(MAPD),to predict degradability from features intrinsic to protein targets.MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds[with an area under the precision–recall curve(AUPRC)of 0.759 and an area under the receiver operating characteristic curve(AUROC)of 0.775]and is likely generalizable to independent non-kinase proteins.We found five features with statistical significance to achieve optimal prediction,with ubiquitination potential being the most predictive.By structural modeling,we found that E2-accessible ubiquitination sites,but not lysine residues in general,are particularly associated with kinase degradability.Finally,we extended MAPD predictions to the entire proteome to find964 disease-causing proteins(including proteins encoded by 278 cancer genes)that may be tractable to TPD drug development.
基金
supported by grants from the Breast Cancer Research Foundation(Grant No.BCRF-19-100 to X.Shirley Liu)
the Mark Foundation for Cancer Research(Mark Foundation Emerging Leader Award
Grant No.19-001-ELA to Eric S.Fischer)
the National Institutes of Health(NIH
Grant Nos.R01CA218278 and R01CA214608 to Eric S.Fischer)
Cancer Research Institute(Irvington Postdoctoral Fellowship
Grant No.CRI 3442 to Shourya S.Roy Burman),USA
Damon Runyon Fellow supported by the Damon Runyon Cancer Research Foundation,USA(Grant No.DRQ-04-20)。