期刊文献+

TransDFL:Identification of Disordered Flexible Linkers in Proteins by Transfer Learning

原文传递
导出
摘要 Disordered flexible linkers(DFLs)are the functional disordered regions in proteins,which are the sub-regions of intrinsically disordered regions(IDRs)and play important roles in connecting domains and maintaining inter-domain interactions.Trained with the limited available DFLs,the existing DFL predictors based on the machine learning techniques tend to predict the ordered residues as DFLs,leading to a high false positive rate(FPR)and low prediction accuracy.Previous studies have shown that DFLs are extremely flexible disordered regions,which are usually predicted as disordered residues with high confidence[P(D)>0.9]by an IDR predictor.Therefore,transferring an IDR predictor to an accurate DFL predictor is of great significance for understanding the functions of IDRs.In this study,we proposed a new predictor called TransDFL for identifying DFLs by transferring the RFPR-IDP predictor for IDR identification to the DFL prediction.The RFPR-IDP was pre-trained with IDR sequences to learn the general features between IDRs and DFLs,which is helpful to reduce the false positives in the ordered regions.RFPR-IDP was fine-tuned with the DFL sequences to capture the specific features of DFLs so as to be transferred into the TransDFL.Experimental results of two application scenarios(prediction of DFLs only in IDRs or prediction of DFLs in entire proteins)showed that TransDFL consistently outperformed other existing DFL predictors with higher accuracy.
出处 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2023年第2期359-369,共11页 基因组蛋白质组与生物信息学报(英文版)
基金 supported by the National Key R&D Program of China(Grant No.2018AAA0100100) the Beijing Natural Science Foundation,China(Grant No.JQ19019).
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部