DeepSecE:A Deep-Learning-Based Framework for Multiclass Prediction of Secreted Proteins in Gram-Negative Bacteria

导出

摘要 Proteins secreted by Gram-negative bacteria are tightly linked to the virulence and adaptability of these microbes to environmental changes.Accurate identification of such secreted proteins can facilitate the investigations of infections and diseases caused by these bacterial pathogens.However,current bioinformatic methods for predicting bacterial secreted substrate proteins have limited computational efficiency and application scope on a genome-wide scale.Here,we propose a novel deep-learning-based framework—DeepSecE—for the simultaneous inference of multiple distinct groups of secreted proteins produced by Gram-negative bacteria.DeepSecE remarkably improves their classification from nonsecreted proteins using a pretrained protein language model and transformer,achieving a macro-average accuracy of 0.883 on 5-fold cross-validation.Performance benchmarking suggests that DeepSecE achieves competitive performance with the state-of-the-art binary predictors specialized for individual types of secreted substrates.The attention mechanism corroborates salient patterns and motifs at the N or C termini of the protein sequences.Using this pipeline,we further investigate the genome-wide prediction of novel secreted proteins and their taxonomic distribution across~1,000 Gram-negative bacterial genomes.The present analysis demonstrates that DeepSecE has major potential for the discovery of disease-associated secreted proteins in a diverse range of Gram-negative bacteria.An online web server of DeepSecE is also publicly available to predict and explore various secreted substrate proteins via the input of bacterial genome sequences.

作者 Yumeng Zhang Jiahao Guan Chen Li Zhikang Wang Zixin Deng Robin BGasser Jiangning Song Hong-Yu Ou

机构地区 state Key Laboratory of Microbial Metabolism Shanghai Key Laboratory of Veterinary Biotechnology Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology Monash Data Futures Institute Melbourne Veterinary School

出处《Research》 SCIE EI CSCD 2024年第3期243-257,共15页 研究（英文）

基金 the National Natural Science Foundation of China(32070572) the Foundation of Key Laboratory of Veterinary Biotechnology(shklab202005) Shanghai,China,and the Science and Technology Commission of Shanghai Municipality(19JC1413000) R.B.G.and J.S.were supported by grants from the Australian Research Council(ARC)(LP220200614).

关键词 DEEP SERVER COMPETITIVE

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

参考文献1

1Jianfeng Zhang,Jiahao Guan,Meng Wang,Gang Li,Marko Djordjevic,Cui Tai,Hui Wang,Zixin Deng,Zhaoyan Chen,Hong-Yu Ou.SecReT6 update:a comprehensive resource of bacterial TypeⅥSecretion Systems[J].Science China(Life Sciences),2023,66(3):626-634. 被引量：3

共引文献2

1王海蓉,宁年智,王慧.鲍曼不动杆菌Ⅵ型分泌系统功能蛋白的研究及应用新进展[J].微生物学报,2024,64(2):391-407.
2伍水龙,黄瑜,王蓓,汤菊芬,蔡佳,简纪常.水生动物病原菌Ⅵ型分泌系统(T6SS)及其溶血素共调节蛋白研究进展[J].大连海洋大学学报,2024,39(1):162-171.

1Mengmeng Wu,Zhixiang Lin,Shining Ma,Ting Chen,Rui Jiang,Wing Hung Wong.Simultaneous inference of phenotype-associated genes and relevant tissues from GWAS data via Bayesian integration of multiple tissue-specific gene networks[J].Journal of Molecular Cell Biology,2017,9(6):436-452. 被引量：1
2Jian-Yu Jiao,Rashidin Abdugheni,Dao-Feng Zhang,Iftikhar Ahmed,Mukhtiar Ali,Maria Chuvochina,Svetlana N.Dedysh,Xiuzhu Dong,Markus Göker,Brian P.Hedlund,Philip Hugenholtz,Kamlesh Jangid,Shuang-Jiang Liu,Edward R.B.Moore,Manik Prabhu Narsing Rao,Aharon Oren,Ramon Rossello-Mora,Bhagwan Narayan Rekadwad,Nimaichand Salam,Wensheng Shu,Iain C.Sutcliffe,Wee Fei Aaron Teo,Martha E.Trujillo,Stephanus N.Venter,William B.Whitman,Guoping Zhao,Wen-Jun Li.Advancements in prokaryotic systematics and the role of Bergey's International Society for Microbial Systematicsin addressing challenges in the meta-data era[J].National Science Review,2024,11(7):286-298.
3A Brief Introduction to Sichuan Academy of Social Sciences[J].Contemporary Social Sciences,2024,9(5).
4叶力硕,何志学.融合小波分解的多尺度时间序列异常检测[J].计算机应用,2024,44(10):3300-3306.
5Congyu Li,Yu Ling,Yanjie Zhang,Haiyan Wang,Huan Wang,Guokai Yan,Weiyang Dong,Yang Chang,Liang Duan.Insight into the microbial community of denitrification process using different solid carbon sources: Not only bacteria[J].Journal of Environmental Sciences,2024(10):87-99.
6刘丰丽,张建军,曾战东,黄广锋,马同胜.先天性巨结肠竞争内源性RNA网络构建和分析[J].中华小儿外科杂志,2024,45(9):819-826.
7王霞,王卓然,张珊,王勇.种群熵竞争粒子群算法[J].计算机工程与应用,2024,60(20):96-115.
8Papada Natsathaporn,Gordon Herwig,Stefanie Altenried,Qun Ren,René M.Rossi,Daniel Crespy,Fabian Itel.Functional Fiber Membranes with Antibacterial Properties for Face Masks[J].Advanced Fiber Materials,2023,5(4):1519-1533. 被引量：1
9Qiang Sun,Yan-Wei Fu,Xiang-Yang Xue.Learning a Mixture of Conditional Gating Blocks for Visual Question Answering[J].Journal of Computer Science & Technology,2024,39(4):912-928.
10Yiming Li,Yuying Yang,Yifei Wang,Timothy RWalsh,Shaolin Wang,Chang Cai.Molecular characterization of bla_(NDM)-harboring plasmids reveal its rapid adaptation and evolution in the Enterobacteriaceae[J].One Health Advances,2023(1):26-40.

Research

2024年第3期

浏览历史

内容加载中请稍等...

DeepSecE:A Deep-Learning-Based Framework for Multiclass Prediction of Secreted Proteins in Gram-Negative Bacteria

参考文献1

共引文献2

相关作者

相关机构

相关主题

浏览历史