数据不依赖获取的质谱数据的深度学习分析方法被引量：1

Deep learning analysis for data-independent acquisition mass spectrometry data

下载PDF

导出

摘要近年来,数据不依赖获取(data-independent acquisition,DIA)质谱技术在蛋白质组学领域内被广泛关注.然而DIA质谱数据具有维度高、背景噪声大、多种信号混合等特点,这使得DIA质谱数据的分析成为一大挑战.本文提出一种基于深度学习的可直接处理DIA质谱数据的算法:Ultra-DIA.该算法使用深度变分自动编码器提取离子信号的特征来区分不同肽段产生的子离子,最终生成虚拟谱图,进而对肽段和蛋白进行定性和定量分析.对于测试数据,该算法找到的肽段数量和蛋白数量比主流算法DIA-Umpire分别多61.4%和64.5%.此外,相较于DIA-Umpire,该算法能够找到更多低浓度的蛋白. In recent years,data-independent acquisition(DIA)mass spectrometry techniques have received wide attention in proteomics.However,DIA data are characterized with high dimensionality,large background noises,and mixing of multiple signals,which further challenge the analysis of DIA data.In this work,an algorithm based on deep learning that can directly process DIA mass spectrum data,namely Ultra-DIA,has been developed.It is combined with the deep variational auto-encoder and a variety of machine learning algorithms to directly process DIA data and to extract the features of MS ion signals,so that fragment ions generated by different peptides can be distinguished.Finally,Ultra-DIA generates pseudo-spectra to identify and quantify MS peptides and proteins.For the test data,our algorithm has found 61.4%more peptides and 64.5%more proteins than the mainstream algorithm of DIA-Umpire.In addition,our algorithm is capable of finding more proteins at low concentration compared to the DIA-Umpire.

作者何情祖钟传奇李翔帅建伟韩家淮 HE Qingzu;ZHONG Chuanqi;LI Xiang;SHUAI Jianwei;HAN Jiahuai(College of Physical Science and Technology,Xiamen University,Xiamen 361005,China;School of Life Sciences,Xiamen University,Xiamen 361102,China;National Institute for Data Science in Health and Medicine,Xiamen University,Xiamen 361102,China)

机构地区厦门大学物理科学与技术学院厦门大学生命科学学院厦门大学健康医疗大数据国家研究院

出处《厦门大学学报（自然科学版）》 CAS CSCD 北大核心 2021年第1期97-103,共7页 Journal of Xiamen University：Natural Science

基金国家自然科学基金(11874310,11675134)。

关键词深度学习变分自动编码器数据不依赖获取质谱数据 deep learning variational autoencoders data-independent acquisition mass spectrometry data

分类号 Q633 [生物学—生物物理学]

引文网络
相关文献

引证文献1

1郭欢,何情祖,黎玉林,帅建伟.深度神经网络筛选蛋白质组学高置信度定量肽段[J].生物物理学,2023,11(2):17-29.

1赵婷婷.经阴道超声联合MRI动态增强早期诊断剖宫产术后瘢痕妊娠价值研究[J].中国实用乡村医生杂志,2020,27(11):51-53. 被引量：2
2赵宇杨,宋健,邱丽娟.大豆G位点近等基因系叶片类囊体蛋白质组比较分析[J].作物杂志,2020(6):8-16.
3许安宁.基于深度学习的三维点云语义分割方法综述[J].长江信息通信,2021(1):59-62. 被引量：5
4邢启凯,李铃仙,曹阳,张玮,彭军波,燕继晔,李兴红.可可毛色二孢全基因组分泌蛋白的预测及分析[J].中国农业科学,2020,53(24):5027-5038. 被引量：7
5周留柱,孟祥省,李晓明.气相过渡金属钛-碳链团簇的研究[J].原子与分子物理学报,2021,38(1):77-79.
6杨超,刘健慧,张玮杰,单亦初,戴忠鹏,张丽华,张玉奎.基于末端准等重同位素标记的肽段从头测序方法[J].分析化学,2021,49(3):366-376. 被引量：1
7王薇,徐磊,周梦祥,王宇.乙烯部分预混火焰中苯的生成机理[J].江苏大学学报（自然科学版）,2021,42(1):98-104.
8葛良全,李飞.我国X射线光谱现场分析技术研究进展[J].光谱学与光谱分析,2021,41(3):704-713. 被引量：5
9Shihui Zou,Zhinian Li,Qiuyue Zhou,Yang Pan,Wentao Yuan,Lei He,Shenliang Wang,Wu Wen,Juanjuan Liu,Yong Wang,Yonghua Du,Jiuzhong Yang,Liping Xiao,Hisayoshi Kobayashi,Jie Fan.Surface coupling of methyl radicals for efficient low‐temperature oxidative coupling of methane[J].Chinese Journal of Catalysis,2021,42(7):1117-1125. 被引量：3
10Yunjiang RAO,Zinan WANG,Huijuan WU,Zengling RAN,Bing HAN.Recent Advances in Phase-Sensitive Optical Time Domain Reflectometry (Ф-OTDR)[J].Photonic Sensors,2021,11(1):1-30. 被引量：10

厦门大学学报（自然科学版）

2021年第1期

浏览历史

内容加载中请稍等...

数据不依赖获取的质谱数据的深度学习分析方法被引量：1

引证文献1

相关作者

相关机构

相关主题

浏览历史

数据不依赖获取的质谱数据的深度学习分析方法 被引量：1

引证文献1

相关作者

相关机构

相关主题

浏览历史

数据不依赖获取的质谱数据的深度学习分析方法被引量：1