期刊文献+

数据预处理技术和机器学习方法在质子转移反应质谱中的应用 被引量:1

Review of Data Pre-processing Techniques and Machine Learning in PTR-MS
下载PDF
导出
摘要 质子转移反应质谱(PTR-MS)法是一种用于检测挥发性有机物(VOCs)的分析技术。它具有检测限低、响应速度快、无需样品前处理、实时分析等特点,在大气化学、环境化学、食品、生物医学等领域得到广泛应用。随着PTR-MS应用的扩展和样品种类的增加,如何从复杂的质谱数据中提取特征,并寻找内在规律,对分析算法的处理能力提出了更高的要求。本工作从数据预处理技术和机器学习方法两方面展开论述,归纳了具有PTR-MS特点的数据预处理技术,总结了不同机器学习算法在PTR-MS数据分析中的应用,并讨论了它们的优点和不足。 Proton transfer reaction mass spectrometry(PTR-MS)is an analytical technique developed for the detection of volatile organic compounds(VOCs).It offers many advantages for VOCs analysis,such us ultra-low detection limits,very short response,no sample preparation,real-time analysis,etc.It has been applied in atmospheric chemistry environmental chemistry,food and biomedical.With the expansion of applications of PTR-MS and the increase of sample types,how to analyze the features from complex data and find out the inherent rules have put forward higher requirements on the processing ability of the algorithm.Therefore,this paper discussed the data preprocessing techniques and machine learning methods.Firstly,we summarized the data preprocessing methods with PTR-MS features.The data generated by the instrument cannot be directly used for statistical analysis,otherwise it will bring great error.Therefore,data pre-processing is an essential step.It includes several steps,such as denoising,normalization,and concentration calculation.The purpose of preprocessing is to get data matrix for subsequent analysis.Next,we focused on the use of machine learning methods for data analysis in PTR-MS,and the advantages of this techniques would be demonstrated as well as the drawbacks.The machine learning method can be divided into two parts.Usually unsupervised methods are common choices for initial data analysis.For further analysis and a priori knowledge,a supervised analysis would be a better way.These methods use this knowledge to learn rules and patterns related to classes in the data,and then use these rules and patterns to predict classes in newly acquired data sets.The main goal of all surveillance techniques is to find the relationship between the predictor(VOC)matrix and the response vector.In general,the combination of the unsupervised and supervised methods is a good idea.PTR-MS is a soft ionization technique,however,the presence of a few fragments will still cause great difficulties in spectral analysis,especially for unknown mixtures,which is the main reason why spectral analysis of PTR-MS differs from other mass spectrometry methods.Perhaps,the data fusion of different platform instruments and different samples will be a good way to solve this problem.
作者 孙运 陈一冰 褚美娟 蒋学慧 汪曣 郭冰清 SUN Yun;CHEN Yi-bing;CHU Mei-juan;JIANG Xue-hui;WANG Yan;GUO Bing-qing(School of Precision Instrument and Optoelectronics Engineering,Tianjin University,Tianjin 300072,China;Respiratory Medicine,Chinese PLA General Hospital,Beijing 100853,China)
出处 《质谱学报》 EI CAS CSCD 北大核心 2018年第5期513-523,共11页 Journal of Chinese Mass Spectrometry Society
基金 国家重大科学仪器设备开发专项:质子转移反应质谱仪器研制及应用示范(2013YQ090875) 天津市应用基础与前沿技术研究计划:用于环境监测的离子漏斗-质子转移反应离子源质谱研究(15JCYBJC23300)资助
关键词 质子转移反应质谱(PTR-MS) 挥发性有机物(VOCs) 数据预处理 机器学习 proton transfer reaction mass spectrometry(PTR-MS) volatile organic compounds(VOCs) data pre-processing machine learning
  • 相关文献

参考文献5

二级参考文献88

  • 1何锡文,邢婉丽,史慧明.模式识别及其在分析化学中的应用[J].分析科学学报,1995,11(4):64-70. 被引量:10
  • 2黄海涛,陈章玉,施红林,缪恩铭,刘巍,杨光宇,张承明,孔维松.茶叶香味扫描和挥发性化学成分分析[J].分析化学,2005,33(8):1185-1188. 被引量:42
  • 3Dunn WB, Ellis DI. Metabolomics:Current analytical platforms and methodologies. TrAC Trends in Analytical Chemistry, 2005,24 ( 4 ) : 285 -294.
  • 4Spratlin JL, Serkova NJ, Eckhardt SG. Clinical applications of metabo- lomics in oncology: a review. Clin Cancer Res,2009,15 (2) :431-440.
  • 5Wishart DS. Applications of metabolomics in drug discovery and devel- opment. Drugs R D ,2008,9 (5) :307-322.
  • 6Taylor J, King RD,Altmann T, et al. Application of metabolomics to plant genotype discrimination using statistics and machine learning. Bioinformatics ,2002,18 ( 2 ) :241-248.
  • 7Nicholson JK, Lindon JC, Holmes E. Metabonomics' : understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. Xenohiotica, 1999,29( 11 ) : 1 lg1-1189.
  • 8Smith CA,Want EJ, O'Maille G, et al. XCMS :processing mass spectrom- etry data for metabolite profiling using nonlinear peak alignment, matc- hing, and identification. Analytical Chemistry,2006,78 (3) :779-787.
  • 9Sima C, Dougherty ER. What should be expected from feature selection in small- sample settings. Bioinformatics, 2006,22 ( 19 ) : 2430 -2436.
  • 10Dunn WB, Ellis DI. Metabolomics: Current analytical platforms and methodologies. Trac-Trend Anal Chem,2005,24 ( 4 ) :285-294.

共引文献62

同被引文献2

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部