摘要
Protein structure determination is a very important topic in structural genomics,which helps people to understand varieties of biological functions such as protein-protein interactions,protein-DNA interactions and so on.Nowadays,nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo.This study aims to automate the peak picking step,the most important and tricky step in NMR structure determination.We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem.Under the Bayesian framework,the peak picking problem is casted as a variable selection problem.The proposed method can automatically distinguish true peaks from false ones without preprocessing the data.To the best of our knowledge,this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method.
Protein structure determination is a very important topic in structural genomics,which helps people to understand varieties of biological functions such as protein-protein interactions,protein-DNA interactions and so on.Nowadays,nuclear magnetic resonance (NMR) has often been used to determine the three-dimensional structures of protein in vivo.This study aims to automate the peak picking step,the most important and tricky step in NMR structure determination.We propose to model the NMR spectrum by a mixture of bivariate Gaussian densities and use the stochastic approximation Monte Carlo algorithm as the computational tool to solve the problem.Under the Bayesian framework,the peak picking problem is casted as a variable selection problem.The proposed method can automatically distinguish true peaks from false ones without preprocessing the data.To the best of our knowledge,this is the first effort in the literature that tackles the peak picking problem for NMR spectrum data using Bayesian method.
基金
partially supported by grants from the National Science Foundation of USA(Grant No.DMS1007457 and DMS-1106494)
the award(KUS-C1-01604)made by King Abdullah University of Science and Technology(KAUST)to FL