β-TCP, as one of calcium phosphates ceramics, exerts perfect biocompatibility and osteoconductivity, and is clinically used as a bone graft substitute for decades. Consequently, the effects of β-TCP ceramics on intr...β-TCP, as one of calcium phosphates ceramics, exerts perfect biocompatibility and osteoconductivity, and is clinically used as a bone graft substitute for decades. Consequently, the effects of β-TCP ceramics on intracellular Ca2+ concentration, mineralization of osteoblast and BSA protein structure were studied. Results showed that β-TCP could increase the intracelluar Ca2+ concentration and mineralization of osteoblast, indicating that β-TCP ceramics could take part in the organic metabolism and the degradation product had no detrimental effect on osteoblast in vitro. Furthermore, β-TCP ceramics could increase the content of α-helix and β-pleated sheet and change BSA into more ordering structure, those changes might be favorable for the biomineralization after β-TCP ceramics implanted.展开更多
In this paper, the applications of evolutionary algorithm in prediction of protein secondary structure and tertiary structures are introduced, and recent studies on solving protein structure prediction problems using ...In this paper, the applications of evolutionary algorithm in prediction of protein secondary structure and tertiary structures are introduced, and recent studies on solving protein structure prediction problems using evolutionary algorithms are reviewed, and the challenges and prospects of EAs applied to protein structure modeling are analyzed and discussed.展开更多
The hydrophobic-polar (HP) lattice model is an important simplified model for studying protein folding. In this paper, we present an improved ACO algorithm for the protein structure prediction. In the algorithm, the &...The hydrophobic-polar (HP) lattice model is an important simplified model for studying protein folding. In this paper, we present an improved ACO algorithm for the protein structure prediction. In the algorithm, the "lone"ethod is applied to deal with the infeasible structures, and the "oint mutation and reconstruction"ethod is applied in local search phase. The empirical results show that the presented method is feasible and effective to solve the problem of protein structure prediction, and notable improvements in CPU time are obtained.展开更多
A distance measure that infers to indicate the evolutionary relationship of protein structures has been developed based on spatial preference factors of residues. The spatial preference factor is a reflection of the e...A distance measure that infers to indicate the evolutionary relationship of protein structures has been developed based on spatial preference factors of residues. The spatial preference factor is a reflection of the environment of residues in tertiary structure. Compared with the phyletic relationships derived from sequence homologies and three-dimensional structures, we find that the two lines of evolution are similar in general. This approach is applied to a group of glins here.展开更多
A three-dimensional off-lattice protein model with two species of monomers, hydrophobic and hydrophilic, is studied. Enligh- tened by the law of reciprocity among things in the physical world, a heuristic quasi-physic...A three-dimensional off-lattice protein model with two species of monomers, hydrophobic and hydrophilic, is studied. Enligh- tened by the law of reciprocity among things in the physical world, a heuristic quasi-physical algorithm for protein structure prediction problem is put forward. First, by elaborately simulating the movement of the smooth elastic balls in the physical world, the algorithm finds low energy configurations for a given monomer chain. An "off-trap" strategy is then proposed to get out of local minima. Experimental results show promising performance. For all chains with lengths 13≤n ≤55, the proposed algorithm finds states with lower energy than the putative ground states reported in literatures. Furthermore, for chain lengths n = 21, 34, and 55, the algorithm finds new low energy configurations different from those given in literatures.展开更多
The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weigh...The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weights of features in classification. We propose the GASVM algorithm (classification accuracy of support vector machine is regarded as the fitness value of genetic algorithm) to optimize the coefficients of these 16 features (5 features are proposed first time) in the classification, and further develop a new feature vector. Finally, based on the new feature vector, this paper uses support vector machine and 10-fold cross-validation to classify the protein structure of 3 low similarity datasets (25PDB, 1189, FC699). Experimental results show that the overall classification accuracy of the new method is better than other methods.展开更多
The deep-learning protein structure prediction method AlphaFold2 has garnered enormous attention beyond the realm of structural biology,for its groundbreaking contribution to solving the"protein foiding problem&q...The deep-learning protein structure prediction method AlphaFold2 has garnered enormous attention beyond the realm of structural biology,for its groundbreaking contribution to solving the"protein foiding problem"In this perspective,we explore the connection between protein structure studies and environmental research,delving into the potential for addressing specific environmental challenges.Proteins are promising for environmental applications because of the functional diversity endowed by their structural complexity.However,structural studies on proteins with environmental significance remain scarce.Here,we present the opportunity to study proteins by advancing experimental determination and deep-learning prediction methods.Specifically,the latest progress in environmental research via cryogenic electron microscopy is highlighted.It allows us to determine the structure of protein complexes in their native state within cells at molecular resolution,revealing environmentally-associated structural dynamics.With the remarkable advancements in computational power and experimental resolution,the study of protein structure and dynamics has reached unprecedented depth and accuracy.These advancements will undoubtedly accelerate the establishment of comprehensive environmental protein structural and functional databases.Tremendous opportunities for protein engineering exist to enable innovative solutions for environmental applications,such as the degradation of persistent contaminants,and the recovery of valuable metals as well as rare earth elements.展开更多
[Objective] This study aimed to predict the structure of protein OmpH from Pasteurella multocida C47-8 (PmC47-8) strain of yak. [Method] Online BLAST, signal peptide prediction, secondary structure prediction and pr...[Objective] This study aimed to predict the structure of protein OmpH from Pasteurella multocida C47-8 (PmC47-8) strain of yak. [Method] Online BLAST, signal peptide prediction, secondary structure prediction and protein characteristics of sequencing result of gene OmpH from PmC47-8 strain were analyzed. [Result] The similarities of gene OmpH from PmC47-8 with the published 81 OmpH genes were between 84% and 99%; a signal peptide was found with the cleavage sites between 20 and 21 in the polypeptide; secondary structure prediction showed that folding structure accounted for 49.8% and loop structure for 50.2%; it predicted that there were 7 O-glycosylation sites in OmpH protein with the amino acid residual sites of 2, 45, 48, 330, 716, 721, 723, respectively, and 2 N-glycosylation sites with the amino acid residual sites of 15 and 35. [Conclusion] This study lays the foundation for the study on the immunity of OmpH gene from yak.展开更多
It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional proper...It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional properties of proteins at five different stages, including early milky stage(EMS), middle milky stage(MMS), late milky stage(LMS), waxy ripe stage(WS)and ripening stage(RS), during the seed development were investigated. It was found that with the seed developing, the molecular weight of fresh rice protein gradually become larger while the secondary structure changed from the highest content of disordered structure at MMS to the highest content of ordered structure at RS, which affect the surface hydrophobicity and then the functional properties of proteins, including foaming properties, emulsifying properties and oil holding capacity. Fresh rice protein at MMS has the strongest surface hydrophobicity while fresh edible rice protein at RS has the strongest oil holding capability. The results of our study can provide a theoretical basis for the application of fresh rice protein in the food industry and help to develop new fresh edible rice food.展开更多
Currently many facets of genetic information are illdefined. In particular, how protein folding is genetically regulated has been a long-standing issue for genetics and protein biology. And a generic mechanistic model...Currently many facets of genetic information are illdefined. In particular, how protein folding is genetically regulated has been a long-standing issue for genetics and protein biology. And a generic mechanistic model with supports of genomic data is still lacking. Recent technological advances have enabled much needed genome-wide experiments. While putting the effect of codon optimality on debate, these studies have supplied mounting evidence suggesting a role of m RNA structure in the regulation of protein folding by modulating translational elongation rate. In conjunctions with previous theories, this mechanistic model of protein folding guided by m RNA structure shall expand our understandings of genetic information and offer new insights into various biomedical puzzles.展开更多
Proteomics allows the large-scale study of protein expression either in whole organisms or in purified organelles. In particular, mass spectrometry (MS) analysis of gel-separated proteins produces data not only for ...Proteomics allows the large-scale study of protein expression either in whole organisms or in purified organelles. In particular, mass spectrometry (MS) analysis of gel-separated proteins produces data not only for protein identification, but for protein structure, location, and processing as well. An in-depth analysis was performed on MS data from etiolated hypocotyl cell wall proteomics ofArabidopsis thaliana. These analyses show that highly homologous members of multigene families can be differentiated. Two lectins presenting 93% amino acid identity were identified using peptide mass fingerprinting. Although the identification of structural proteins such as extensins or hydroxyproline/proline-rich proteins (H/PRPs) is arduous, different types of MS spectra were exploited to identify and characterize an H/PRP. Maturation events in a couple of cell wall proteins (CWPs) were analyzed using site mapping. N-glycosylation of CWPs as well as the hydroxylation or oxidation of amino acids were also explored, adding information to improve our understanding of CWP structure/function relationships. A bioinformatic tool was developed to locate by means of MS the N-terminus of mature secreted proteins and N-glycosylation.展开更多
Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt ...Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt various research paradigms to attack the same structure prediction problem:biochemists and physicists attempt to reveal the principles governing protein folding;mathematicians,especially statisticians,usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure,while computer scientists formulate protein structure prediction as an optimization problem-finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure.These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman,namely,data modeling and algorithmic modeling.Recently,we have also witnessed the great success of deep learning in protein structure prediction.In this review,we present a survey of the efforts for protein structure prediction.We compare the research paradigms adopted by researchers from different fields,with an emphasis on the shift of research paradigms in the era of deep learning.In short,the algorithmic modeling techniques,especially deep neural networks,have considerably improved the accuracy of protein structure prediction;however,theories interpreting the neural networks and knowledge on protein folding are still highly desired.展开更多
Protein sequences as special heterogeneous sequences are rare in the amino acid sequence space. The specific sequen- tial order of amino acids of a protein is essential to its 3D structure. On the whole, the correlati...Protein sequences as special heterogeneous sequences are rare in the amino acid sequence space. The specific sequen- tial order of amino acids of a protein is essential to its 3D structure. On the whole, the correlation between sequence and structure of a protein is not so strong. How well would a protein sequence contain its structural information? How does a sequence determine its native structure? Keeping the globular proteins in mind, we discuss several problems from sequence to structure.展开更多
Can we determine a high resolution protein structure quickly, say, in a week? I will show this is possible by the current technologies together with new computational tools discussed in this article. We have three po...Can we determine a high resolution protein structure quickly, say, in a week? I will show this is possible by the current technologies together with new computational tools discussed in this article. We have three potential paths to explore: X-ray crystallography. While this method has produced the most protein structures in the PDB (Protein Data Bank), the nasty trial-and-error crystallization step remains to be an inhibitive obstacle.NMR (Nuclear Magnetic Resonance) spectroscopy. While the NMR experiments are relatively easy to do, the interpretation of the NMR data for structure calculation takes several months on average.In silico protein structure prediction. Can we actually predict high resolution structures consistently? If the predicted models remain to be labeled as “predicted”, and these structures still need to be experimentally verified by the wet lab methods, then this method at best can serve only as a screening tool. I investigate the question of “quick protein structure determination” from a computer scientist point of view and actually answer the more relevant question “what can a computer scientist effectively contribute to this goal”.展开更多
This paper describes a case study of 3D protein structure prediction of six sequences from protein data bank (PDB) by genetic algorithm and tabu search (GATS), where off-lattice AB model is considered as a simplif...This paper describes a case study of 3D protein structure prediction of six sequences from protein data bank (PDB) by genetic algorithm and tabu search (GATS), where off-lattice AB model is considered as a simplified model of protein structure. The lowest-energy values required for forming the native conformation of proteins are searched by GATS, and then the coarse structures (i.e., simplified structure) of the proteins are obtained according to the multiple angle parameters corresponding to the lowest energies. All the coarse structures form single hydrophobic cores surrounded by hydrophilic residues, which stay on the right side of the actual characteristic of protein structure. It demonstrates that this approach can predict the 3D protein structure effectively.展开更多
The HP model for protein structure prediction abstracts the fact that hydrophobicity is a dominant force in the protein folding process. This challenging combinatorial optimization problem has been widely addressed th...The HP model for protein structure prediction abstracts the fact that hydrophobicity is a dominant force in the protein folding process. This challenging combinatorial optimization problem has been widely addressed through metaheuristics. The evaluation function is a key component for the success of metaheuristics; the poor discrimination of the conventional evaluation function of the HP model has motivated the proposal of alternative formulations for this component. This comparative analysis inquires into the effectiveness of seven different evaluation functions for the HP model. The degree of discrimination provided by each of the studied functions, their capability to preserve a rank ordering among potential solutions which is consistent with the original objective of the HP model, as well as their effect on the performance of local search methods are analyzed. The obtained results indicate that studying alternative evaluation schemes for the HP model represents a highly valuable direction which merits more attention.展开更多
Hausdorff distance between two compact sets, defined as the maximum distance from a point of one set to another set, has many application in computer science. It is a good measure for the similarity of two sets. This ...Hausdorff distance between two compact sets, defined as the maximum distance from a point of one set to another set, has many application in computer science. It is a good measure for the similarity of two sets. This paper proves that the shape distance between two compact sets in R^n defined by nfinimum Hausdorff distance under rigid motions is a distance. The authors introduce similarity comparison problems in protein science, and propose that this measure may have good application to comparison of protein structure as well. For calculation of this distance, the authors give one dimensional formulas for problems (2, n), (3, 3), and (3, 4). These formulas can reduce time needed for solving these problems. The authors did some data, this formula can reduce time needed to one As n increases, it would save more time. numerical experiments for (2, n). On these sets of fifteenth of the best algorithms known on average.展开更多
Based on the concept of ant colony optimization and the idea of population in genetic algorithm, a novel global optimization algorithm, called the hybrid ant colony optimization (HACO), is proposed in this paper to ...Based on the concept of ant colony optimization and the idea of population in genetic algorithm, a novel global optimization algorithm, called the hybrid ant colony optimization (HACO), is proposed in this paper to tackle continuous-space optimization problems. It was compared with other well-known stochastic methods in the optimization of the benchmark functions and was also used to solve the problem of selecting appropriate dilation efficiently by optimizing the wavelet power spectrum of the hydrophobic sequence of protein, which is the key step on using continuous wavelet transform (CWT) to predict a-helices and connecting peptides.展开更多
Over the past several decades, biologists have conducted numerous studies examining both general and specific functions of proteins. Generally, if similarities in either the structure or sequence of amino acids exist ...Over the past several decades, biologists have conducted numerous studies examining both general and specific functions of proteins. Generally, if similarities in either the structure or sequence of amino acids exist for two proteins, then a common biological function is expected. Protein function is determined primarily based on the structure rather than the sequence of amino acids. The algorithm for protein structure alignment is an essential tool for the research. The quality of the algorithm depends on the quality of the similarity measure that is used, and the similarity measure is an objective function used to determine the best alignment because of their individual strength and weakness However, none of existing similarity measures became golden standard They require excessive filtering to find a single alignment. In this paper, we introduce a new strategy that finds not a single alignment, but multiple alignments with different lengths. This method has obvious benefits of high quality alignment. However, this novel method leads to a new problem that the running time for this method is considerably longer than that for methods that find only a single alignment. To address this problem~ we propose algorithms that can locate a common region (CORE) of multiple alignment candidates, and can then extend the CORE into multiple alignments. Because the CORE can be defined from a final alignment, we introduce CORE* that is similar to CORE and propose an algorithm to identify the CORE*. By adopting CORE* and dynamic programming, our proposed method produces multiple alignments of various lengths with higher accuracy than previous methods. In the experiments, the alignments identified by our algorithm are longer than those obtained by TM-align by 17% and 15.48%, on average, when the comparison is conducted at the level of super-family and fold, respectively.展开更多
Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing ch...Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing cheese quality.Fourier transform infrared spectroscopy(FTIR),with self-deconvolution,second derivative analysis and band curve-fitting,was used to characterize the secondary structure of proteins in Cheddar cheese during ripening.The spectra of the amide I region showed great similarity,while the relative contents of the secondary structures underwent a series of changes.As ripening progressed,the α-helix content decreased and the β-sheet content increased.This structural shift was attributed to the strengthening of hydrogen bonds that resulted from hydrolysis of caseins.In summary,FTIR could provide the basis for rapid characterization of cheese that is undergoing ripening.展开更多
基金Funded by the Research Fund of Key Laboratory for Advanced Technology in Environmental Protection of Jiangsu Province (AE201037)the Foundation for Talent Recruitment of Yancheng Institute of Technology (XKR2011007)"973" Chinese National Key Fundamental Research and Development Program (No.G1999064701)
文摘β-TCP, as one of calcium phosphates ceramics, exerts perfect biocompatibility and osteoconductivity, and is clinically used as a bone graft substitute for decades. Consequently, the effects of β-TCP ceramics on intracellular Ca2+ concentration, mineralization of osteoblast and BSA protein structure were studied. Results showed that β-TCP could increase the intracelluar Ca2+ concentration and mineralization of osteoblast, indicating that β-TCP ceramics could take part in the organic metabolism and the degradation product had no detrimental effect on osteoblast in vitro. Furthermore, β-TCP ceramics could increase the content of α-helix and β-pleated sheet and change BSA into more ordering structure, those changes might be favorable for the biomineralization after β-TCP ceramics implanted.
基金Supported by the National Natural Science Foundation of China(60133010,70071042,60073043)
文摘In this paper, the applications of evolutionary algorithm in prediction of protein secondary structure and tertiary structures are introduced, and recent studies on solving protein structure prediction problems using evolutionary algorithms are reviewed, and the challenges and prospects of EAs applied to protein structure modeling are analyzed and discussed.
文摘The hydrophobic-polar (HP) lattice model is an important simplified model for studying protein folding. In this paper, we present an improved ACO algorithm for the protein structure prediction. In the algorithm, the "lone"ethod is applied to deal with the infeasible structures, and the "oint mutation and reconstruction"ethod is applied in local search phase. The empirical results show that the presented method is feasible and effective to solve the problem of protein structure prediction, and notable improvements in CPU time are obtained.
文摘A distance measure that infers to indicate the evolutionary relationship of protein structures has been developed based on spatial preference factors of residues. The spatial preference factor is a reflection of the environment of residues in tertiary structure. Compared with the phyletic relationships derived from sequence homologies and three-dimensional structures, we find that the two lines of evolution are similar in general. This approach is applied to a group of glins here.
基金The National Natural Science Founda-tion of China (No.10471051) and the National Basic Research Program (973) of China (No.2004CB318000)
文摘A three-dimensional off-lattice protein model with two species of monomers, hydrophobic and hydrophilic, is studied. Enligh- tened by the law of reciprocity among things in the physical world, a heuristic quasi-physical algorithm for protein structure prediction problem is put forward. First, by elaborately simulating the movement of the smooth elastic balls in the physical world, the algorithm finds low energy configurations for a given monomer chain. An "off-trap" strategy is then proposed to get out of local minima. Experimental results show promising performance. For all chains with lengths 13≤n ≤55, the proposed algorithm finds states with lower energy than the putative ground states reported in literatures. Furthermore, for chain lengths n = 21, 34, and 55, the algorithm finds new low energy configurations different from those given in literatures.
文摘The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weights of features in classification. We propose the GASVM algorithm (classification accuracy of support vector machine is regarded as the fitness value of genetic algorithm) to optimize the coefficients of these 16 features (5 features are proposed first time) in the classification, and further develop a new feature vector. Finally, based on the new feature vector, this paper uses support vector machine and 10-fold cross-validation to classify the protein structure of 3 low similarity datasets (25PDB, 1189, FC699). Experimental results show that the overall classification accuracy of the new method is better than other methods.
基金Financial support from the National Natural Science Foundation of China(Grant Nos.52225001 and 51978485)the State Key Laboratory for Pollution Control(China)is acknowledged.
文摘The deep-learning protein structure prediction method AlphaFold2 has garnered enormous attention beyond the realm of structural biology,for its groundbreaking contribution to solving the"protein foiding problem"In this perspective,we explore the connection between protein structure studies and environmental research,delving into the potential for addressing specific environmental challenges.Proteins are promising for environmental applications because of the functional diversity endowed by their structural complexity.However,structural studies on proteins with environmental significance remain scarce.Here,we present the opportunity to study proteins by advancing experimental determination and deep-learning prediction methods.Specifically,the latest progress in environmental research via cryogenic electron microscopy is highlighted.It allows us to determine the structure of protein complexes in their native state within cells at molecular resolution,revealing environmentally-associated structural dynamics.With the remarkable advancements in computational power and experimental resolution,the study of protein structure and dynamics has reached unprecedented depth and accuracy.These advancements will undoubtedly accelerate the establishment of comprehensive environmental protein structural and functional databases.Tremendous opportunities for protein engineering exist to enable innovative solutions for environmental applications,such as the degradation of persistent contaminants,and the recovery of valuable metals as well as rare earth elements.
基金Supported by the Project for High-level Talents of Qinghai University (2008-QGC-7)~~
文摘[Objective] This study aimed to predict the structure of protein OmpH from Pasteurella multocida C47-8 (PmC47-8) strain of yak. [Method] Online BLAST, signal peptide prediction, secondary structure prediction and protein characteristics of sequencing result of gene OmpH from PmC47-8 strain were analyzed. [Result] The similarities of gene OmpH from PmC47-8 with the published 81 OmpH genes were between 84% and 99%; a signal peptide was found with the cleavage sites between 20 and 21 in the polypeptide; secondary structure prediction showed that folding structure accounted for 49.8% and loop structure for 50.2%; it predicted that there were 7 O-glycosylation sites in OmpH protein with the amino acid residual sites of 2, 45, 48, 330, 716, 721, 723, respectively, and 2 N-glycosylation sites with the amino acid residual sites of 15 and 35. [Conclusion] This study lays the foundation for the study on the immunity of OmpH gene from yak.
基金the financial support from the Postdoctoral Research Project of Heilongjiang Provincial Department of Human Resources and Social Security (LBH-Q21156)Heilongjiang BaYi Agricultural University Support Program for San Zong San Heng (ZDZX202104)+3 种基金Science Foundation Project of Heilongjiang Province (QC2015028)National Natural Science Foundation of China (32072258)Major Science and technology Program of Heilongjiang (2019ZX08B02,2020ZX08B02)Central financial support for the development of local colleges and universities,Graduate research and innovation project of Harbin University of Commerce (YJSCX2020636HSD)。
文摘It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional properties of proteins at five different stages, including early milky stage(EMS), middle milky stage(MMS), late milky stage(LMS), waxy ripe stage(WS)and ripening stage(RS), during the seed development were investigated. It was found that with the seed developing, the molecular weight of fresh rice protein gradually become larger while the secondary structure changed from the highest content of disordered structure at MMS to the highest content of ordered structure at RS, which affect the surface hydrophobicity and then the functional properties of proteins, including foaming properties, emulsifying properties and oil holding capacity. Fresh rice protein at MMS has the strongest surface hydrophobicity while fresh edible rice protein at RS has the strongest oil holding capability. The results of our study can provide a theoretical basis for the application of fresh rice protein in the food industry and help to develop new fresh edible rice food.
基金supported by the start-up grant from“Top 100 Talents Program”of Sun Yat-sen University to JRY(50000-31131114)General Program of National Natural Science Foundation of China to JRY(31671320)
文摘Currently many facets of genetic information are illdefined. In particular, how protein folding is genetically regulated has been a long-standing issue for genetics and protein biology. And a generic mechanistic model with supports of genomic data is still lacking. Recent technological advances have enabled much needed genome-wide experiments. While putting the effect of codon optimality on debate, these studies have supplied mounting evidence suggesting a role of m RNA structure in the regulation of protein folding by modulating translational elongation rate. In conjunctions with previous theories, this mechanistic model of protein folding guided by m RNA structure shall expand our understandings of genetic information and offer new insights into various biomedical puzzles.
文摘Proteomics allows the large-scale study of protein expression either in whole organisms or in purified organelles. In particular, mass spectrometry (MS) analysis of gel-separated proteins produces data not only for protein identification, but for protein structure, location, and processing as well. An in-depth analysis was performed on MS data from etiolated hypocotyl cell wall proteomics ofArabidopsis thaliana. These analyses show that highly homologous members of multigene families can be differentiated. Two lectins presenting 93% amino acid identity were identified using peptide mass fingerprinting. Although the identification of structural proteins such as extensins or hydroxyproline/proline-rich proteins (H/PRPs) is arduous, different types of MS spectra were exploited to identify and characterize an H/PRP. Maturation events in a couple of cell wall proteins (CWPs) were analyzed using site mapping. N-glycosylation of CWPs as well as the hydroxylation or oxidation of amino acids were also explored, adding information to improve our understanding of CWP structure/function relationships. A bioinformatic tool was developed to locate by means of MS the N-terminus of mature secreted proteins and N-glycosylation.
基金the National Key R&D Program of China(Grant No.2020YFA0907000)lthe National Natural Science Foundation of China(Grant Nos.32271297,62072435,31770775,and 31671369)for providing financial support for this study and publication charges.
文摘Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt various research paradigms to attack the same structure prediction problem:biochemists and physicists attempt to reveal the principles governing protein folding;mathematicians,especially statisticians,usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure,while computer scientists formulate protein structure prediction as an optimization problem-finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure.These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman,namely,data modeling and algorithmic modeling.Recently,we have also witnessed the great success of deep learning in protein structure prediction.In this review,we present a survey of the efforts for protein structure prediction.We compare the research paradigms adopted by researchers from different fields,with an emphasis on the shift of research paradigms in the era of deep learning.In short,the algorithmic modeling techniques,especially deep neural networks,have considerably improved the accuracy of protein structure prediction;however,theories interpreting the neural networks and knowledge on protein folding are still highly desired.
基金supported by the National Natural Science Foundation of China (Grant Nos. 11175224 and 11121403)
文摘Protein sequences as special heterogeneous sequences are rare in the amino acid sequence space. The specific sequen- tial order of amino acids of a protein is essential to its 3D structure. On the whole, the correlation between sequence and structure of a protein is not so strong. How well would a protein sequence contain its structural information? How does a sequence determine its native structure? Keeping the globular proteins in mind, we discuss several problems from sequence to structure.
基金supported by the National High Tech Research and Development 863 Program under Grant No.2008AA02Z313 from China’s Ministry of Science and TechnologyCanada’s NSERC under Grant No. OGP0046506Canada Research Chair Program,an NSERC Collaborative Grant,Ontario’s Premier’s Discovery Award
文摘Can we determine a high resolution protein structure quickly, say, in a week? I will show this is possible by the current technologies together with new computational tools discussed in this article. We have three potential paths to explore: X-ray crystallography. While this method has produced the most protein structures in the PDB (Protein Data Bank), the nasty trial-and-error crystallization step remains to be an inhibitive obstacle.NMR (Nuclear Magnetic Resonance) spectroscopy. While the NMR experiments are relatively easy to do, the interpretation of the NMR data for structure calculation takes several months on average.In silico protein structure prediction. Can we actually predict high resolution structures consistently? If the predicted models remain to be labeled as “predicted”, and these structures still need to be experimentally verified by the wet lab methods, then this method at best can serve only as a screening tool. I investigate the question of “quick protein structure determination” from a computer scientist point of view and actually answer the more relevant question “what can a computer scientist effectively contribute to this goal”.
基金Supported by the National Natural Science Foundation of China (60975031)the Scientific Research Foundation for the Returned Overseas Chinese Scholars of Ministry of Education of China, the Open Foundation of State Key Laboratory of Bioelectronics of Southeast University, China, and the Natural Science Foundation of Hubei Province, China (2008CDB344 and 2009CDA034)
文摘This paper describes a case study of 3D protein structure prediction of six sequences from protein data bank (PDB) by genetic algorithm and tabu search (GATS), where off-lattice AB model is considered as a simplified model of protein structure. The lowest-energy values required for forming the native conformation of proteins are searched by GATS, and then the coarse structures (i.e., simplified structure) of the proteins are obtained according to the multiple angle parameters corresponding to the lowest energies. All the coarse structures form single hydrophobic cores surrounded by hydrophilic residues, which stay on the right side of the actual characteristic of protein structure. It demonstrates that this approach can predict the 3D protein structure effectively.
基金partially supported by the National Council of Science and Technology of México (CO NACyT) under Grant Nos. 105060 and 99276
文摘The HP model for protein structure prediction abstracts the fact that hydrophobicity is a dominant force in the protein folding process. This challenging combinatorial optimization problem has been widely addressed through metaheuristics. The evaluation function is a key component for the success of metaheuristics; the poor discrimination of the conventional evaluation function of the HP model has motivated the proposal of alternative formulations for this component. This comparative analysis inquires into the effectiveness of seven different evaluation functions for the HP model. The degree of discrimination provided by each of the studied functions, their capability to preserve a rank ordering among potential solutions which is consistent with the original objective of the HP model, as well as their effect on the performance of local search methods are analyzed. The obtained results indicate that studying alternative evaluation schemes for the HP model represents a highly valuable direction which merits more attention.
基金supported by the National Natural Science Foundation of China under Grant No. 10771206973 Project (2004CB318000) of China
文摘Hausdorff distance between two compact sets, defined as the maximum distance from a point of one set to another set, has many application in computer science. It is a good measure for the similarity of two sets. This paper proves that the shape distance between two compact sets in R^n defined by nfinimum Hausdorff distance under rigid motions is a distance. The authors introduce similarity comparison problems in protein science, and propose that this measure may have good application to comparison of protein structure as well. For calculation of this distance, the authors give one dimensional formulas for problems (2, n), (3, 3), and (3, 4). These formulas can reduce time needed for solving these problems. The authors did some data, this formula can reduce time needed to one As n increases, it would save more time. numerical experiments for (2, n). On these sets of fifteenth of the best algorithms known on average.
基金the National Natural Science Foundation of China(No.20475068) the Guangdong Provincial Natural Science Foundation(No.031577).
文摘Based on the concept of ant colony optimization and the idea of population in genetic algorithm, a novel global optimization algorithm, called the hybrid ant colony optimization (HACO), is proposed in this paper to tackle continuous-space optimization problems. It was compared with other well-known stochastic methods in the optimization of the benchmark functions and was also used to solve the problem of selecting appropriate dilation efficiently by optimizing the wavelet power spectrum of the hydrophobic sequence of protein, which is the key step on using continuous wavelet transform (CWT) to predict a-helices and connecting peptides.
基金supported by Basic Science Research Program through the National Research Foundation of Korea (NRF)funded by the Ministry of Education,Science and Technology of Korea under Grant No.2012R1A1A3013084
文摘Over the past several decades, biologists have conducted numerous studies examining both general and specific functions of proteins. Generally, if similarities in either the structure or sequence of amino acids exist for two proteins, then a common biological function is expected. Protein function is determined primarily based on the structure rather than the sequence of amino acids. The algorithm for protein structure alignment is an essential tool for the research. The quality of the algorithm depends on the quality of the similarity measure that is used, and the similarity measure is an objective function used to determine the best alignment because of their individual strength and weakness However, none of existing similarity measures became golden standard They require excessive filtering to find a single alignment. In this paper, we introduce a new strategy that finds not a single alignment, but multiple alignments with different lengths. This method has obvious benefits of high quality alignment. However, this novel method leads to a new problem that the running time for this method is considerably longer than that for methods that find only a single alignment. To address this problem~ we propose algorithms that can locate a common region (CORE) of multiple alignment candidates, and can then extend the CORE into multiple alignments. Because the CORE can be defined from a final alignment, we introduce CORE* that is similar to CORE and propose an algorithm to identify the CORE*. By adopting CORE* and dynamic programming, our proposed method produces multiple alignments of various lengths with higher accuracy than previous methods. In the experiments, the alignments identified by our algorithm are longer than those obtained by TM-align by 17% and 15.48%, on average, when the comparison is conducted at the level of super-family and fold, respectively.
基金financially supported by Beijing Municipal Commission of Education Co-Constructed Programand Chinese Universities Scientific Fund(2009-4-25)
文摘Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing cheese quality.Fourier transform infrared spectroscopy(FTIR),with self-deconvolution,second derivative analysis and band curve-fitting,was used to characterize the secondary structure of proteins in Cheddar cheese during ripening.The spectra of the amide I region showed great similarity,while the relative contents of the secondary structures underwent a series of changes.As ripening progressed,the α-helix content decreased and the β-sheet content increased.This structural shift was attributed to the strengthening of hydrogen bonds that resulted from hydrolysis of caseins.In summary,FTIR could provide the basis for rapid characterization of cheese that is undergoing ripening.