期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
Protein Secondary Structure Prediction with Dynamic Self-Adaptation Combination Strategy Based on Entropy 被引量:1
1
作者 Yuehan Du Ruoyu Zhang +4 位作者 Xu Zhang Antai Ouyang Xiaodong Zhang Jinyong Cheng Wenpeng Lu 《Journal of Quantum Computing》 2019年第1期21-28,共8页
The algorithm based on combination learning usually is superior to a singleclassification algorithm on the task of protein secondary structure prediction. However,the assignment of the weight of the base classifier us... The algorithm based on combination learning usually is superior to a singleclassification algorithm on the task of protein secondary structure prediction. However,the assignment of the weight of the base classifier usually lacks decision-makingevidence. In this paper, we propose a protein secondary structure prediction method withdynamic self-adaptation combination strategy based on entropy, where the weights areassigned according to the entropy of posterior probabilities outputted by base classifiers.The higher entropy value means a lower weight for the base classifier. The final structureprediction is decided by the weighted combination of posterior probabilities. Extensiveexperiments on CB513 dataset demonstrates that the proposed method outperforms theexisting methods, which can effectively improve the prediction performance. 展开更多
关键词 Multi-classifier combination ENTROPY protein secondary structure prediction dynamic self-adaptation
下载PDF
Ensemble Machine Learning to Enhance Q8 Protein Secondary Structure Prediction
2
作者 Moheb R.Girgis Rofida M.Gamal Enas Elgeldawi 《Computers, Materials & Continua》 SCIE EI 2022年第11期3951-3967,共17页
Protein structure prediction is one of the most essential objectives practiced by theoretical chemistry and bioinformatics as it is of a vital importance in medicine,biotechnology and more.Protein secondary structure ... Protein structure prediction is one of the most essential objectives practiced by theoretical chemistry and bioinformatics as it is of a vital importance in medicine,biotechnology and more.Protein secondary structure prediction(PSSP)has a significant role in the prediction of protein tertiary structure,as it bridges the gap between the protein primary sequences and tertiary structure prediction.Protein secondary structures are classified into two categories:3-state category and 8-state category.Predicting the 3 states and the 8 states of secondary structures from protein sequences are called the Q3 prediction and the Q8 prediction problems,respectively.The 8 classes of secondary structures reveal more precise structural information for a variety of applications than the 3 classes of secondary structures,however,Q8 prediction has been found to be very challenging,that is why all previous work done in PSSP have focused on Q3 prediction.In this paper,we develop an ensemble Machine Learning(ML)approach for Q8 PSSP to explore the performance of ensemble learning algorithms compared to that of individual ML algorithms in Q8 PSSP.The ensemble members considered for constructing the ensemble models are well known classifiers,namely SVM(Support Vector Machines),KNN(K-Nearest Neighbor),DT(Decision Tree),RF(Random Forest),and NB(Naïve Bayes),with two feature extraction techniques,namely LDA(Linear Discriminate Analysis)and PCA(Principal Component Analysis).Experiments have been conducted for evaluating the performance of single models and ensemble models,with PCA and LDA,in Q8 PSSP.The novelty of this paper lies in the introduction of ensemble learning in Q8 PSSP problem.The experimental results confirmed that ensemble ML models are more accurate than individual ML models.They also indicated that features extracted by LDA are more effective than those extracted by PCA. 展开更多
关键词 Protein secondary structure prediction(PSSP) Q3 prediction Q8 prediction ensemble machine leaning BOOSTING BAGGING
下载PDF
IMPROVED METHOD FOR RNA SECONDARY STRUCTURE PREDICTION'
3
作者 Xue Mei YUAN Yu LUO Lu Hua LAI Xiao Jie XU Institute of Physical Chemistry,Peking University,Beijing 100871 《Chinese Chemical Letters》 SCIE CAS CSCD 1993年第8期737-740,共4页
A simple stepwise folding process has been developed to simulate RNA secondary structure formation.Modifications for the energy parameters of various loops were included in the program.Five possible types of pseudokno... A simple stepwise folding process has been developed to simulate RNA secondary structure formation.Modifications for the energy parameters of various loops were included in the program.Five possible types of pseudoknots including the well known H-type pseudoknot were permitted to occur if reasonable.We have applied this approach to e number of RNA sequences.The prediction accuracies we obtained were higher than those in published papers. 展开更多
关键词 RNA IMPROVED METHOD FOR RNA secondary structure prediction 吐司
下载PDF
A Deep Learning Approach for Prediction of Protein Secondary Structure
4
作者 Muhammad Zubair Muhammad Kashif Hanif +4 位作者 Eatedal Alabdulkreem Yazeed Ghadi Muhammad Irfan Khan Muhammad Umer Sarwar Ayesha Hanif 《Computers, Materials & Continua》 SCIE EI 2022年第8期3705-3718,共14页
The secondary structure of a protein is critical for establishing a link between the protein primary and tertiary structures.For this reason,it is important to design methods for accurate protein secondary structure p... The secondary structure of a protein is critical for establishing a link between the protein primary and tertiary structures.For this reason,it is important to design methods for accurate protein secondary structure prediction.Most of the existing computational techniques for protein structural and functional prediction are based onmachine learning with shallowframeworks.Different deep learning architectures have already been applied to tackle protein secondary structure prediction problem.In this study,deep learning based models,i.e.,convolutional neural network and long short-term memory for protein secondary structure prediction were proposed.The input to proposed models is amino acid sequences which were derived from CulledPDB dataset.Hyperparameter tuning with cross validation was employed to attain best parameters for the proposed models.The proposed models enables effective processing of amino acids and attain approximately 87.05%and 87.47%Q3 accuracy of protein secondary structure prediction for convolutional neural network and long short-term memory models,respectively. 展开更多
关键词 Convolutional neural network machine learning protein secondary structure deep learning long short-term memory protein secondary structure prediction
下载PDF
Functional structures and folding dynamics of two peptides
5
作者 盛乐标 李菁 +1 位作者 马保亮 王炜 《Chinese Physics B》 SCIE EI CAS CSCD 2005年第11期2365-2369,共5页
The folding dynamics and structural characteristics of peptides RTKAWNRQLYPEW (P1) and RTKQLYPEW (P2) are investigated by using all-atomic simulation procedure CHARMM in this work. The results show that P1, a segm... The folding dynamics and structural characteristics of peptides RTKAWNRQLYPEW (P1) and RTKQLYPEW (P2) are investigated by using all-atomic simulation procedure CHARMM in this work. The results show that P1, a segment of an antigen, has a folding motif of α-helix, whereas P2, which is derived by deleting four residues AWNR from peptide P1, prevents the formation of helix and presents a β-strand. And peptlde P1 experiences a more rugged energy landscape than peptide P2. From our results, it is inferred that the antibody CD8 cytolytic T lymphocyte prefers an antigen with a β-folding structure to that with an α-helical one. 展开更多
关键词 peptide folding molecular dynamics protein secondary structure prediction
下载PDF
Nef Mutations in Long-term Non-progressors from Former Plasma Donors Infected with HIV-1 Subtype B in China
6
作者 SHU-HUA WANG HUI XING +4 位作者 XIANG HE FENG-XIA ZHU ZHE-FENG MENG Yu-HUA RUAN YI-MING SHAO 《Biomedical and Environmental Sciences》 SCIE CAS CSCD 2008年第6期485-491,共7页
Objective To study the specific amino acid variation in Nef that may be related to disease progression after infection with HIV-1 subtype B, a predominant strain circulating in China, and to determine whether changes ... Objective To study the specific amino acid variation in Nef that may be related to disease progression after infection with HIV-1 subtype B, a predominant strain circulating in China, and to determine whether changes in Nef secondary structure may influence different stages of AIDS development based on the concept that the Nef gene of HIV infection dramatically alter the severity of viral infection and virus replication and disease progression, and that long-term non-progressors (LTNP) of HIV infection are commonly associated with either a deletion of the Nef gene or the defective Nef alleles. Methods The study subjects were divided into LTNPI(n=14), LTNP2 (n=16) and slow progressor (SP, n=19) groups for mutational analysis of the Nef sequence. The data were obtained by using Bioedit, MEGA, Anthewin and SAS software. Results Residues in Nef TA48/49 and K151 occurred more frequently in the LTNP group while AA48/49 was more frequently observed in the SP group. Of the differences observed in the secondary structure comparison using Nef consensus sequences of these three groups, one was roughly corresponding to the Nef48/49 mutation site. Conclusion TA48/49, Kiss, and AA48/49 in the Nef gene might be associated with the different stages of HIV infection, and there may be a link between the Nef secondary structure and the progression of HIV- 1 infection. 展开更多
关键词 HIV-1 NEF Long-term nonprogressors Sequence mutations secondary structure prediction
下载PDF
How Many 3D Structures Do We Need to Train a Predictor?
7
作者 Pantelis G. Bagos Georgios N. Tsaousis Stavros J. Hamodrakas 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2009年第3期128-137,共10页
It has been shown that the progress in the determination of membrane protein structure grows exponentially, with approximately the same growth rate as that of the water-soluble proteins. In order to investigate the ef... It has been shown that the progress in the determination of membrane protein structure grows exponentially, with approximately the same growth rate as that of the water-soluble proteins. In order to investigate the effect of this, on the performance of prediction algorithms for both α-helical and β-barrel membrane proteins, we conducted a prospective study based on historical records. We trained separate hidden Markov models with different sized training sets and evaluated their performance on topology prediction for the two classes of transmembrane proteins. We show that the existing top-scoring algorithms for predicting the transmembrane segments of α-helical membrane proteins perform slightly better than that of β-barrel outer membrane proteins in all measures of accuracy. With the same rationale, a metaoanalysis of the performance of the secondary structure prediction algorithms indicates that existing algorithmic techniques cannot be further improved by just adding more non-homologous sequences to the training sets. The upper limit for secondary structure prediction is estimated to be no more than 70% and 80% of correctly predicted residues for single sequence based methods and multiple sequence based ones, respectively. Therefore, we should concentrate our efforts on utilizing new techniques for the development of even better scoring predictors. 展开更多
关键词 membrane protein secondary structure prediction alpha-helical BETA-BARREL 3D structure
原文传递
Comparative and integrative analysis of RNA structural profiling data: current practices and emerging questions 被引量:1
8
作者 Krishna Choudhary Fei Deng Sharon Aviran 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2017年第1期3-24,共22页
Background: Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profilin... Background: Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profiling at transeriptome scale and in living cells, creating unprecedented opportunities for RNA biology. Propelled by these experimental advances, massive data with ever-increasing diversity and complexity have been generated, which give rise to new challenges in interpreting and analyzing these data. Results: We review current practices in analysis of structure profiling data with emphasis on comparative and integrative analysis as well as highlight emerging questions. Comparative analysis has revealed structural patterns across transcriptomes and has become an integral component of recent profiling studies. Additionally, profiling data can be integrated into traditional structure prediction algorithms to improve prediction accuracy. Conclusions: To keep pace with experimental developments, methods to facilitate, enhance and refine such analyses are needed. Parallel advances in analysis methodology will complement profiling technologies and help them reach their full potential. 展开更多
关键词 RNA structure profiling high-throughput sequencing RNA secondary structure prediction chemical structure probing SHAPE-Seq
原文传递
DCCP and DICP: Construction and Analyses of Databases for Copper-and Iron-Chelating Proteins
9
作者 Hao Wu Yan Yang +6 位作者 Sheng-Juan Jiang Ling-Ling Chen Hai-Xia Gao Qing-Shan Fu Feng Li Bin-Guang Ma Hong-Yu Zhang 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2005年第1期52-57,共6页
Copper and iron play important roles in a variety of biological processes, especially when being chelated with proteins. The proteins involved in the metal binding, transporting and metabolism have aroused much intere... Copper and iron play important roles in a variety of biological processes, especially when being chelated with proteins. The proteins involved in the metal binding, transporting and metabolism have aroused much interest. To facilitate the study on this topic, we constructed two databases (DCCP and DICP) containing the known copper- and iron-chelating proteins~ which are freely available from the website http://sdbi.sdut.edu.cn/en. Users can conveniently search and browse all of the entries in the databases. Based on the two databases, bioinformatic analyses were performed, which provided some novel insights into metalloproteins. 展开更多
关键词 copper and iron database primary sequence analysis secondary structure prediction SCOP classification
原文传递
A study of different annealing schedules in SARNA-predict A permutation based SA algorithm for RNA folding
10
作者 Herbert H.Tsang Kay C.Wiese 《International Journal of Intelligent Computing and Cybernetics》 EI 2015年第2期152-171,共20页
Purpose–The purpose of this paper is to present a study of the effect of different types of annealing schedules for a ribonucleic acid(RNA)secondary structure prediction algorithm based on simulated annealing(SA).Des... Purpose–The purpose of this paper is to present a study of the effect of different types of annealing schedules for a ribonucleic acid(RNA)secondary structure prediction algorithm based on simulated annealing(SA).Design/methodology/approach–An RNA folding algorithm was implemented that assembles the final structure from potential substructures(helixes).Structures are encoded as a permutation of helixes.An SA searches this space of permutations.Parameters and annealing schedules were studied and fine-tuned to optimize algorithm performance.Findings–In comparing with mfold,the SA algorithm shows comparable results(in terms of F-measure)even with a less sophisticated thermodynamic model.In terms of average specificity,the SA algorithm has provided surpassing results.Research limitations/implications–Most of the underlying thermodynamic models are too simplistic and incomplete to accurately model the free energy for larger structures.This is the largest limitation of free energy-based RNA folding algorithms in general.Practical implications–The algorithm offers a different approach that can be used in practice to fold RNA sequences quickly.Originality/value–The algorithm is one of only two SA-based RNA folding algorithms.The authors use a very different encoding,based on permutation of candidate helixes.The in depth study of annealing schedules and other parameters makes the algorithm a strong contender.Another benefit is that new thermodynamic models can be incorporated with relative ease(which is not the case for algorithms based on dynamic programming). 展开更多
关键词 Annealing schedule Simulated annealing RNA secondary structure prediction Ribonucleic acid RNA folding PERMUTATION
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部