Identification of the drug-binding residues on the surface of proteins is a vital step in drug discovery and it is important for understanding protein function. Most previous researches are based on the structural inf...Identification of the drug-binding residues on the surface of proteins is a vital step in drug discovery and it is important for understanding protein function. Most previous researches are based on the structural information of proteins, but the structures of most proteins are not available. So in this article, a sequence-based method was proposed by combining the support vector machine (SVM)-based ensemble learning and the improved position specific scoring matrix (PSSM). In order to take the local environment information of a drug-binding site into account, an improved PSSM profile scaled by the sliding window and smoothing window was used to improve the prediction result. In addition, a new SVM-based ensemble learning method was developed to deal with the imbalanced data classification problem that commonly exists in the binding site predictions. When performed on the dataset of 985 drug-binding residues, the method achieved a very promising prediction result with the area under the curve (AUC) of 0.9264. Furthermore, an independent dataset of 349 drug- binding residues was used to evaluate the pre- diction model and the prediction accuracy is 84.68%. These results suggest that our method is effective for predicting the drug-binding sites in proteins. The code and all datasets used in this article are freely available at http://cic.scu.edu.cn/bioinformatics/Ensem_DBS.zip.展开更多
Pancreatic ductal adenocarcinoma(PDAC) is one of the most aggressive diseases and is characterized by high chemoresistance, leading to the lack of effective therapeutic approaches and grim prognosis. Despite increasin...Pancreatic ductal adenocarcinoma(PDAC) is one of the most aggressive diseases and is characterized by high chemoresistance, leading to the lack of effective therapeutic approaches and grim prognosis. Despite increasing understanding of the mechanisms of chemoresistance in cancer and the role of ATPbinding cassette(ABC) transporters in this resistance, the therapeutic potential of their pharmacological inhibition has not been successfully exploited yet. In spite of the discovery of potent pharmacological modulators of ABC transporters, the results obtained in clinical trials have been so far disappointing, with high toxicity levels impairing their successful administration to the patients. Critically, although ABC transporters have been mostly studied for their involvement in development of multidrug resistance(MDR), in recent years the contribution of ABC transporters to cancer initiation and progression has emerged as an important area of research, the understanding of which could significantly influence the development of more specific and efficient therapies. In this review, we explore the role of ABC transporters in the development and progression of malignancies, with focus on PDAC. Their established involvement in development of MDR will be also presented. Moreover, an emerging role for ABC transporters as prognostic tools for patients' survival will be discussed, demonstrating the therapeutic potential of ABC transporters in cancer therapy.展开更多
In a recent study published in Nature Medicine,Wang et al.developed an excellent framework called UniBind based on artificial intelligence(AI),which enables accurately predicting infectivity of SARS-CoV-2 variants and...In a recent study published in Nature Medicine,Wang et al.developed an excellent framework called UniBind based on artificial intelligence(AI),which enables accurately predicting infectivity of SARS-CoV-2 variants and evolutionary trends of future viral variants.1 This computational method holds the possibility to not only serve as a valuable early-warning tool for monitoring potential pathogenic SARS-CoV-2 variants but also facilitate fundamental research on protein-protein interactions(PPIs).展开更多
Transcription Factors(TFs) are a very diverse family of DNA-binding proteins that play essential roles in the regulation of gene expression through binding to specific DNA sequences. They are considered as one of th...Transcription Factors(TFs) are a very diverse family of DNA-binding proteins that play essential roles in the regulation of gene expression through binding to specific DNA sequences. They are considered as one of the prime drug targets since mutations and aberrant TF-DNA interactions are implicated in many diseases.Identification of TF-binding sites on a genomic scale represents a critical step in delineating transcription regulatory networks and remains a major goal in genomic annotations. Recent development of experimental high-throughput technologies has provided valuable information about TF-binding sites at genome scale under various physiological and developmental conditions. Computational approaches can provide a cost-effective alternative and complement the experimental methods by using the vast quantities of available sequence or structural information. In this review we focus on structure-based prediction of transcription factor binding sites. In addition to its potential in genomescale predictions, structure-based approaches can help us better understand the TF-DNA interaction mechanisms and the evolution of transcription factors and their target binding sites. The success of structure-based methods also bears a translational impact on targeted drug design in medicine and biotechnology.展开更多
Effective identification of major histocompatibility complex (MHC) molecules restricted peptides is a critical step in discovering immune epitopes. Although many online servers have been built to predict class Ⅱ MH...Effective identification of major histocompatibility complex (MHC) molecules restricted peptides is a critical step in discovering immune epitopes. Although many online servers have been built to predict class Ⅱ MHC-peptide binding affinity, they have been trained on different datasets, and thus fail in providing a unified comparison of various methods. In this paper, we present our implementation of seven popular predictive methods, namely SMM-align, ARB, SVR-pairwise, Gibbs sampler. ProPred, LP-top2, and MHCPred, on a single web server named BiodMHC (http://biod.whu.edu.cn/BiodMHC/index.html, the software is available upon request). Using a standard measure of AUC (Area Under the receiver operating characteristic Curves), we compare these methods by means of not only cross validation but also prediction on independent test datasets. We find that SMM-align, ProPred, SVR-pairwise, ARB, and Gibbs sampler are the five best-performing methods. For the binding affinity prediction of class Ⅱ MHC-peptide, BiodMHC provides a convenient online platform for researchers to obtain binding information simultaneously using various methods.展开更多
Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,n...Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,not free from drug resistance problems.Genomic studies have shown that some genetic mutations in Mycobacterium tuberculosis(Mtb)EmbR,and EmbC/A/B genes cause EMB resistance.EmbR-PknH pair controls embC/A/B operon,which encodes EmbC/A/B genes,and EMB interacts with EmbA/B proteins.However,the EmbR binding site on PknH was unknown.We conducted molecular simulation on the EmbR-peptides binding structures and discovered phosphorylated PknH 273-280(N′-HEALS^(P)DPD-C′)makesβstrand with the EmbR FHA domain,asβ-MoRF(MoRF;molecular recognition feature)does at its binding site.Hydrogen bond number analysis also supported the peptides’β-MoRF forming activity at the EmbR FHA domain.Also,we discovered that previously known phosphorylation residues might have their chronological order according to the phosphorylation status.The discovery validated that Mtb PknH 273-280(N′-HEALSDPD-C′)has reliable EmbR binding affinity.This approach is revolutionary in the computer-aided drug discovery field,because it is the first trial to discover the protein-protein interaction site,and find binding partner in nature from this site.展开更多
This study describes a method for predicting and classifying oxygen-binding proteins. Firstly, support vector machine (SVM) modules were developed using amino acid composition and dipeptide composition for predictin...This study describes a method for predicting and classifying oxygen-binding proteins. Firstly, support vector machine (SVM) modules were developed using amino acid composition and dipeptide composition for predicting oxygen-binding proteins, and achieved maximum accuracy of 85.5% and 87.8%, respectively. Secondly, an SVM module was developed based on amino acid composition, classifying the predicted oxygen-binding proteins into six classes with accuracy of 95.8%, 97.5%, 97.5%, 96.9%, 99.4%, and 96.0% for erythrocruorin, hemerythrin, hemocyanin, hemoglobin, leghemoglobin, and myoglobin proteins, respectively. Finally, an SVM module was developed using dipeptide composition for classifying the oxygen-binding proteins, and achieved maximum accuracy of 96.1%, 98.7%, 98.7%, 85.6%, 99.6%, and 93.3% for the above six classes, respectively. All modules were trained and tested by five-fold cross validation. Based on the above approach, a web server Oxypred was developed for predicting and classifying oxygen-binding proteins (available from http://www.imtech.res.in/raghava/oxypred/).展开更多
文摘Identification of the drug-binding residues on the surface of proteins is a vital step in drug discovery and it is important for understanding protein function. Most previous researches are based on the structural information of proteins, but the structures of most proteins are not available. So in this article, a sequence-based method was proposed by combining the support vector machine (SVM)-based ensemble learning and the improved position specific scoring matrix (PSSM). In order to take the local environment information of a drug-binding site into account, an improved PSSM profile scaled by the sliding window and smoothing window was used to improve the prediction result. In addition, a new SVM-based ensemble learning method was developed to deal with the imbalanced data classification problem that commonly exists in the binding site predictions. When performed on the dataset of 985 drug-binding residues, the method achieved a very promising prediction result with the area under the curve (AUC) of 0.9264. Furthermore, an independent dataset of 349 drug- binding residues was used to evaluate the pre- diction model and the prediction accuracy is 84.68%. These results suggest that our method is effective for predicting the drug-binding sites in proteins. The code and all datasets used in this article are freely available at http://cic.scu.edu.cn/bioinformatics/Ensem_DBS.zip.
文摘Pancreatic ductal adenocarcinoma(PDAC) is one of the most aggressive diseases and is characterized by high chemoresistance, leading to the lack of effective therapeutic approaches and grim prognosis. Despite increasing understanding of the mechanisms of chemoresistance in cancer and the role of ATPbinding cassette(ABC) transporters in this resistance, the therapeutic potential of their pharmacological inhibition has not been successfully exploited yet. In spite of the discovery of potent pharmacological modulators of ABC transporters, the results obtained in clinical trials have been so far disappointing, with high toxicity levels impairing their successful administration to the patients. Critically, although ABC transporters have been mostly studied for their involvement in development of multidrug resistance(MDR), in recent years the contribution of ABC transporters to cancer initiation and progression has emerged as an important area of research, the understanding of which could significantly influence the development of more specific and efficient therapies. In this review, we explore the role of ABC transporters in the development and progression of malignancies, with focus on PDAC. Their established involvement in development of MDR will be also presented. Moreover, an emerging role for ABC transporters as prognostic tools for patients' survival will be discussed, demonstrating the therapeutic potential of ABC transporters in cancer therapy.
基金the National Natural Science Foundation of China:82201932,82025001ZhongNanShan Medical Foundation of Guangdong Province:ZNSA-2020012Guangdong Basic and Applied Basic Research Foundation:2022B1515020059,2021B1515130005.
文摘In a recent study published in Nature Medicine,Wang et al.developed an excellent framework called UniBind based on artificial intelligence(AI),which enables accurately predicting infectivity of SARS-CoV-2 variants and evolutionary trends of future viral variants.1 This computational method holds the possibility to not only serve as a valuable early-warning tool for monitoring potential pathogenic SARS-CoV-2 variants but also facilitate fundamental research on protein-protein interactions(PPIs).
基金supported by the National Science Foundation #DBI-0844749 and #DBI-1356459 to JTG
文摘Transcription Factors(TFs) are a very diverse family of DNA-binding proteins that play essential roles in the regulation of gene expression through binding to specific DNA sequences. They are considered as one of the prime drug targets since mutations and aberrant TF-DNA interactions are implicated in many diseases.Identification of TF-binding sites on a genomic scale represents a critical step in delineating transcription regulatory networks and remains a major goal in genomic annotations. Recent development of experimental high-throughput technologies has provided valuable information about TF-binding sites at genome scale under various physiological and developmental conditions. Computational approaches can provide a cost-effective alternative and complement the experimental methods by using the vast quantities of available sequence or structural information. In this review we focus on structure-based prediction of transcription factor binding sites. In addition to its potential in genomescale predictions, structure-based approaches can help us better understand the TF-DNA interaction mechanisms and the evolution of transcription factors and their target binding sites. The success of structure-based methods also bears a translational impact on targeted drug design in medicine and biotechnology.
基金supported by the National Nature Science Foundation of China (No.60773010)the Shanghai Committee of Science and Technology, China (No.08DZ2271800 and 09DZ2272800)
文摘Effective identification of major histocompatibility complex (MHC) molecules restricted peptides is a critical step in discovering immune epitopes. Although many online servers have been built to predict class Ⅱ MHC-peptide binding affinity, they have been trained on different datasets, and thus fail in providing a unified comparison of various methods. In this paper, we present our implementation of seven popular predictive methods, namely SMM-align, ARB, SVR-pairwise, Gibbs sampler. ProPred, LP-top2, and MHCPred, on a single web server named BiodMHC (http://biod.whu.edu.cn/BiodMHC/index.html, the software is available upon request). Using a standard measure of AUC (Area Under the receiver operating characteristic Curves), we compare these methods by means of not only cross validation but also prediction on independent test datasets. We find that SMM-align, ProPred, SVR-pairwise, ARB, and Gibbs sampler are the five best-performing methods. For the binding affinity prediction of class Ⅱ MHC-peptide, BiodMHC provides a convenient online platform for researchers to obtain binding information simultaneously using various methods.
基金This work was supported by the National Institutes of Health Grant No.7R01GM118467-05the National Natural Science Foundation of China(31720103901).
文摘Tuberculosis drug resistance continues to threaten global health but the underline molecular mechanisms are not clear.Ethambutol(EMB),one of the well-known first-line drugs in tuberculosis treatment is,unfortunately,not free from drug resistance problems.Genomic studies have shown that some genetic mutations in Mycobacterium tuberculosis(Mtb)EmbR,and EmbC/A/B genes cause EMB resistance.EmbR-PknH pair controls embC/A/B operon,which encodes EmbC/A/B genes,and EMB interacts with EmbA/B proteins.However,the EmbR binding site on PknH was unknown.We conducted molecular simulation on the EmbR-peptides binding structures and discovered phosphorylated PknH 273-280(N′-HEALS^(P)DPD-C′)makesβstrand with the EmbR FHA domain,asβ-MoRF(MoRF;molecular recognition feature)does at its binding site.Hydrogen bond number analysis also supported the peptides’β-MoRF forming activity at the EmbR FHA domain.Also,we discovered that previously known phosphorylation residues might have their chronological order according to the phosphorylation status.The discovery validated that Mtb PknH 273-280(N′-HEALSDPD-C′)has reliable EmbR binding affinity.This approach is revolutionary in the computer-aided drug discovery field,because it is the first trial to discover the protein-protein interaction site,and find binding partner in nature from this site.
文摘This study describes a method for predicting and classifying oxygen-binding proteins. Firstly, support vector machine (SVM) modules were developed using amino acid composition and dipeptide composition for predicting oxygen-binding proteins, and achieved maximum accuracy of 85.5% and 87.8%, respectively. Secondly, an SVM module was developed based on amino acid composition, classifying the predicted oxygen-binding proteins into six classes with accuracy of 95.8%, 97.5%, 97.5%, 96.9%, 99.4%, and 96.0% for erythrocruorin, hemerythrin, hemocyanin, hemoglobin, leghemoglobin, and myoglobin proteins, respectively. Finally, an SVM module was developed using dipeptide composition for classifying the oxygen-binding proteins, and achieved maximum accuracy of 96.1%, 98.7%, 98.7%, 85.6%, 99.6%, and 93.3% for the above six classes, respectively. All modules were trained and tested by five-fold cross validation. Based on the above approach, a web server Oxypred was developed for predicting and classifying oxygen-binding proteins (available from http://www.imtech.res.in/raghava/oxypred/).