Background:To develop a protein-protein interaction network of Paroxysmal nocturnal hemoglobinuria(PNH)and Aplastic anemia(AA)based on genetic genes and to predict pathways underlying the molecular complexes in the ne...Background:To develop a protein-protein interaction network of Paroxysmal nocturnal hemoglobinuria(PNH)and Aplastic anemia(AA)based on genetic genes and to predict pathways underlying the molecular complexes in the network.Methods:In this research,the PNH and AA-related genes were screened through Online Mendelian Inheritance in Man(OMIM).The plugins and Cytoscape were used to search literature and build a protein-protein interaction network.Results:The protein-protein interaction network contains two molecular complexes that are five higher than the correlation integral values.The target genes of this study were obtained:CD59,STAT3,TERC,TNF,AKT1,C5AR1,EPO,IL6,IL10 and so on.We also found that many factors regulate biological behaviors:neutrophils,macrophages,vascular endothelial growth factor,immunoglobulin,interleukin,cytokine receptor,interleukin-6 receptor,tumor necrosis factor,and so on.This research provides a bioinformatics foundation for further explaining the mechanism of common development of both.Conclusion:This indicates that the PNH and AA is a complex process regulated by many cellular pathways and multiple genes.展开更多
AIM:To understand the complex reaction of gastric inflammation induced by Helicobacter pylori(H pylori) in a systematic manner using a protein interaction network. METHODS:The expression of genes significantly changed...AIM:To understand the complex reaction of gastric inflammation induced by Helicobacter pylori(H pylori) in a systematic manner using a protein interaction network. METHODS:The expression of genes significantly changed on microarray during H pylori infection was scanned from the web literary database and translated into proteins.A network of protein interactions was constructed by searching the primary interactions of selected proteins.The constructed network was mathematically analyzed and its biological function was examined.In addition,the nodes on the network were checked to determine if they had any further functional importance or relation to other proteins by extending them. RESULTS:The scale-free network showing the relationship between inflammation and carcinogenesis was constructed.Mathematical analysis showed hub and bottleneck proteins,and these proteins were mostly related to immune response.The network contained pathways and proteins related to H pylori infection,such as the JAK-STAT pathway triggered by interleukins.Activation of nuclear factor (NF)-κB,TLR4,and other proteins known to function as core proteins of immune response were also found. These immune-related proteins interacted on the network with pathways and proteins related to the cell cycle,cell maintenance and proliferation,andtranscription regulators such as BRCA1,FOS,REL,and zinc finger proteins.The extension of nodes showed interactions of the immune proteins with cancer- related proteins.One extended network,the core network,a summarized form of the extended network, and cell pathway model were constructed. CONCLUSION:Immune-related proteins activated by H pylori infection interact with proto-oncogene proteins.The hub and bottleneck proteins are potential drug targets for gastric inflammation and cancer.展开更多
Curcumin, the medically active component from Curcuma Tonga (Turmeric), is widely used to treat inflammatory diseases. Protein interaction network (PIN) analysis was used to predict its mechanisms of molecular action....Curcumin, the medically active component from Curcuma Tonga (Turmeric), is widely used to treat inflammatory diseases. Protein interaction network (PIN) analysis was used to predict its mechanisms of molecular action. Targets of curcumin were obtained based on ChEMBL and STITCH databases. Protein protein interactions (PPIs) were extracted from the String database. The PIN of curcumin was constructed by Cytoscape and the function modules identified by gene ontology (GO) enrichment analysis based on molecular complex detection (MCODE). A PIN of curcumin with 482 nodes and 1688 interactions was constructed, which has scale-free, small world and modular properties. Based on analysis of these function modules, the mechanism of curcumin is proposed. Two modules were found to be intimately associated with inflammation. With function modules analysis, the anti-inflammatory effects of curcumin were related to SMAD, ERG and mediation by the TLR family. TLR9 may be a potential target of curcumin to treat inflammation. (C) 2015 Chinese Pharmaceutical Association and Institute of Materia Medica, Chinese Academy of Medical Sciences. Production and hosting by Elsevier B.V.展开更多
In the post-genomic era,proteomics has achieved significant theoretical and practical advances with the development of high-throughput technologies.Especially the rapid accumulation of protein-protein interactions(PPI...In the post-genomic era,proteomics has achieved significant theoretical and practical advances with the development of high-throughput technologies.Especially the rapid accumulation of protein-protein interactions(PPIs)provides a foundation for constructing protein interaction networks(PINs),which can furnish a new perspective for understanding cellular organizations,processes,and functions at network level.In this paper,we present a comprehensive survey on three main characteristics of PINs:centrality,modularity,and dynamics.1)Different centrality measures,which are used to calculate the importance of proteins,are summarized based on the structural characteristics of PINs or on the basis of its integrated biological information;2)Different modularity definitions and various clustering algorithms for predicting protein complexes or identifying functional modules are introduced;3)The dynamics of proteins,PPIs and sub-networks are discussed,respectively.Finally,the main applications of PINs in the complex diseases are reviewed,and the challenges and future research directions are also discussed.展开更多
Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have b...Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein–protein interaction(PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.展开更多
Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing gen...Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments.With the advances of the high-throughput techniques,a large number of protein-protein interactions have been produced.Therefore,to address this issue,several methods based on protein interaction network have been proposed.In this paper,we propose a shortest path-based algorithm,named SPranker,to prioritize disease-causing genes in protein interaction networks.Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes,we further propose an improved algorithm SPGOranker by integrating the semantic similarity of gene ontology(GO)annotations.SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account.The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches,ICN,VS and RWR.The experimental results show that SPranker and SPGOranker outperform ICN,VS,and RWR for the prioritization of orphan disease-causing genes.Importantly,for the case study of severe combined immunodeficiency,SPranker and SPGOranker predict several novel causal genes.展开更多
A fundamental principle of biology is that proteins tend to form complexes to play important roles in the core functions of cells.For a complete understanding of human cellular functions,it is crucial to have a compre...A fundamental principle of biology is that proteins tend to form complexes to play important roles in the core functions of cells.For a complete understanding of human cellular functions,it is crucial to have a comprehensive atlas of human protein complexes.Unfortunately,we still lack such a comprehensive atlas of experimentally validated protein complexes,which prevents us from gaining a complete understanding of the compositions and functions of human protein complexes,as well as the underlying biological mechanisms.To fill this gap,we built Human Protein Complexes Atlas(HPC-Atlas),as far as we know,the most accurate and comprehensive atlas of human protein complexes available to date.We integrated two latest protein interaction networks,and developed a novel computational method to identify nearly 9000 protein complexes,including many previously uncharacterized complexes.Compared with the existing methods,our method achieved outstanding performance on both testing and independent datasets.Furthermore,with HPC-Atlas we identified 751 severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)-affected human protein complexes,and 456 multifunctional proteins that contain many potential moonlighting proteins.These results suggest that HPC-Atlas can serve as not only a computing framework to effectively identify biologically meaningful protein complexes by integrating multiple protein data sources,but also a valuable resource for exploring new biological findings.The HPCAtlas webserver is freely available at http://www.yulpan.top/HPC-Atlas.展开更多
In the present study, 28 Chinese medicinal herbs belonging to traditional Chinese medicine(TCM) for the treatment of type 2 diabetes were selected to explore the application of network pharmacology in developing new C...In the present study, 28 Chinese medicinal herbs belonging to traditional Chinese medicine(TCM) for the treatment of type 2 diabetes were selected to explore the application of network pharmacology in developing new Chinese herbal medicine formulae for the treatment of type 2 diabetes mellitus(T2DM). These herbs have the highest appearance rate in the literature, and their compounds are listed. The human protein–protein interaction network and the T2DM disease protein interaction network were constructed. Then, the related algorithm for network topology was used to perform interventions on the interaction network of disease proteins and normal human proteins to test different Chinese herbal medicine compound combinations, according to the information on the interaction of compounds–targets in two databases, namely TarN et and the Medicinal Plants Database. Results of the intervention scores indicate that the method proposed in this study can provide new effective combinations of Chinese herbal medicines for T2DM. Network pharmacology can effectively promote the modernization and development of TCM.展开更多
Signal transduction pathways play important roles in various biological processes such as cell cycle, apoptosis, proliferation, differentiation and responses to the external stimuli. Efficient computational methods ar...Signal transduction pathways play important roles in various biological processes such as cell cycle, apoptosis, proliferation, differentiation and responses to the external stimuli. Efficient computational methods are of great demands to map signaling pathways systematically based on the interactome and microarray data in the post-genome era. This paper proposes a novel approach to infer the pathways based on the network flow well studied in the operation research. The authors define a potentiality variable for each protein to denote the extent to which it contributes to the objective pathway. And the capacity on each edge is not a constant but a function of the potentiality variables of the corresponding two proteins. The total potentiality of all proteins is given an upper bound. The approach is formulated to a linear programming model and solved by the simplex method. Experiments on the yeast sporulation data suggest this novel approach recreats successfully the backbone of the MAPK signaling pathway with a low upper bound of the total potentiality. By increasing the upper bound, the approach successfully predicts all the members of the Mitogen-activated protein kinases (MAPK) pathway responding to the pheromone. This simple but effective approach can also be used to infer the genetic information processing pathways underlying the expression quantitative trait loci (eQTL) associations, illustrated by the second example.展开更多
Biological network alignment is an important research topic in the field of bioinformatics. Nowadays almost every existing alignment method is designed to solve the deterministic biological network alignment problem.H...Biological network alignment is an important research topic in the field of bioinformatics. Nowadays almost every existing alignment method is designed to solve the deterministic biological network alignment problem.However, it is worth noting that interactions in biological networks, like many other processes in the biological realm,are probabilistic events. Therefore, more accurate and better results can be obtained if biological networks are characterized by probabilistic graphs. This probabilistic information, however, increases difficulties in analyzing networks and only few methods can handle the probabilistic information. Therefore, in this paper, an improved Probabilistic Biological Network Alignment(PBNA) is proposed. Based on Iso Rank, PBNA is able to use the probabilistic information. Furthermore, PBNA takes advantages of Contributor and Probability Generating Function(PGF) to improve the accuracy of node similarity value and reduce the computational complexity of random variables in similarity matrix. Experimental results on dataset of the Protein-Protein Interaction(PPI) networks provided by Todor demonstrate that PBNA can produce some alignment results that ignored by the deterministic methods, and produce more biologically meaningful alignment results than Iso Rank does in most of the cases based on the Gene Ontology Consistency(GOC) measure. Compared with Prob method, which is designed exactly to solve the probabilistic alignment problem, PBNA can obtain more biologically meaningful mappings in less time.展开更多
Background Small cell lung cancer(SCLC)is a highly malignant and aggressive neuroendocrine tumor.With the rise of immunotherapy,it has provided a new direction for SCLC.However,due to the lack of prognostic biomarkers...Background Small cell lung cancer(SCLC)is a highly malignant and aggressive neuroendocrine tumor.With the rise of immunotherapy,it has provided a new direction for SCLC.However,due to the lack of prognostic biomarkers,the median overall survival of SCLC is still to be improved.This study aimed to explore novel biomarkers and tumor-infiltrating immune cell characteristics that may serve as potential diagnostic and prognostic markers in SCLC.Methods Gene expression profiles from patients with SCLC were downloaded from the Gene Expression Omnibus(GEO)database,and tumor microenvironment(TME)infiltration profile data were obtained using CIBERSORT.The robust rank aggregation(RRA)method was utilized to integrate three SCLC microarray datasets downloaded from the GEO database and identify robust differentially expressed genes(DEGs)between normal and tumor tissue samples.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)enrichment analyses were performed to explore the functions of the robust DEGs.Subsequently,protein-protein interaction networks and key modules were constructed by Cytoscape,and hub genes were selected from the whole network using the plugin cytoHubba.Survival analysis of hub genes was performed by Kaplan-Meier plotter in 18 patients with extensive-stage SCLC.Results A total of 312 robust DEGs,including 55 upregulated and 257 downregulated genes,were screened from 129 SCLC tissue samples and 44 normal tissue samples.GO and KEGG enrichment analyses revealed that the robust DEGs were predominantly involved in human T-cell leukemia virus 1 infection,focal adhesion,complement and coagulation cascades,tumor necrosis factor(TNF)signaling pathway,and ECM-receptor interaction,which are closely associated with the development and progression of SCLC.Subsequently,three DEGs modules and six hub genes(ITGA10,DUSP12,PTGS2,FOS,TGFBR2,and ICAM1)were identified through screening with the Cytoscape plugins MCODE and cytoHubba,respectively.Immune cell infiltration analysis by the CIBERSORT algorithm revealed that resting memory CD4+T cells were the predominant infiltrating immune cells in SCLC.In addition,Kaplan-Meier plotter revealed that the gene prostaglandin-endoperoxide synthase 2(PTGS2)was a potential prognostic biomarker of SCLC.Conclusions Hub genes and tumor-infiltrating immune cells may be the molecular mechanisms underlying the development of SCLC,and this finding could contribute to the formulation of individualized immunotherapy strategies for SCLC.展开更多
文摘Background:To develop a protein-protein interaction network of Paroxysmal nocturnal hemoglobinuria(PNH)and Aplastic anemia(AA)based on genetic genes and to predict pathways underlying the molecular complexes in the network.Methods:In this research,the PNH and AA-related genes were screened through Online Mendelian Inheritance in Man(OMIM).The plugins and Cytoscape were used to search literature and build a protein-protein interaction network.Results:The protein-protein interaction network contains two molecular complexes that are five higher than the correlation integral values.The target genes of this study were obtained:CD59,STAT3,TERC,TNF,AKT1,C5AR1,EPO,IL6,IL10 and so on.We also found that many factors regulate biological behaviors:neutrophils,macrophages,vascular endothelial growth factor,immunoglobulin,interleukin,cytokine receptor,interleukin-6 receptor,tumor necrosis factor,and so on.This research provides a bioinformatics foundation for further explaining the mechanism of common development of both.Conclusion:This indicates that the PNH and AA is a complex process regulated by many cellular pathways and multiple genes.
文摘AIM:To understand the complex reaction of gastric inflammation induced by Helicobacter pylori(H pylori) in a systematic manner using a protein interaction network. METHODS:The expression of genes significantly changed on microarray during H pylori infection was scanned from the web literary database and translated into proteins.A network of protein interactions was constructed by searching the primary interactions of selected proteins.The constructed network was mathematically analyzed and its biological function was examined.In addition,the nodes on the network were checked to determine if they had any further functional importance or relation to other proteins by extending them. RESULTS:The scale-free network showing the relationship between inflammation and carcinogenesis was constructed.Mathematical analysis showed hub and bottleneck proteins,and these proteins were mostly related to immune response.The network contained pathways and proteins related to H pylori infection,such as the JAK-STAT pathway triggered by interleukins.Activation of nuclear factor (NF)-κB,TLR4,and other proteins known to function as core proteins of immune response were also found. These immune-related proteins interacted on the network with pathways and proteins related to the cell cycle,cell maintenance and proliferation,andtranscription regulators such as BRCA1,FOS,REL,and zinc finger proteins.The extension of nodes showed interactions of the immune proteins with cancer- related proteins.One extended network,the core network,a summarized form of the extended network, and cell pathway model were constructed. CONCLUSION:Immune-related proteins activated by H pylori infection interact with proto-oncogene proteins.The hub and bottleneck proteins are potential drug targets for gastric inflammation and cancer.
基金supported by grants from the National Natural Science Foundation of China(Grant No.81403103)Chinese Medicine Resources(Sichuan Province)Youth Science and Technology Innovation Team(Grant No.2015TD0028)+1 种基金Sichuan Province Science and Technology Support Plan(Grant No.2014SZ0156)Sichuan Province Education Department Project(Grant No.2013SZB0781)
文摘Curcumin, the medically active component from Curcuma Tonga (Turmeric), is widely used to treat inflammatory diseases. Protein interaction network (PIN) analysis was used to predict its mechanisms of molecular action. Targets of curcumin were obtained based on ChEMBL and STITCH databases. Protein protein interactions (PPIs) were extracted from the String database. The PIN of curcumin was constructed by Cytoscape and the function modules identified by gene ontology (GO) enrichment analysis based on molecular complex detection (MCODE). A PIN of curcumin with 482 nodes and 1688 interactions was constructed, which has scale-free, small world and modular properties. Based on analysis of these function modules, the mechanism of curcumin is proposed. Two modules were found to be intimately associated with inflammation. With function modules analysis, the anti-inflammatory effects of curcumin were related to SMAD, ERG and mediation by the TLR family. TLR9 may be a potential target of curcumin to treat inflammation. (C) 2015 Chinese Pharmaceutical Association and Institute of Materia Medica, Chinese Academy of Medical Sciences. Production and hosting by Elsevier B.V.
基金This work was supported in part by the National Natural Science Foundation of China(Grants Nos.61832019,61622213)the Fundamental Research Funds for the Central Universities,CSU(2282019SYLB004)Hunan Provincial Science and Technology Program(2019CB1007).
文摘In the post-genomic era,proteomics has achieved significant theoretical and practical advances with the development of high-throughput technologies.Especially the rapid accumulation of protein-protein interactions(PPIs)provides a foundation for constructing protein interaction networks(PINs),which can furnish a new perspective for understanding cellular organizations,processes,and functions at network level.In this paper,we present a comprehensive survey on three main characteristics of PINs:centrality,modularity,and dynamics.1)Different centrality measures,which are used to calculate the importance of proteins,are summarized based on the structural characteristics of PINs or on the basis of its integrated biological information;2)Different modularity definitions and various clustering algorithms for predicting protein complexes or identifying functional modules are introduced;3)The dynamics of proteins,PPIs and sub-networks are discussed,respectively.Finally,the main applications of PINs in the complex diseases are reviewed,and the challenges and future research directions are also discussed.
基金Project supported by the Gansu Province Industrial Support Plan (Grant No.2023CYZC-25)the Natural Science Foundation of Gansu Province (Grant No.23JRRA770)the National Natural Science Foundation of China (Grant No.62162040)。
文摘Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein–protein interaction(PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.
基金supported in part by the National Natural Science Foundation of China(61370024,61428209,61232001)Program for New Century Excellent Talents in University(NCET-12-0547)
文摘Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments.With the advances of the high-throughput techniques,a large number of protein-protein interactions have been produced.Therefore,to address this issue,several methods based on protein interaction network have been proposed.In this paper,we propose a shortest path-based algorithm,named SPranker,to prioritize disease-causing genes in protein interaction networks.Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes,we further propose an improved algorithm SPGOranker by integrating the semantic similarity of gene ontology(GO)annotations.SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account.The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches,ICN,VS and RWR.The experimental results show that SPranker and SPGOranker outperform ICN,VS,and RWR for the prioritization of orphan disease-causing genes.Importantly,for the case study of severe combined immunodeficiency,SPranker and SPGOranker predict several novel causal genes.
基金supported by the National Natural Science Foundation of China(Grant Nos.61972100 and 62172300).
文摘A fundamental principle of biology is that proteins tend to form complexes to play important roles in the core functions of cells.For a complete understanding of human cellular functions,it is crucial to have a comprehensive atlas of human protein complexes.Unfortunately,we still lack such a comprehensive atlas of experimentally validated protein complexes,which prevents us from gaining a complete understanding of the compositions and functions of human protein complexes,as well as the underlying biological mechanisms.To fill this gap,we built Human Protein Complexes Atlas(HPC-Atlas),as far as we know,the most accurate and comprehensive atlas of human protein complexes available to date.We integrated two latest protein interaction networks,and developed a novel computational method to identify nearly 9000 protein complexes,including many previously uncharacterized complexes.Compared with the existing methods,our method achieved outstanding performance on both testing and independent datasets.Furthermore,with HPC-Atlas we identified 751 severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)-affected human protein complexes,and 456 multifunctional proteins that contain many potential moonlighting proteins.These results suggest that HPC-Atlas can serve as not only a computing framework to effectively identify biologically meaningful protein complexes by integrating multiple protein data sources,but also a valuable resource for exploring new biological findings.The HPCAtlas webserver is freely available at http://www.yulpan.top/HPC-Atlas.
基金supported by the National Natural Sciences Foundation of China(No.81374011)
文摘In the present study, 28 Chinese medicinal herbs belonging to traditional Chinese medicine(TCM) for the treatment of type 2 diabetes were selected to explore the application of network pharmacology in developing new Chinese herbal medicine formulae for the treatment of type 2 diabetes mellitus(T2DM). These herbs have the highest appearance rate in the literature, and their compounds are listed. The human protein–protein interaction network and the T2DM disease protein interaction network were constructed. Then, the related algorithm for network topology was used to perform interventions on the interaction network of disease proteins and normal human proteins to test different Chinese herbal medicine compound combinations, according to the information on the interaction of compounds–targets in two databases, namely TarN et and the Medicinal Plants Database. Results of the intervention scores indicate that the method proposed in this study can provide new effective combinations of Chinese herbal medicines for T2DM. Network pharmacology can effectively promote the modernization and development of TCM.
基金This research is supported by the National Natural Science Foundation of China under Grant Nos. 10631070, 60873205, the grant kjcx-yw-s7 from Chinese Academy of Sciences, and 2006CB503905 from Ministry of Science and Technology of China.
文摘Signal transduction pathways play important roles in various biological processes such as cell cycle, apoptosis, proliferation, differentiation and responses to the external stimuli. Efficient computational methods are of great demands to map signaling pathways systematically based on the interactome and microarray data in the post-genome era. This paper proposes a novel approach to infer the pathways based on the network flow well studied in the operation research. The authors define a potentiality variable for each protein to denote the extent to which it contributes to the objective pathway. And the capacity on each edge is not a constant but a function of the potentiality variables of the corresponding two proteins. The total potentiality of all proteins is given an upper bound. The approach is formulated to a linear programming model and solved by the simplex method. Experiments on the yeast sporulation data suggest this novel approach recreats successfully the backbone of the MAPK signaling pathway with a low upper bound of the total potentiality. By increasing the upper bound, the approach successfully predicts all the members of the Mitogen-activated protein kinases (MAPK) pathway responding to the pheromone. This simple but effective approach can also be used to infer the genetic information processing pathways underlying the expression quantitative trait loci (eQTL) associations, illustrated by the second example.
基金supported by the Natural Science Foundation of Jiangsu Province under Grant No. BK2012742
文摘Biological network alignment is an important research topic in the field of bioinformatics. Nowadays almost every existing alignment method is designed to solve the deterministic biological network alignment problem.However, it is worth noting that interactions in biological networks, like many other processes in the biological realm,are probabilistic events. Therefore, more accurate and better results can be obtained if biological networks are characterized by probabilistic graphs. This probabilistic information, however, increases difficulties in analyzing networks and only few methods can handle the probabilistic information. Therefore, in this paper, an improved Probabilistic Biological Network Alignment(PBNA) is proposed. Based on Iso Rank, PBNA is able to use the probabilistic information. Furthermore, PBNA takes advantages of Contributor and Probability Generating Function(PGF) to improve the accuracy of node similarity value and reduce the computational complexity of random variables in similarity matrix. Experimental results on dataset of the Protein-Protein Interaction(PPI) networks provided by Todor demonstrate that PBNA can produce some alignment results that ignored by the deterministic methods, and produce more biologically meaningful alignment results than Iso Rank does in most of the cases based on the Gene Ontology Consistency(GOC) measure. Compared with Prob method, which is designed exactly to solve the probabilistic alignment problem, PBNA can obtain more biologically meaningful mappings in less time.
文摘Background Small cell lung cancer(SCLC)is a highly malignant and aggressive neuroendocrine tumor.With the rise of immunotherapy,it has provided a new direction for SCLC.However,due to the lack of prognostic biomarkers,the median overall survival of SCLC is still to be improved.This study aimed to explore novel biomarkers and tumor-infiltrating immune cell characteristics that may serve as potential diagnostic and prognostic markers in SCLC.Methods Gene expression profiles from patients with SCLC were downloaded from the Gene Expression Omnibus(GEO)database,and tumor microenvironment(TME)infiltration profile data were obtained using CIBERSORT.The robust rank aggregation(RRA)method was utilized to integrate three SCLC microarray datasets downloaded from the GEO database and identify robust differentially expressed genes(DEGs)between normal and tumor tissue samples.Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)enrichment analyses were performed to explore the functions of the robust DEGs.Subsequently,protein-protein interaction networks and key modules were constructed by Cytoscape,and hub genes were selected from the whole network using the plugin cytoHubba.Survival analysis of hub genes was performed by Kaplan-Meier plotter in 18 patients with extensive-stage SCLC.Results A total of 312 robust DEGs,including 55 upregulated and 257 downregulated genes,were screened from 129 SCLC tissue samples and 44 normal tissue samples.GO and KEGG enrichment analyses revealed that the robust DEGs were predominantly involved in human T-cell leukemia virus 1 infection,focal adhesion,complement and coagulation cascades,tumor necrosis factor(TNF)signaling pathway,and ECM-receptor interaction,which are closely associated with the development and progression of SCLC.Subsequently,three DEGs modules and six hub genes(ITGA10,DUSP12,PTGS2,FOS,TGFBR2,and ICAM1)were identified through screening with the Cytoscape plugins MCODE and cytoHubba,respectively.Immune cell infiltration analysis by the CIBERSORT algorithm revealed that resting memory CD4+T cells were the predominant infiltrating immune cells in SCLC.In addition,Kaplan-Meier plotter revealed that the gene prostaglandin-endoperoxide synthase 2(PTGS2)was a potential prognostic biomarker of SCLC.Conclusions Hub genes and tumor-infiltrating immune cells may be the molecular mechanisms underlying the development of SCLC,and this finding could contribute to the formulation of individualized immunotherapy strategies for SCLC.