Spatially resolved transcriptomics(SRT)is capable of comprehensively characterizing gene expression patterns and providing an unbiased image of spatial composition.To fully understand the organizational complexity and...Spatially resolved transcriptomics(SRT)is capable of comprehensively characterizing gene expression patterns and providing an unbiased image of spatial composition.To fully understand the organizational complexity and tumor immune escape mechanism,we propose stMGATF,a multiview graph attention fusion model that integrates gene expression,histological images,spatial location,and gene association.To better extract information,stMGATF exploits SimCLRv2 for visual feature exaction and employs edge feature enhanced graph attention networks for the learning potential embedding of each view.A global attention mechanism is used to adaptively integrate 3 views to obtain low-dimensional representation.Applied to diverse SRT datasets,stMGATF is robust and outperforms other methods in detecting spatial domains and denoising data even with different resolutions and platforms.In particular,stMGATF contributes to the elucidation of tissue heterogeneity and extraction of 3-dimensional expression domains.Importantly,considering the associations between genes in tumors,stMGATF can identify the spatial dark genes ignored by traditional methods,which can be used to predict tumor-driving transcription factors and reveal tumor immune escape mechanisms,providing theoretical evidence for the development of new immunotherapeutic strategies.展开更多
Lkb1 deficiency confers the Kras-mutant lung cancer with strong plasticity and the potential for adeno-to-squamous transdifferentiation(AST).However,it remains largely unknown how Lkb1 deficiency dynamically regulates...Lkb1 deficiency confers the Kras-mutant lung cancer with strong plasticity and the potential for adeno-to-squamous transdifferentiation(AST).However,it remains largely unknown how Lkb1 deficiency dynamically regulates AST.Using the classical AST mouse model(Kras LSL-G12D/+;Lkb1flox/flox,KL),we here comprehensively analyze the temporal transcriptomic dynamics of lung tumors at different stages by dynamic network biomarker(DNB)and identify the tipping point at which the Wnt signaling is abruptly suppressed by the excessive accumulation of reactive oxygen species(ROS)through its downstream effector FOXO3A.Bidirectional genetic perturbation of the Wnt pathway using two different Ctnnb1 conditional knockout mouse strains confirms its essential role in the negative regulation of AST.Importantly,pharmacological activation of the Wnt pathway before but not after the tipping point inhibits squamous transdifferentiation,highlighting the irreversibility of AST after crossing the tipping point.Through comparative transcriptomic analyses of mouse and human tumors,we find that the lineage-specific transcription factors(TFs)of adenocarcinoma and squamous cell carcinoma form a“Yin-Yang”counteracting network.Interestingly,inactivation of the Wnt pathway preferentially suppresses the adenomatous lineage TF network and thus disrupts the“Yin-Yang”homeostasis to lean towards the squamous lineage,whereas ectopic expression of NKX2-1,an adenomatous lineage TF,significantly dampens such phenotypic transition accelerated by the Wnt pathway inactivation.The negative correlation between the Wnt pathway and AST is further observed in a large cohort of human lung adenosquamous carcinoma.Collectively,our study identifies the tipping point of AST and highlights an essential role of the ROS-Wnt axis in dynamically orchestrating the homeostasis between adeno-and squamous-specific TF networks at the AST tipping point.展开更多
Little is known about how chronic inflammation contributes to the progression of hepatoceUular carcinoma (HCC), especially the initiation of cancer. To uncover the critical transition from chronic inflammation to HC...Little is known about how chronic inflammation contributes to the progression of hepatoceUular carcinoma (HCC), especially the initiation of cancer. To uncover the critical transition from chronic inflammation to HCC and the molecular mechanisms at a network level, we analyzed the time-series proteomic data of woodchuck hepatitis virus/c.myc mice and age-matched wt-C57BL/6 mice using our dynamical network biomarker (DNB) model. DNB analysis indicated that the 5th month after birth of transgenic mice was the critical period of cancer initiation, just before the critical transition, which is consistent with clinical symptoms. Meanwhile, the DNB-associated network showed a drastic inversion of protein expression and coexpression levels before and after the critical transition. Two members of DNB, PLA2G6 and CYP2C44, along with their associated differentially expressed proteins, were found to induce dysfunction of arachidonic acid metabolism, further activate inflammatory responses through inflammatory mediator regulation of transient receptor potential channels, and finally lead to impairments of liver detoxification and malignant transition to cancer. As a c-Myc target, PLA2G6 positively correlated with c-Myc in expression, showing a trend from decreasing to increasing during carcinogenesis, with the minimal point at the critical transition or tipping point. Such trend of homologous PLA2G6 and c-Myc was also observed during human hepatocarcinogenesis, with the minimal point at high-grade dysplastic nodules (a stage just before the carcinogenesis). Our study implies that PLA2G6 might function as an oncogene like famous c-Myc during hepatocar- cinogenesis, while downregulation of PLA2G6 and c-Myc could be a warning signal indicating imminent carcinogenesis.展开更多
Acquired drug resistance is the major reason why patients fail to respond to cancer therapies.It is a challenging task to deter.mine the tipping point of endocrine resistance and detect the associated molecules.Derive...Acquired drug resistance is the major reason why patients fail to respond to cancer therapies.It is a challenging task to deter.mine the tipping point of endocrine resistance and detect the associated molecules.Derived from new systems biology theory, the dynamic network biomarker (DNB) method is designed to quantitatively identify the tipping point of a drastic system transition and can theoretically identify DNB genes that play key roles in acquiring drug resistance.We analyzed time-course mRNA sequence data generated from the tamoxifen-treated estrogen receptor (ER)-positive MCF-7 cell line, and identified the tipping point of endocrine resistance with its leading molecules.The results show that there is interplay between gene mutations and DNB genes, in which the accumulated mutations eventually affect the DNB genes that subsequently cause the change of transcriptional landscape, enabling full-blown drug resistance. Survival analyses based on clinical datasets validated that the DNB genes were associated with the poor survival of breast cancer patients.The results provided the detection for the pre-resistance state or early signs of endocrine resistance.Our predictive method may greatly benefit the scheduling of treatments for complex diseases in which patients are exposed to considerably different drugs and may become drug resistant.展开更多
This multicenter phase-II trial aimed to investigate the efficacy,safety,and predictive biomarkers of toripalimab plus chemotherapy as second-line treatment in patients with EGFR-mutant-advanced NSCLC.Patients who fai...This multicenter phase-II trial aimed to investigate the efficacy,safety,and predictive biomarkers of toripalimab plus chemotherapy as second-line treatment in patients with EGFR-mutant-advanced NSCLC.Patients who failed from first-line EGFR-TKIs and did not harbor T790M mutation were enrolled.Toripalimab plus carboplatin and pemetrexed were administrated every three weeks for up to six cycles,followed by the maintenance of toripalimab and pemetrexed.The primary endpoint was objective-response rate(ORR).Integrated biomarker analysis of PD-L1 expression,tumor mutational burden(TMB),CD8+tumor-infiltrating lymphocyte(TIL)density,whole-exome,and transcriptome sequencing on tumor biopsies were also conducted.Forty patients were enrolled with an overall ORR of 50.0%and disease-control rate(DCR)of 87.5%.The median progression free survival(PFS)and overall survival were 7.0 and 23.5 months,respectively.The most common treatment-related adverse effects were leukopenia,neutropenia,anemia,ALT/AST elevation,and nausea.Biomarker analysis showed that none of PD-L1 expression,TMB level,and CD8+TIL density could serve as a predictive biomarker.Integrated analysis of whole-exome and transcriptome sequencing data revealed that patients with DSPP mutation had a decreased M2 macrophage infiltration and associated with longer PFS than those of wild type.Toripalimab plus chemotherapy showed a promising anti-tumor activity with acceptable safety profiles as the second-line setting in patients with EGFR-mutant NSCLC.DSPP mutation might serve as a potential biomarker for this combination.A phase-III trial to compare toripalimab versus placebo in combination with chemotherapy in this setting is ongoing(NCT03924050).展开更多
Epithelial–mesenchymal transition(EMT) is a complex nonlinear biological process that plays essential roles in fundamental biological processes such as embryogenesis, wounding healing, tissue regeneration,and cancer ...Epithelial–mesenchymal transition(EMT) is a complex nonlinear biological process that plays essential roles in fundamental biological processes such as embryogenesis, wounding healing, tissue regeneration,and cancer metastasis. A hallmark of EMT is the switch-like behavior during state transition, which is characteristic of phase transitions. Hence, detecting the tipping point just before mesenchymal state transition is critical for understanding molecular mechanism of EMT. Through dynamic network biomarkers(DNB) model, a DNB group with 37 genes was identified which can provide the early-warning signals of EMT. Particularly, we found that two DNB genes, i.e., SMAD7 and SERPINE1 promoted EMT by switching their regulatory network which was further validated by biological experiments. Survival analyses revealed that SMAD7 and SERPINE1 as DNB genes further acted as prognostic biomarkers for lung adenocarcinoma.展开更多
Hepatitis B virus (HBV)-induced hepatocellular carcinoma (HCC) is a major cause of cancer-related deaths in Asia and Africa. Developing effective and non-invasive biomarkers of HCC for individual patients remains an u...Hepatitis B virus (HBV)-induced hepatocellular carcinoma (HCC) is a major cause of cancer-related deaths in Asia and Africa. Developing effective and non-invasive biomarkers of HCC for individual patients remains an urgent task for early diagnosis and convenient monitoring. Analyzing the transcriptomic profiles of peripheral blood mononuclear cells from both healthy donors and patients with chronic HBV infection in different states (i.e. HBV carrier, chronic hepatitis B, cirrhosis, and HCC), we identified a set of 19 candidate genes according to our algorithm of dynamic network biomarkers. These genes can both characterize different stages during HCC progression and identify cirrhosis as the critical transition stage before carcinogenesis. The interaction effects (i.e. coexpressions) of candidate genes were used to build an accurate prediction model: the so-called edge-based biomarker. Considering the convenience and robustness of biomarkers in clinical applications, we performed functional analysis, validated candidate genes in other independent samples of our collected cohort, and finally selected COL5A1, HLA-DQB1, MMP2, and CDK4 to build edge panel as prediction models. We demonstrated that the edge panel had great performance in both diagnosis and prognosis in terms of precision and specificity for HCC, especially for patients with alpha-fetoprotein-negative HCC. Our study not only provides a novel edge-based biomarker for non-invasive and effective diagnosis of HBV-associated HCC to each individual patient but also introduces a new way to integrate the interaction terms of individual molecules for clinical diagnosis and prognosis from the network and dynamics perspectives.展开更多
Significantly increasing crop yield is a major and worldwide challenge for food supply and security.It is well-known that rice cultivated at Taoyuan in Yunnan of China can produce the highest yield worldwide.Yet,the g...Significantly increasing crop yield is a major and worldwide challenge for food supply and security.It is well-known that rice cultivated at Taoyuan in Yunnan of China can produce the highest yield worldwide.Yet,the gene regulatory mechanism underpinning this ultrahigh yield has been a mystery.Here,we systematically collected the transcriptome data for seven key tissues at different developmental stages using rice cultivated both at Taoyuan as the case group and at another regular rice planting place Jinghong as the control group.We identified the top 24 candidate high-yield genes with their network modules from these well-designed datasets by developing a novel computational systems biology method,i.e.,dynamic cross-tissue(DCT)network analysis.We used one of the candidate genes,Os SPL4,whose function was previously unknown,for gene editing experimental validation of the high yield,and confirmed that Os SPL4 significantly affects panicle branching and increases the rice yield.This study,which included extensive field phenotyping,cross-tissue systems biology analyses,and functional validation,uncovered the key genes and gene regulatory networks underpinning the ultrahigh yield of rice.The DCT method could be applied to other plant or animal systems if different phenotypes under various environments with the common genome sequences of the examined sample.DCT can be downloaded from https://github.com/ztpub/DCT.展开更多
t The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity.However,compared to bulk RNA sequencing(RNA-seq),single-cell RNA-seq(scRNA-seq)suffers from hi...t The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity.However,compared to bulk RNA sequencing(RNA-seq),single-cell RNA-seq(scRNA-seq)suffers from higher noise and lower coverage,which brings new computational difficulties.Based on statistical independence,cell-specific network(CSN)is able to quantify the overall associations between genes for each cell,yet suffering from a problem of overestimation related to indirect effects.To overcome this problem,we propose the c-CSN method,which can construct the conditional cell-specific network(CCSN)for each cell.c-CSN method can measure the direct associations between genes by eliminating the indirect associations.c-CSN can be used for cell clustering and dimension reduction on a network basis of single cells.Intuitively,each CCSN can be viewed as the transformation from less“reliable”gene expression to more“reliable”gene–gene associations in a cell.Based on CCSN,we further design network flow entropy(NFE)to estimate the differentiation potency of a single cell.A number of scRNA-seq datasets were used to demonstrate the advantages of our approach.1)One direct association network is generated for one cell.2)Most existing scRNA-seq methods designed for gene expression matrices are also applicable to c-CSN-transformed degree matrices.3)CCSN-based NFE helps resolving the direction of differentiation trajectories by quantifying the potency of each cell.c-CSN is publicly available at https://github.com/LinLi-0909/c-CSN.展开更多
In "Omics" era of the life sciences, data is presented in many forms, which represent the information at various levels of bio- logical systems, including data about genome, transcriptome, epigenome, proteome, metab...In "Omics" era of the life sciences, data is presented in many forms, which represent the information at various levels of bio- logical systems, including data about genome, transcriptome, epigenome, proteome, metabolome, molecular imaging, molec- ular pathways, different population of people and clinical/med- ical records. The biological data is big, and its scale has already been well beyond petabyte (PB) even exabyte (EB). Nobody doubts that the biological data will create huge amount of val- ues, if scientists can overcome many challenges, e.g., how to handle the complexity of information, how to integrate the data from very heterogeneous resources, what kind of principles or standards to be adopted when facing with the big data. Tools and techniques for analyzing big biological data enable us to translate massive amount of information into a better under- standing of the basic biomedical mechanisms, which can be fur- ther applied to translational or personalized medicine.展开更多
Detecting direct associations or inferring networks based on the observed data is an important issue in many fields, including biology, physics, engineering and social studies. In this work, we focus on the informatio...Detecting direct associations or inferring networks based on the observed data is an important issue in many fields, including biology, physics, engineering and social studies. In this work, we focus on the information theoretic approaches in the network reconstruction or the direct association detection, in particular,for biological networks. We not only review the traditional approaches or measurements on the associations among the observed variables, such as correlation coefficient, mutual information and conditional mutual information(CMI), but also summarize recently developed theories and methods. The new theoretic works include:information geometry to give a unified framework in detecting causality/association, the partial independence to alleviate the singularity of CMI, and multiscale analysis of CMI to avoid the underestimation issue of CMI.The new methods include part mutual information(PMI) and partial associations(PA), which improve the old measurements in avoiding both overestimation and underestimation. All those theories and methods make important contributions as major advances in the development of network inference.展开更多
Rapid accumulation of biological data is driving the system-level study from describing complex phe- nomena to understanding molecular mechanisms, from analyzing individual components to under-standing their networks ...Rapid accumulation of biological data is driving the system-level study from describing complex phe- nomena to understanding molecular mechanisms, from analyzing individual components to under-standing their networks and systems (Chen et al., 2009; Chen and Wu, 2015). Data-driven systems biology approaches are emerging as essential tools to gain new insights into biological processes or systems. In this issue, we collect several research articles, which are all related to such data-driven methodologies or their applications, ranged from new computational tools (GWAS and signal pathway studies) to molecu- lar biology (CSRE inference) and disease analyses (detection of the disease tipping point by DNBs and key genes during glioma progression).展开更多
Skin,as the outmost layer of human body,is frequently exposed to environmental stressors including pollutants and ultraviolet(UV),which could lead to skin disorders.Generally,skin response process to ultraviolet B(UVB...Skin,as the outmost layer of human body,is frequently exposed to environmental stressors including pollutants and ultraviolet(UV),which could lead to skin disorders.Generally,skin response process to ultraviolet B(UVB)irradiation is a nonlinear dynamic process,with unknown underlying molecular mechanism of critical transition.Here,the landscape dynamic network biomarker(lDNB)analysis of time series transcriptome data on 3D skin model was conducted to reveal the complicated process of skin response to UV irradiation at both molecular and network levels.The advanced l-DNB analysis approach showed that:(i)there was a tipping point before critical transition state during pigmentation process,validated by 3D skin model;(ii)13 core DNB genes were identified to detect the tipping point as a network biomarker,supported by computational assessment;(iii)core DNB genes such as COL7A1 and CTNNB1 can effectively predict skin lightening,validated by independent human skin data.Overall,this study provides new insights for skin response to repetitive UVB irradiation,including dynamic pathway pattern,biphasic response,and DNBs for skin lightening change,and enables us to further understand the skin resilience process after external stress.展开更多
The estrogen receptor(ER)-negative breast cancer subtype is aggressive with few treatment options available.To identify specific prognostic factors for ER-negative breast cancer,this study included 705,729 and 1034 br...The estrogen receptor(ER)-negative breast cancer subtype is aggressive with few treatment options available.To identify specific prognostic factors for ER-negative breast cancer,this study included 705,729 and 1034 breast invasive cancer patients from the Surveillance,Epidemiology,and End Results(SEER)and The Cancer Genome Atlas(TCGA)databases,respectively.To identify key differential kinase-substrate node and edge biomarkers between ER-negative and ERpositive breast cancer patients,we adopted a network-based method using correlation coefficients between molecular pairs in the kinase regulatory network.Integrated analysis of the clinical and molecular data revealed the significant prognostic power of kinase-substrate node and edge features for both subtypes of breast cancer.Two promising kinase-substrate edge features,CSNK1A1-NFATC3 and SRC-OCLN,were identified for more accurate prognostic prediction in ERnegative breast cancer patients.展开更多
Recent advances of single-cell RNA sequencing(scRNA-seq)technologies have led to extensive study of cellular heterogeneity and cell-to-cell variation.However,the high frequency of dropout events and noise in scRNA-seq...Recent advances of single-cell RNA sequencing(scRNA-seq)technologies have led to extensive study of cellular heterogeneity and cell-to-cell variation.However,the high frequency of dropout events and noise in scRNA-seq data confounds the accuracy of the downstream analysis,i.e.clustering analysis,whose accuracy depends heavily on the selected feature genes.Here,by deriving an entropy decomposition formula,we propose a feature selection method,i.e.an intrinsic entropy(IE)model,to identify the informative genes for accurately clustering analysis.Specifically,by eliminating the‘noisy’fluctuation or extrinsic entropy(EE),we extract the IE of each gene from the total entropy(TE),i.e.TE=IE+EE.We show that the IE of each gene actually reflects the regulatory fluctuation of this gene in a cellular process,and thus high-IE genes provide rich information on celltype or state analysis.To validate the performance of the high-IE genes,we conduct computational analysis on both simulated datasets and real single-cell datasets by comparing with other representative methods.The results show that our IE model is not only broadly applicable and robust for different clustering and classification methods,but also sensitive for novel cell types.Our results also demonstrate that the intrinsic entropy/fluctuation of a gene serves as information rather than noise in contrast to its total entropy/fluctuation.展开更多
Recent trend on biological data at a molecular level is omics data analysis for both bulk and single cells, in eluding genomics, proteomics, metabolomics, and epigenetics data (Wang and Zhang, 2017;Zhang et al., 2017;...Recent trend on biological data at a molecular level is omics data analysis for both bulk and single cells, in eluding genomics, proteomics, metabolomics, and epigenetics data (Wang and Zhang, 2017;Zhang et al., 2017;Zhao and Li, 2017;Cheng and Leung, 2018). Rapid accumulation of such high-dimensional biological data is driving the system-level study from describing complex phenomena to understanding molecular mechanisms (Park et al., 2018;Sun et al., 2018) and from analyzi ng in dividual components to understanding their networks and systems (Chen et al., 2009;Chen, 2017).展开更多
Natural systems are typically nonlinear and complex, and it is of great interest to be able to reconstruct a system in order to understand its mechanism, which cannot only recover nonlinear behaviors but also predict ...Natural systems are typically nonlinear and complex, and it is of great interest to be able to reconstruct a system in order to understand its mechanism, which cannot only recover nonlinear behaviors but also predict future dynamics. Due to the advances of modern technology, big data becomes increasingly accessible and consequently the problem of reconstructing systems from measured data or time series plays a central role in many scientific disciplines. In recent decades, nonlinear methods rooted in state space reconstruction have been developed, and they do not assume any model equations but can recover the dynamics purely from the measured time series data. In this review, the development of state space reconstruction techniques will be introduced and the recent advances in systems prediction and causality inference using state space reconstruction will be presented. Particularly, the cutting-edge method to deal with short-term time series data will be focused on.Finally, the advantages as well as the remaining problems in this field are discussed.展开更多
Network or edge biomarkers area reliable form to characterize phenotypes or diseases.However,obtaining edges orcorrelations between molecules for an individual requires measurement ofmultiple samples of that individua...Network or edge biomarkers area reliable form to characterize phenotypes or diseases.However,obtaining edges orcorrelations between molecules for an individual requires measurement ofmultiple samples of that individual,which are generally unavailable in clinical practice.Thus,it is strongly demanded to diagnose a disease by edge or network biomarkers in one-sample-for-one-individual context.Here,we developed a new computational framework,EdgeBiomarker,to integrate edge and node biomarkers to diagnose phenotype of each single test sample.By applying the method to datasets of lung and breast cancer,it reveals new marker genes/gene-pairs and related sub-networks for distinguishing earlier and advanced cancer stages.Our method shows advantages over traditional methods:(i)edge biomarkers extracted from non-differentially expressed genes achieve better cross-validation accuracy of diagnosis than molecule or node biomarkers from differentially expressed genes,suggesting that certain pathogenic information is only present at the level of network and under-estimated by traditional methods;(ii)edge biomarkers categorize patients into low/high survival rate in a more reliablemanner;(iii)edge biomarkers are significantly enriched in relevant biological functions or pathways,implying that the association changes ina network,rather than expression changes in individual molecules,tendtobe causally related to cancer development.The new frameworkof edgebiomarkers paves theway for diagnosing diseases and analyzing the irmolecular mechanisms by edges or networks in one-sample-for-one-individual basis.This also provides a powerful tool for precision medicine or big-data medicine.展开更多
Detecting communities from complex networks is an important issue and has attracted attention of researchers in many fields. It is relevant to social tasks, biological inquiries, and technological problems since vario...Detecting communities from complex networks is an important issue and has attracted attention of researchers in many fields. It is relevant to social tasks, biological inquiries, and technological problems since various networks exist in these systems. This paper proposes a new self-organizing map (SOM) based approach to community detection. By adopting a new operation and a new weightupdating scheme, a complex network can be organized into dense subgraphs according to the topological connection of each node by the SOM algorithm. Extensive numerical experiments show that the performance of the SOM algorithm is good. It can identify communities more accurately than existing methods. This method can be used to detect communities not only in undirected networks, but also in directed networks and bipartite networks.展开更多
基金supported by the National Key R&D Program of China(No.2022YFA1004800)the Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDB38040400)+4 种基金the National Natural Science Foundation of China(Nos.12131020,31930022,T2350003,and T2341007)Special Fund for Science and Technology Innovation Strategy of Guangdong Province(Nos.2021B0909050004 and 2021B0909060002)the Key-Area Research and Development Program of Guangdong Province(No.2021B0909060002)JST Moonshot R&D(No.JPMJMS2021)Major projects of Henan Province(No.231100220100).
文摘Spatially resolved transcriptomics(SRT)is capable of comprehensively characterizing gene expression patterns and providing an unbiased image of spatial composition.To fully understand the organizational complexity and tumor immune escape mechanism,we propose stMGATF,a multiview graph attention fusion model that integrates gene expression,histological images,spatial location,and gene association.To better extract information,stMGATF exploits SimCLRv2 for visual feature exaction and employs edge feature enhanced graph attention networks for the learning potential embedding of each view.A global attention mechanism is used to adaptively integrate 3 views to obtain low-dimensional representation.Applied to diverse SRT datasets,stMGATF is robust and outperforms other methods in detecting spatial domains and denoising data even with different resolutions and platforms.In particular,stMGATF contributes to the elucidation of tissue heterogeneity and extraction of 3-dimensional expression domains.Importantly,considering the associations between genes in tumors,stMGATF can identify the spatial dark genes ignored by traditional methods,which can be used to predict tumor-driving transcription factors and reveal tumor immune escape mechanisms,providing theoretical evidence for the development of new immunotherapeutic strategies.
基金We thank Drs.Tyler Jacks,Ronald A.DePinho,Kwok-kin Wong,and Lijian Hui for the generous gift of various mouse strains.We also thank Ruiqi Wang,Rui Liu,Pei Chen,Chao Zheng,and Jifan Shi for helpful discussion.This work was supported by the National Basic Research Program of China(Nos.2017YFA0505500 to H.J.and L.C.,2020YFA0803300 to H.J.)the National Natural Science Foundation of China(Nos.91731314,82030083,31621003,81872312,82011540007 to H.J.,12131020,31930022,12026608 to L.C.,82273093 to Z.F.,81871875,82173340 to L.H.,81802279 to H.H.,81902326 to X.W.,81402371 to Y.J.)+7 种基金the Strategic Priority Research Program of the Chinese Academy of Sciences(Nos.XDB19020201 to H.J.,XDB38040400 to L.C.)Basic Frontier Scientific Research Program of Chinese Academy of Science(No.ZDBS-LY-SM006 to H.J.)International Cooperation Project of Chinese Academy of Sciences(No.153D31KYSB20190035 to H.J.)the Science and Technology Commission of Shanghai Municipality(No.21ZR1470300 to L.H.)the Youth Innovation Promotion Association CAS(No.Y919S31371 to X.W.)Special Fund for Science and Technology Innovation Strategy of Guangdong Province(Nos.2021B0909050004,2021B0909060002 to L.C.)Major Key Project of PCL(No.PCL2021A12 to L.C.)JST Moonshot R&D Project(No.JPMJMS2021 to L.C.).
文摘Lkb1 deficiency confers the Kras-mutant lung cancer with strong plasticity and the potential for adeno-to-squamous transdifferentiation(AST).However,it remains largely unknown how Lkb1 deficiency dynamically regulates AST.Using the classical AST mouse model(Kras LSL-G12D/+;Lkb1flox/flox,KL),we here comprehensively analyze the temporal transcriptomic dynamics of lung tumors at different stages by dynamic network biomarker(DNB)and identify the tipping point at which the Wnt signaling is abruptly suppressed by the excessive accumulation of reactive oxygen species(ROS)through its downstream effector FOXO3A.Bidirectional genetic perturbation of the Wnt pathway using two different Ctnnb1 conditional knockout mouse strains confirms its essential role in the negative regulation of AST.Importantly,pharmacological activation of the Wnt pathway before but not after the tipping point inhibits squamous transdifferentiation,highlighting the irreversibility of AST after crossing the tipping point.Through comparative transcriptomic analyses of mouse and human tumors,we find that the lineage-specific transcription factors(TFs)of adenocarcinoma and squamous cell carcinoma form a“Yin-Yang”counteracting network.Interestingly,inactivation of the Wnt pathway preferentially suppresses the adenomatous lineage TF network and thus disrupts the“Yin-Yang”homeostasis to lean towards the squamous lineage,whereas ectopic expression of NKX2-1,an adenomatous lineage TF,significantly dampens such phenotypic transition accelerated by the Wnt pathway inactivation.The negative correlation between the Wnt pathway and AST is further observed in a large cohort of human lung adenosquamous carcinoma.Collectively,our study identifies the tipping point of AST and highlights an essential role of the ROS-Wnt axis in dynamically orchestrating the homeostasis between adeno-and squamous-specific TF networks at the AST tipping point.
文摘Little is known about how chronic inflammation contributes to the progression of hepatoceUular carcinoma (HCC), especially the initiation of cancer. To uncover the critical transition from chronic inflammation to HCC and the molecular mechanisms at a network level, we analyzed the time-series proteomic data of woodchuck hepatitis virus/c.myc mice and age-matched wt-C57BL/6 mice using our dynamical network biomarker (DNB) model. DNB analysis indicated that the 5th month after birth of transgenic mice was the critical period of cancer initiation, just before the critical transition, which is consistent with clinical symptoms. Meanwhile, the DNB-associated network showed a drastic inversion of protein expression and coexpression levels before and after the critical transition. Two members of DNB, PLA2G6 and CYP2C44, along with their associated differentially expressed proteins, were found to induce dysfunction of arachidonic acid metabolism, further activate inflammatory responses through inflammatory mediator regulation of transient receptor potential channels, and finally lead to impairments of liver detoxification and malignant transition to cancer. As a c-Myc target, PLA2G6 positively correlated with c-Myc in expression, showing a trend from decreasing to increasing during carcinogenesis, with the minimal point at the critical transition or tipping point. Such trend of homologous PLA2G6 and c-Myc was also observed during human hepatocarcinogenesis, with the minimal point at high-grade dysplastic nodules (a stage just before the carcinogenesis). Our study implies that PLA2G6 might function as an oncogene like famous c-Myc during hepatocar- cinogenesis, while downregulation of PLA2G6 and c-Myc could be a warning signal indicating imminent carcinogenesis.
基金This work was supported by grants from the National Key R&D Program of China (2017YFA0505500)Strategic Priority Research Program of the Chinese Academy of Sciences (XDBl3040700)+6 种基金the National Natural Science Foundation of China (11771152,91529303,31771476,31571363,31771469,91530320,61134013,81573023,81501203,and 11326035)Pearl River Science and Technology Nova Program of Guangzhou (201610010029)FISRT,Aihara Innovative Mathematical Modeling Project from Cabinet Office,JapanFundamental Research Funds for the Central Universities (2017ZD095)JSPS KAKENHI (15H05707)Grant-in-Aid for Scientific Research on Innovative Areas (3901) and SPS KAKENHI (15KT0084,17H06299,17H06302,and 18H04031)RIKEN Epigenome and Single Cell Project Grants to M.O.-H.This work was performed in part under the International Cooperative Research Program of Institute for Protein Research,Osaka University (ICRa-17-01 to L.C.and M.O.-H.).
文摘Acquired drug resistance is the major reason why patients fail to respond to cancer therapies.It is a challenging task to deter.mine the tipping point of endocrine resistance and detect the associated molecules.Derived from new systems biology theory, the dynamic network biomarker (DNB) method is designed to quantitatively identify the tipping point of a drastic system transition and can theoretically identify DNB genes that play key roles in acquiring drug resistance.We analyzed time-course mRNA sequence data generated from the tamoxifen-treated estrogen receptor (ER)-positive MCF-7 cell line, and identified the tipping point of endocrine resistance with its leading molecules.The results show that there is interplay between gene mutations and DNB genes, in which the accumulated mutations eventually affect the DNB genes that subsequently cause the change of transcriptional landscape, enabling full-blown drug resistance. Survival analyses based on clinical datasets validated that the DNB genes were associated with the poor survival of breast cancer patients.The results provided the detection for the pre-resistance state or early signs of endocrine resistance.Our predictive method may greatly benefit the scheduling of treatments for complex diseases in which patients are exposed to considerably different drugs and may become drug resistant.
基金This study was also supported in part by grants from the National Natural Science Foundation of China(No.81871865,81874036,81972167,82102859,31930022,31771476,and 12026608)National Science and Technology Major Project(No.2017YFA0505500)+5 种基金the Strategic Priority Project of Chinese Academy of Sciences(No.XDB38040400,XDB38020000)the Backbone Program of Shanghai Pulmonary Hospital(No.FKGG1802)Shanghai Pujiang Talent Plan(No.2019PJD048)Shanghai Science and Technology Committee Foundation(NO.19411950300)Shanghai Key disciplines of Respiratory(No.2017ZZ02012)the Shanghai Sailing Program(No.20YF1407500).
文摘This multicenter phase-II trial aimed to investigate the efficacy,safety,and predictive biomarkers of toripalimab plus chemotherapy as second-line treatment in patients with EGFR-mutant-advanced NSCLC.Patients who failed from first-line EGFR-TKIs and did not harbor T790M mutation were enrolled.Toripalimab plus carboplatin and pemetrexed were administrated every three weeks for up to six cycles,followed by the maintenance of toripalimab and pemetrexed.The primary endpoint was objective-response rate(ORR).Integrated biomarker analysis of PD-L1 expression,tumor mutational burden(TMB),CD8+tumor-infiltrating lymphocyte(TIL)density,whole-exome,and transcriptome sequencing on tumor biopsies were also conducted.Forty patients were enrolled with an overall ORR of 50.0%and disease-control rate(DCR)of 87.5%.The median progression free survival(PFS)and overall survival were 7.0 and 23.5 months,respectively.The most common treatment-related adverse effects were leukopenia,neutropenia,anemia,ALT/AST elevation,and nausea.Biomarker analysis showed that none of PD-L1 expression,TMB level,and CD8+TIL density could serve as a predictive biomarker.Integrated analysis of whole-exome and transcriptome sequencing data revealed that patients with DSPP mutation had a decreased M2 macrophage infiltration and associated with longer PFS than those of wild type.Toripalimab plus chemotherapy showed a promising anti-tumor activity with acceptable safety profiles as the second-line setting in patients with EGFR-mutant NSCLC.DSPP mutation might serve as a potential biomarker for this combination.A phase-III trial to compare toripalimab versus placebo in combination with chemotherapy in this setting is ongoing(NCT03924050).
基金supported by the National Key Research and Development Program of China (2017YFA0505500)the National Natural Science Foundation of China (31930022, 31771476, 61773196)+5 种基金Shanghai Municipal Science and Technology Major Project (2017SHZDZX01)Key Project of Zhangjiang National Innovation Demonstration Zone Special Development Fund (ZJ2018ZD-013)Ministry of Science and Technology Project (2017YFC0907505)Guangdong Provincial Key Laboratory Funds (2017B030301018, 2019B030301001)Shenzhen Research Funds (JCYJ20170307104535585, ZDSYS20140509142721429)Shenzhen Peacock Plan (KQTD2016053117035204)
文摘Epithelial–mesenchymal transition(EMT) is a complex nonlinear biological process that plays essential roles in fundamental biological processes such as embryogenesis, wounding healing, tissue regeneration,and cancer metastasis. A hallmark of EMT is the switch-like behavior during state transition, which is characteristic of phase transitions. Hence, detecting the tipping point just before mesenchymal state transition is critical for understanding molecular mechanism of EMT. Through dynamic network biomarkers(DNB) model, a DNB group with 37 genes was identified which can provide the early-warning signals of EMT. Particularly, we found that two DNB genes, i.e., SMAD7 and SERPINE1 promoted EMT by switching their regulatory network which was further validated by biological experiments. Survival analyses revealed that SMAD7 and SERPINE1 as DNB genes further acted as prognostic biomarkers for lung adenocarcinoma.
基金This research was supported by the National Key Research and Development Program of China (2O17YFAO5O55OO)the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB13040700)+4 种基金the Major Program ofthe National Natural Science Foundation of China (81330084)the National Natural Science Foundation of China (8150347 81473443,and 31771476)the National Science and Technology Major Project of China (2012ZX10005001-004)the 'Yang Fan' Program of Sha nghai Committee ofScience and Technology Fund Annotation (14YF1411400 and 18YF1420700)E-lnstitutes of Shanghai Municipal Education Commission (E03008).
文摘Hepatitis B virus (HBV)-induced hepatocellular carcinoma (HCC) is a major cause of cancer-related deaths in Asia and Africa. Developing effective and non-invasive biomarkers of HCC for individual patients remains an urgent task for early diagnosis and convenient monitoring. Analyzing the transcriptomic profiles of peripheral blood mononuclear cells from both healthy donors and patients with chronic HBV infection in different states (i.e. HBV carrier, chronic hepatitis B, cirrhosis, and HCC), we identified a set of 19 candidate genes according to our algorithm of dynamic network biomarkers. These genes can both characterize different stages during HCC progression and identify cirrhosis as the critical transition stage before carcinogenesis. The interaction effects (i.e. coexpressions) of candidate genes were used to build an accurate prediction model: the so-called edge-based biomarker. Considering the convenience and robustness of biomarkers in clinical applications, we performed functional analysis, validated candidate genes in other independent samples of our collected cohort, and finally selected COL5A1, HLA-DQB1, MMP2, and CDK4 to build edge panel as prediction models. We demonstrated that the edge panel had great performance in both diagnosis and prognosis in terms of precision and specificity for HCC, especially for patients with alpha-fetoprotein-negative HCC. Our study not only provides a novel edge-based biomarker for non-invasive and effective diagnosis of HBV-associated HCC to each individual patient but also introduces a new way to integrate the interaction terms of individual molecules for clinical diagnosis and prognosis from the network and dynamics perspectives.
基金the National Basic Research Program of China(Grant No.2013CB835200)the National Key R&D Program of China(Grant No.2017YFA0505500)+4 种基金the Key Grant of Yunnan Provincial Science and Technology Department(Grant No.2013GA004)the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDB13040700)the National Natural Science Foundation of China(Grant Nos.11871456 and 31771476)the Shanghai Municipal Science and Technology Major Project(Grant No.2017SHZDZX01)the Open Research Fund of State Key Laboratory of Hybrid Rice(Wuhan University,Grant No.KF201806),China。
文摘Significantly increasing crop yield is a major and worldwide challenge for food supply and security.It is well-known that rice cultivated at Taoyuan in Yunnan of China can produce the highest yield worldwide.Yet,the gene regulatory mechanism underpinning this ultrahigh yield has been a mystery.Here,we systematically collected the transcriptome data for seven key tissues at different developmental stages using rice cultivated both at Taoyuan as the case group and at another regular rice planting place Jinghong as the control group.We identified the top 24 candidate high-yield genes with their network modules from these well-designed datasets by developing a novel computational systems biology method,i.e.,dynamic cross-tissue(DCT)network analysis.We used one of the candidate genes,Os SPL4,whose function was previously unknown,for gene editing experimental validation of the high yield,and confirmed that Os SPL4 significantly affects panicle branching and increases the rice yield.This study,which included extensive field phenotyping,cross-tissue systems biology analyses,and functional validation,uncovered the key genes and gene regulatory networks underpinning the ultrahigh yield of rice.The DCT method could be applied to other plant or animal systems if different phenotypes under various environments with the common genome sequences of the examined sample.DCT can be downloaded from https://github.com/ztpub/DCT.
基金supported by the National Key R&D Program of China(2017YFA0505500)National Natural Science Foundation of China(11771152,12026608,11901203,31930022,and 31771476)+7 种基金Guangdong Basic and Applied Basic Research Foundation(2019B151502062,and 2021A1515012317)Strategic Priority Research Program of the Chinese Academy of Sciences(XDB38040400)Shanghai Municipal Science and Technology Major Project(2017SHZDZX01)Japan Society for the Promotion of Science KAKENHI(15H05707)Japan Science and Technology Agency Moonshot R&D(JPMJMS2021)Japan Agency for Medical Research and Development(JP20dm0307009)UTokyo Center for Integrative Science of Human Behavior(CiSHuB)the International Research Center for Neurointelligence(WPI-IRCN)at The University of Tokyo Institutes for Advanced Study(UTIAS).
基金the National Key R&D Program of China(Grant No.2017YFA0505500)the National Natural Science Foundation of China(Grant Nos.31771476 and 31930022)the Shanghai Municipal Science and Technology Major Project,China(Grant No.2017SHZDZX01).
文摘t The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity.However,compared to bulk RNA sequencing(RNA-seq),single-cell RNA-seq(scRNA-seq)suffers from higher noise and lower coverage,which brings new computational difficulties.Based on statistical independence,cell-specific network(CSN)is able to quantify the overall associations between genes for each cell,yet suffering from a problem of overestimation related to indirect effects.To overcome this problem,we propose the c-CSN method,which can construct the conditional cell-specific network(CCSN)for each cell.c-CSN method can measure the direct associations between genes by eliminating the indirect associations.c-CSN can be used for cell clustering and dimension reduction on a network basis of single cells.Intuitively,each CCSN can be viewed as the transformation from less“reliable”gene expression to more“reliable”gene–gene associations in a cell.Based on CCSN,we further design network flow entropy(NFE)to estimate the differentiation potency of a single cell.A number of scRNA-seq datasets were used to demonstrate the advantages of our approach.1)One direct association network is generated for one cell.2)Most existing scRNA-seq methods designed for gene expression matrices are also applicable to c-CSN-transformed degree matrices.3)CCSN-based NFE helps resolving the direction of differentiation trajectories by quantifying the potency of each cell.c-CSN is publicly available at https://github.com/LinLi-0909/c-CSN.
基金partially supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB13040700)the National Program on Key Basic Research Project (973 Program, Grant No. 2014CB910504)the National Natural Science Foundation of China (NSFC) (Grant Nos. 61134013, 91130032, 61103075 and 91029301)
文摘In "Omics" era of the life sciences, data is presented in many forms, which represent the information at various levels of bio- logical systems, including data about genome, transcriptome, epigenome, proteome, metabolome, molecular imaging, molec- ular pathways, different population of people and clinical/med- ical records. The biological data is big, and its scale has already been well beyond petabyte (PB) even exabyte (EB). Nobody doubts that the biological data will create huge amount of val- ues, if scientists can overcome many challenges, e.g., how to handle the complexity of information, how to integrate the data from very heterogeneous resources, what kind of principles or standards to be adopted when facing with the big data. Tools and techniques for analyzing big biological data enable us to translate massive amount of information into a better under- standing of the basic biomedical mechanisms, which can be fur- ther applied to translational or personalized medicine.
基金supported by the National Key R&D Program of China (Grant No. 2017YFA0505500)the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB13040700)National Natural Science Foundation of China (Grant Nos. 31771476, 91529303, 91439103, 11421101 and 91530322)
文摘Detecting direct associations or inferring networks based on the observed data is an important issue in many fields, including biology, physics, engineering and social studies. In this work, we focus on the information theoretic approaches in the network reconstruction or the direct association detection, in particular,for biological networks. We not only review the traditional approaches or measurements on the associations among the observed variables, such as correlation coefficient, mutual information and conditional mutual information(CMI), but also summarize recently developed theories and methods. The new theoretic works include:information geometry to give a unified framework in detecting causality/association, the partial independence to alleviate the singularity of CMI, and multiscale analysis of CMI to avoid the underestimation issue of CMI.The new methods include part mutual information(PMI) and partial associations(PA), which improve the old measurements in avoiding both overestimation and underestimation. All those theories and methods make important contributions as major advances in the development of network inference.
文摘Rapid accumulation of biological data is driving the system-level study from describing complex phe- nomena to understanding molecular mechanisms, from analyzing individual components to under-standing their networks and systems (Chen et al., 2009; Chen and Wu, 2015). Data-driven systems biology approaches are emerging as essential tools to gain new insights into biological processes or systems. In this issue, we collect several research articles, which are all related to such data-driven methodologies or their applications, ranged from new computational tools (GWAS and signal pathway studies) to molecu- lar biology (CSRE inference) and disease analyses (detection of the disease tipping point by DNBs and key genes during glioma progression).
基金partially supported by the National Natural Science Foundation of China(31930022,31771476,12026608,12042104,and 11871456)the Strategic Priority Project of CAS(XDB38040400)+1 种基金the National Key R&D Program of China(2017YFA0505500)JST Moonshot R&D program(JP MJMS2021 to L.C.).
文摘Skin,as the outmost layer of human body,is frequently exposed to environmental stressors including pollutants and ultraviolet(UV),which could lead to skin disorders.Generally,skin response process to ultraviolet B(UVB)irradiation is a nonlinear dynamic process,with unknown underlying molecular mechanism of critical transition.Here,the landscape dynamic network biomarker(lDNB)analysis of time series transcriptome data on 3D skin model was conducted to reveal the complicated process of skin response to UV irradiation at both molecular and network levels.The advanced l-DNB analysis approach showed that:(i)there was a tipping point before critical transition state during pigmentation process,validated by 3D skin model;(ii)13 core DNB genes were identified to detect the tipping point as a network biomarker,supported by computational assessment;(iii)core DNB genes such as COL7A1 and CTNNB1 can effectively predict skin lightening,validated by independent human skin data.Overall,this study provides new insights for skin response to repetitive UVB irradiation,including dynamic pathway pattern,biphasic response,and DNBs for skin lightening change,and enables us to further understand the skin resilience process after external stress.
基金supported by the National Key R&D Program of China(Grant No.2017YFA0505500)the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDA12010000)+2 种基金the National Program on Key Basic Research Project of China(Grant Nos.2014CBA02000 and 2014CB910500)the National Natural Science Foundation of China(Grant Nos.91029301,30700397,91529303,and 31771476)the support of the SANOFI-SIBS Distinguish Young Scientist Award Scholarship Program。
文摘The estrogen receptor(ER)-negative breast cancer subtype is aggressive with few treatment options available.To identify specific prognostic factors for ER-negative breast cancer,this study included 705,729 and 1034 breast invasive cancer patients from the Surveillance,Epidemiology,and End Results(SEER)and The Cancer Genome Atlas(TCGA)databases,respectively.To identify key differential kinase-substrate node and edge biomarkers between ER-negative and ERpositive breast cancer patients,we adopted a network-based method using correlation coefficients between molecular pairs in the kinase regulatory network.Integrated analysis of the clinical and molecular data revealed the significant prognostic power of kinase-substrate node and edge features for both subtypes of breast cancer.Two promising kinase-substrate edge features,CSNK1A1-NFATC3 and SRC-OCLN,were identified for more accurate prognostic prediction in ERnegative breast cancer patients.
基金supported by grants from the National Key R&D Program of China(2017YFA0505500)the National Natural Science Foundation of China(31930022,12131020,12026608,and 31771476)+1 种基金the Strategic Priority Research Program of the Chinese Academy of Sciences(XDB38040400)JST Moonshot R&D(JPMJMS2021).
文摘Recent advances of single-cell RNA sequencing(scRNA-seq)technologies have led to extensive study of cellular heterogeneity and cell-to-cell variation.However,the high frequency of dropout events and noise in scRNA-seq data confounds the accuracy of the downstream analysis,i.e.clustering analysis,whose accuracy depends heavily on the selected feature genes.Here,by deriving an entropy decomposition formula,we propose a feature selection method,i.e.an intrinsic entropy(IE)model,to identify the informative genes for accurately clustering analysis.Specifically,by eliminating the‘noisy’fluctuation or extrinsic entropy(EE),we extract the IE of each gene from the total entropy(TE),i.e.TE=IE+EE.We show that the IE of each gene actually reflects the regulatory fluctuation of this gene in a cellular process,and thus high-IE genes provide rich information on celltype or state analysis.To validate the performance of the high-IE genes,we conduct computational analysis on both simulated datasets and real single-cell datasets by comparing with other representative methods.The results show that our IE model is not only broadly applicable and robust for different clustering and classification methods,but also sensitive for novel cell types.Our results also demonstrate that the intrinsic entropy/fluctuation of a gene serves as information rather than noise in contrast to its total entropy/fluctuation.
文摘Recent trend on biological data at a molecular level is omics data analysis for both bulk and single cells, in eluding genomics, proteomics, metabolomics, and epigenetics data (Wang and Zhang, 2017;Zhang et al., 2017;Zhao and Li, 2017;Cheng and Leung, 2018). Rapid accumulation of such high-dimensional biological data is driving the system-level study from describing complex phenomena to understanding molecular mechanisms (Park et al., 2018;Sun et al., 2018) and from analyzi ng in dividual components to understanding their networks and systems (Chen et al., 2009;Chen, 2017).
基金supported by the National Key Research and Development Program of China (Grant No. 2017YFA0505500)Japan Society for the Promotion of Science KAKENHI Program (Grant No. JP15H05707)National Natural Science Foundation of China (Grant Nos. 11771010,31771476,91530320, 91529303,91439103 and 81471047)
文摘Natural systems are typically nonlinear and complex, and it is of great interest to be able to reconstruct a system in order to understand its mechanism, which cannot only recover nonlinear behaviors but also predict future dynamics. Due to the advances of modern technology, big data becomes increasingly accessible and consequently the problem of reconstructing systems from measured data or time series plays a central role in many scientific disciplines. In recent decades, nonlinear methods rooted in state space reconstruction have been developed, and they do not assume any model equations but can recover the dynamics purely from the measured time series data. In this review, the development of state space reconstruction techniques will be introduced and the recent advances in systems prediction and causality inference using state space reconstruction will be presented. Particularly, the cutting-edge method to deal with short-term time series data will be focused on.Finally, the advantages as well as the remaining problems in this field are discussed.
基金This work was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences(CAS)(No.XDB13040700)the National Program on Key Basic Research Project(No.2014CB910504)+1 种基金the National Natural Science Foundation of China(No.91439103,61134013,31200987)the Knowledge Innovation Program of SIBS of CAS(No.2013KIP218).
文摘Network or edge biomarkers area reliable form to characterize phenotypes or diseases.However,obtaining edges orcorrelations between molecules for an individual requires measurement ofmultiple samples of that individual,which are generally unavailable in clinical practice.Thus,it is strongly demanded to diagnose a disease by edge or network biomarkers in one-sample-for-one-individual context.Here,we developed a new computational framework,EdgeBiomarker,to integrate edge and node biomarkers to diagnose phenotype of each single test sample.By applying the method to datasets of lung and breast cancer,it reveals new marker genes/gene-pairs and related sub-networks for distinguishing earlier and advanced cancer stages.Our method shows advantages over traditional methods:(i)edge biomarkers extracted from non-differentially expressed genes achieve better cross-validation accuracy of diagnosis than molecule or node biomarkers from differentially expressed genes,suggesting that certain pathogenic information is only present at the level of network and under-estimated by traditional methods;(ii)edge biomarkers categorize patients into low/high survival rate in a more reliablemanner;(iii)edge biomarkers are significantly enriched in relevant biological functions or pathways,implying that the association changes ina network,rather than expression changes in individual molecules,tendtobe causally related to cancer development.The new frameworkof edgebiomarkers paves theway for diagnosing diseases and analyzing the irmolecular mechanisms by edges or networks in one-sample-for-one-individual basis.This also provides a powerful tool for precision medicine or big-data medicine.
基金This research is supported by the National Natural Science Foundation of China under Grant Nos 10631070, 60873205, 10701080, and the Beijing Natural Science Foundation under Grant No. 1092011. It is also partially supported by the Foundation of Beijing Education Commission under Grant No. SM200910037005, the Funding Project for Academic Human Resources Development in Institutions of Higher Learning under the Jurisdiction of Beijing Municipality (PHR201006217), and the Foundation of WYJD200902.
文摘Detecting communities from complex networks is an important issue and has attracted attention of researchers in many fields. It is relevant to social tasks, biological inquiries, and technological problems since various networks exist in these systems. This paper proposes a new self-organizing map (SOM) based approach to community detection. By adopting a new operation and a new weightupdating scheme, a complex network can be organized into dense subgraphs according to the topological connection of each node by the SOM algorithm. Extensive numerical experiments show that the performance of the SOM algorithm is good. It can identify communities more accurately than existing methods. This method can be used to detect communities not only in undirected networks, but also in directed networks and bipartite networks.