Coordination of cell differentiation and proliferation is a key issue in the development process of multi-cellular organisms and stem cells. Here we provide evidence that the establishment of adipocyte differentiation...Coordination of cell differentiation and proliferation is a key issue in the development process of multi-cellular organisms and stem cells. Here we provide evidence that the establishment of adipocyte differentiation of 3T3-L1 cells requires two processes: the licensing of an adipogenesis gene-expression program within a particular growth-arrest stage, i.e., the contact-inhibition stage, and then the execution of this program in a cell-cycle-independent manner, by which the licensed progenitors are differentiated into adipocytes in the presence of inducing factors. Our results showed that differentiation licensing of 3T3-L1 cells during the contact-inhibition stage involved epigenetic modifications such as DNA methylation and histone modifications, whereas disturbing these epigenetic modifications by DNA methylation inhibitors or RNAi during the contact-inhibition stage significantly reduced adipogenesis efficiency. More importantly, when these licensed 3T3-L1 cells were re-cultured under non-differentiating conditions or treated only with insulin, this adipogenesis commitment could be maintained from one cell generation to the next, whereby the licensed program could be activated in a cell-cycle-independent manner once these cells were subjected to adipo- genesis-inducing conditions. This result suggests that differentiation licensing and differentiation execution can be uncoupled and disparately linked to cell proliferation. Our findings deliver a new concept that cell-fate decision can be subdivided into at least two stages, licensing and execution, which might have different regulatory relationships with cell proliferation. In addition, this new concept may provide a clue for developing new strategies against obesity.展开更多
Chloroplast is a typical plant cell organelle where photosynthesis takes place. In this study, a total of 1 808 chloroplast core proteins in Arabidopsis thaliana were reliably identified by combining the results of pr...Chloroplast is a typical plant cell organelle where photosynthesis takes place. In this study, a total of 1 808 chloroplast core proteins in Arabidopsis thaliana were reliably identified by combining the results of previously published studies and our own predictions. We then constructed a chloroplast protein interaction network primarily based on these core protein interactions. The network had 22 925 protein interaction pairs which involved 2 214 proteins. A total of 160 previously uncharacterized proteins were annotated in this network. The subunits of the photosynthetic complexes were modularized, and the functional relationships among photosystem Ⅰ (PSI), photosystem Ⅱ (PSII), light harvesting complex of photosystem Ⅰ (LHC Ⅰ) and light harvesting complex of photosystem Ⅰ (LHC Ⅱ) could be deduced from the predicted protein interactions in this network. We further confirmed an interaction between an unknown protein AT1G52220 and a photosynthetic subunit PSI-D2 by yeast two-hybrid analysis. Our chloroplast protein interaction network should be useful for functional mining of photosynthetic proteins and investigation of chloroplast-related functions at the systems biology level in Arabidopsis.展开更多
Leptospirosis is a widespread zoonotic disease caused by pathogenic spirochetes of the genus Leptospira that infects humans and a wide range of animals. By combining computational prediction and high-accuracy tandem m...Leptospirosis is a widespread zoonotic disease caused by pathogenic spirochetes of the genus Leptospira that infects humans and a wide range of animals. By combining computational prediction and high-accuracy tandem mass spectra, we revised the genome annotation of Leptospira interrogans serovar Lai, a free-living pathogenic spirochete responsible for leptospirosis, providing substantial peptide evidence for novel genes and new gene boundaries. Subsequently, we presented a high-coverage proteome analysis of protein expression and multiple posttranslational modifications (PTMs). Approximately 64.3% of the predicted L. interrogans proteins were cataloged by detecting 2 540 proteins. Meanwhile, a profile of multiple PTMs was concurrently established, containing in total 32 phosphorylated, 46 acetylated and 155 methylated proteins. The PTM systems in the serovar Lai show unique features. Unique eukaryotic-like features of L. interrogans protein modifications were demonstrated in both phosphorylation and arginine methylation. This systematic analysis provides not only comprehensive information of high-coverage protein expression and multiple modifications in prokaryotes but also a view suggesting that the evolutionarily primitive L. interrogans shares significant similarities in protein modification systems with eukaryotes.展开更多
Objective To provide a set of useful analysis tools for the researchers to explore the microRNA data. Methods The R language was used for generating the Graphical Users Interface and implementing most functions. Some ...Objective To provide a set of useful analysis tools for the researchers to explore the microRNA data. Methods The R language was used for generating the Graphical Users Interface and implementing most functions. Some Practical Extraction and Report Language (Perl) scripts were used for parsing source files. Results We developed a graphical R package named miRE, which was designated for the analysis of microRNA functions, genomic organization, etc. This package provided effective and convenient tools for molecular biologists to deal with routine analyses in microRNA-related research. With its help, the users would be able to build a desktop- centered microRNA research environment quite easily and effectively. miRE is freely available at http://www. biosino.org/~kanghu/WorkPresentation/miRE/miRE.html. A detailed user manual and tutorials with example code and image are also available. Conclusion miRE is a tool providing an open-source, user-friendly, integrated interface for microRNA-related analysis. With its help, researchers can perform microRNA-related analysis more efficiently.展开更多
Tumor development is a process involving loss of the differentiation phenotype and acquisition of stem-like characteristics,which is driven by intracellular rewiring of signaling network.The measurement of network rep...Tumor development is a process involving loss of the differentiation phenotype and acquisition of stem-like characteristics,which is driven by intracellular rewiring of signaling network.The measurement of network reprogramming and disorder would be challenging due to the complexity and heterogeneity of tumors.Here,we proposed signaling entropy(SR)to assess the degree of tumor network disorder.We calculated SR for 33 tumor types in The Cancer Genome Atlas database based on transcrip-tomic and proteomic data.The SR of tumors was significantly higher than that of normal samples and was highly correlated with cell sternness,cancer type,tumor grade,and metastasis.We further demonstrated the sensitivity and accuracy of using local SR in prognosis prediction and drug response evaluation.Overall,SR could reveal cancer network disorders related to tumor malignant potency,clinical prognosis,and drug response.展开更多
Background: For Chinese patients with hepatocellular carcinoma (HCC), surgical resection is the most important treatment to achieve long-term survival for patients with an early-stage tumor, and yet the prognosis a...Background: For Chinese patients with hepatocellular carcinoma (HCC), surgical resection is the most important treatment to achieve long-term survival for patients with an early-stage tumor, and yet the prognosis after surgery is diverse. We aimed to construct a scoring system (Shanghai Score) for individualized prognosis estimation and adjuvant treatment evaluation. Methods: A multivariate Cox proportional hazards model was constructed based on 4166 HCC patients undergoing resection during 2001-2008 at Zhongshan Hospital. Age, hepatitis B surface antigen, hepatitis B e antigen, partial thromboplastin time, total bilirubin, alkaline phosphatase, y-glutamyltransferase, a-fetoprotein, tumor size, cirrhosis, vascular invasion, differentiation, encapsulation, and tumor number were finally retained by a backward step-down selection process with the Akaike information criterion. The Harrell's concordance index (C-index) was used to measure model performance. Shanghai Score is calculated by summing the products of the 14 variable values times each variable's corresponding regression coefficient. Totally 1978 patients from Zhongshan Hospital undergoing resection during 2009-2012, 808 patients from Eastern Hepatobiliary Surgery Hospital during 2008-2010, and 244 patients from Tianjin Medical University Cancer Hospital during 2010-2011 were enrolled as external validation cohorts. Shanghai Score was also implied in evaluating adjuvant treatment choices based on propensity score matching analysis.Results: Shanghai Score showed good calibration and discrimination in postsurgical HCC patients. The bootstrap-corrected C-index (confidence interval [CI]) was 0.74 for overall survival (OS) and 0.68 for recurrence-free survival (RFS) in derivation cohort (4166 patients), and in the three independent validation cohorts, the CIs for OS ranged 0.70 0.72 and that for RFS ranged 0.63 0.68. Furthermore, Shanghai Score provided evaluation for adjuvant treatment choices (transcatheter arterial chemoembolization or interferon-a). The identified subset of patients at low risk could be ideal candidates for curative surgery, and subsets of patients at moderate or high risk could be recommended with possible adjuvant therapies after surgery. Finally, a web server with individualized outcome prediction and treatment recommendation was constructed. Conclusions: Based on the largest cohort up to date, we established Shanghai Score - an individualized outcome prediction system specifically designed for Chinese HCC patients after surgery. The Shanghai Score web server provides an easily accessible tool to stratify the prognosis of patients undergoing liver resection for HCC.展开更多
The application of next-generation sequencing (NGS) technology in cancer is influenced by the quality and purity of tissue samples. This issue is especially critical for patient-derived xenograft (PDX) models, whi...The application of next-generation sequencing (NGS) technology in cancer is influenced by the quality and purity of tissue samples. This issue is especially critical for patient-derived xenograft (PDX) models, which have proven to be by far the best preclinical tool for investigating human tumor biology, because the sensitivity and specificity of NGS analysis in xenograft samples would be compromised by the contamination of mouse DNA and RNA. This definitely affects downstream analyses by causing inaccurate mutation calling and gene expression estimates. The reliability of NGS data analysis for cancer xenograft samples is therefore highly dependent on whether the sequencing reads derived from the xenograft could be distinguished from those originated from the host. That is, each sequence read needs to be accurately assigned to its original species. Here, we review currently available methodologies in this field, including Xenome, Disambiguate, bamcmp and pdxBlacklist, and provide guidelines for users.展开更多
The implementation of cancer precision medicine requires biomarkers or signatures for predicting prognosis and therapeutic benefits.Most of current efforts in this field are paying much more attention to predictive ac...The implementation of cancer precision medicine requires biomarkers or signatures for predicting prognosis and therapeutic benefits.Most of current efforts in this field are paying much more attention to predictive accuracy than to molecular mechanistic interpretability.Mechanism-driven strategy has recently emerged,aiming to build signatures with both predictive power and explanatory power.Driven by this strategy,we developed a robust gene dysregulation analysis framework with machine learning algorithms,which is capable of exploring gene dysregulations underlying carcinogenesis from high-dimensional data with cooperativity and synergy between regulators and several other transcriptional regulation rules taken into consideration.We then applied the framework to a colorectal cancer(CRC)cohort from The Cancer Genome Atlas.The identified CRC-related dysregulations significantly covered known carcinogenic processes and exhibited good prognostic effect.By choosing dysregulations with greedy strategy,we built a four-dysregulation(4-DysReg)signature,which has the capability of predicting prognosis and adjuvant chemotherapy benefit.4-DysReg has the potential to explain carcinogenesis in terms of dysfunctional transcriptional regulation.These results demonstrate that our gene dysregulation analysis framework could be used to develop predictive signature with mechanistic interpretability for cancer precision medicine,and furthermore,elucidate the mechanisms of carcinogenesis.展开更多
文摘Coordination of cell differentiation and proliferation is a key issue in the development process of multi-cellular organisms and stem cells. Here we provide evidence that the establishment of adipocyte differentiation of 3T3-L1 cells requires two processes: the licensing of an adipogenesis gene-expression program within a particular growth-arrest stage, i.e., the contact-inhibition stage, and then the execution of this program in a cell-cycle-independent manner, by which the licensed progenitors are differentiated into adipocytes in the presence of inducing factors. Our results showed that differentiation licensing of 3T3-L1 cells during the contact-inhibition stage involved epigenetic modifications such as DNA methylation and histone modifications, whereas disturbing these epigenetic modifications by DNA methylation inhibitors or RNAi during the contact-inhibition stage significantly reduced adipogenesis efficiency. More importantly, when these licensed 3T3-L1 cells were re-cultured under non-differentiating conditions or treated only with insulin, this adipogenesis commitment could be maintained from one cell generation to the next, whereby the licensed program could be activated in a cell-cycle-independent manner once these cells were subjected to adipo- genesis-inducing conditions. This result suggests that differentiation licensing and differentiation execution can be uncoupled and disparately linked to cell proliferation. Our findings deliver a new concept that cell-fate decision can be subdivided into at least two stages, licensing and execution, which might have different regulatory relationships with cell proliferation. In addition, this new concept may provide a clue for developing new strategies against obesity.
基金Acknowledgements We thank the RIKEN BRC in Japan for provision of all full-length cDNA in this study. National Natural Science Foundation of China (grants numbers 30530100 and 90408010), the State Key Program of Basic Research of China (grant numbers 2007CB947600 and 2007CB108800), and Hi-Tech Research and Development Program of China (grant number 2006AA02Z313) supported this project.
文摘Chloroplast is a typical plant cell organelle where photosynthesis takes place. In this study, a total of 1 808 chloroplast core proteins in Arabidopsis thaliana were reliably identified by combining the results of previously published studies and our own predictions. We then constructed a chloroplast protein interaction network primarily based on these core protein interactions. The network had 22 925 protein interaction pairs which involved 2 214 proteins. A total of 160 previously uncharacterized proteins were annotated in this network. The subunits of the photosynthetic complexes were modularized, and the functional relationships among photosystem Ⅰ (PSI), photosystem Ⅱ (PSII), light harvesting complex of photosystem Ⅰ (LHC Ⅰ) and light harvesting complex of photosystem Ⅰ (LHC Ⅱ) could be deduced from the predicted protein interactions in this network. We further confirmed an interaction between an unknown protein AT1G52220 and a photosynthetic subunit PSI-D2 by yeast two-hybrid analysis. Our chloroplast protein interaction network should be useful for functional mining of photosynthetic proteins and investigation of chloroplast-related functions at the systems biology level in Arabidopsis.
文摘Leptospirosis is a widespread zoonotic disease caused by pathogenic spirochetes of the genus Leptospira that infects humans and a wide range of animals. By combining computational prediction and high-accuracy tandem mass spectra, we revised the genome annotation of Leptospira interrogans serovar Lai, a free-living pathogenic spirochete responsible for leptospirosis, providing substantial peptide evidence for novel genes and new gene boundaries. Subsequently, we presented a high-coverage proteome analysis of protein expression and multiple posttranslational modifications (PTMs). Approximately 64.3% of the predicted L. interrogans proteins were cataloged by detecting 2 540 proteins. Meanwhile, a profile of multiple PTMs was concurrently established, containing in total 32 phosphorylated, 46 acetylated and 155 methylated proteins. The PTM systems in the serovar Lai show unique features. Unique eukaryotic-like features of L. interrogans protein modifications were demonstrated in both phosphorylation and arginine methylation. This systematic analysis provides not only comprehensive information of high-coverage protein expression and multiple modifications in prokaryotes but also a view suggesting that the evolutionarily primitive L. interrogans shares significant similarities in protein modification systems with eukaryotes.
基金Supported by the National Basic Research Programs of China (2006CB504100, 2007CB946900) National Science Foundation of China (90612019, 30721063, 30430200, 90608022)Program for New Century Excellent Talents in University (NCET-07-0505).
文摘Objective To provide a set of useful analysis tools for the researchers to explore the microRNA data. Methods The R language was used for generating the Graphical Users Interface and implementing most functions. Some Practical Extraction and Report Language (Perl) scripts were used for parsing source files. Results We developed a graphical R package named miRE, which was designated for the analysis of microRNA functions, genomic organization, etc. This package provided effective and convenient tools for molecular biologists to deal with routine analyses in microRNA-related research. With its help, the users would be able to build a desktop- centered microRNA research environment quite easily and effectively. miRE is freely available at http://www. biosino.org/~kanghu/WorkPresentation/miRE/miRE.html. A detailed user manual and tutorials with example code and image are also available. Conclusion miRE is a tool providing an open-source, user-friendly, integrated interface for microRNA-related analysis. With its help, researchers can perform microRNA-related analysis more efficiently.
文摘Tumor development is a process involving loss of the differentiation phenotype and acquisition of stem-like characteristics,which is driven by intracellular rewiring of signaling network.The measurement of network reprogramming and disorder would be challenging due to the complexity and heterogeneity of tumors.Here,we proposed signaling entropy(SR)to assess the degree of tumor network disorder.We calculated SR for 33 tumor types in The Cancer Genome Atlas database based on transcrip-tomic and proteomic data.The SR of tumors was significantly higher than that of normal samples and was highly correlated with cell sternness,cancer type,tumor grade,and metastasis.We further demonstrated the sensitivity and accuracy of using local SR in prognosis prediction and drug response evaluation.Overall,SR could reveal cancer network disorders related to tumor malignant potency,clinical prognosis,and drug response.
基金This study was supported by grants from the National High Technology Research and Development Program (863 Program) of China (No. 2015AA020401), National Key Research and Development Program of China (No. 2016YFC0904101), State Key Program of National Natural Science Foundation of China (No. 81530077), National Natural Science Foundation of China (No. 81372317, 81472676, and 81572823), Projects from the Shanghai Science and Technology Commission (No. 14DZ1940300, 14411970200, 134119a1201, and 14140902301), and Specialized Research Fund for the Doctoral Program of Higher Education and Research Grants Council Earmarked Research Grants Joint Research Scheme (No. 20130071140008).
文摘Background: For Chinese patients with hepatocellular carcinoma (HCC), surgical resection is the most important treatment to achieve long-term survival for patients with an early-stage tumor, and yet the prognosis after surgery is diverse. We aimed to construct a scoring system (Shanghai Score) for individualized prognosis estimation and adjuvant treatment evaluation. Methods: A multivariate Cox proportional hazards model was constructed based on 4166 HCC patients undergoing resection during 2001-2008 at Zhongshan Hospital. Age, hepatitis B surface antigen, hepatitis B e antigen, partial thromboplastin time, total bilirubin, alkaline phosphatase, y-glutamyltransferase, a-fetoprotein, tumor size, cirrhosis, vascular invasion, differentiation, encapsulation, and tumor number were finally retained by a backward step-down selection process with the Akaike information criterion. The Harrell's concordance index (C-index) was used to measure model performance. Shanghai Score is calculated by summing the products of the 14 variable values times each variable's corresponding regression coefficient. Totally 1978 patients from Zhongshan Hospital undergoing resection during 2009-2012, 808 patients from Eastern Hepatobiliary Surgery Hospital during 2008-2010, and 244 patients from Tianjin Medical University Cancer Hospital during 2010-2011 were enrolled as external validation cohorts. Shanghai Score was also implied in evaluating adjuvant treatment choices based on propensity score matching analysis.Results: Shanghai Score showed good calibration and discrimination in postsurgical HCC patients. The bootstrap-corrected C-index (confidence interval [CI]) was 0.74 for overall survival (OS) and 0.68 for recurrence-free survival (RFS) in derivation cohort (4166 patients), and in the three independent validation cohorts, the CIs for OS ranged 0.70 0.72 and that for RFS ranged 0.63 0.68. Furthermore, Shanghai Score provided evaluation for adjuvant treatment choices (transcatheter arterial chemoembolization or interferon-a). The identified subset of patients at low risk could be ideal candidates for curative surgery, and subsets of patients at moderate or high risk could be recommended with possible adjuvant therapies after surgery. Finally, a web server with individualized outcome prediction and treatment recommendation was constructed. Conclusions: Based on the largest cohort up to date, we established Shanghai Score - an individualized outcome prediction system specifically designed for Chinese HCC patients after surgery. The Shanghai Score web server provides an easily accessible tool to stratify the prognosis of patients undergoing liver resection for HCC.
基金supported by the grants from the National Natural Science Foundation of China(Nos.,81672736 and 91529302)the Shanghai Industrial Technology Institute(17CXXF008)+3 种基金the Shanghai Sailing Program(16YF1408600)the Shanghai Municipal Commission of Science and Technology(14DZ2252000)the administrative committee of Shanghai Zhangjiang Hi-Teck Park(2016e08)the Medical engineering cross fund of Shanghai Jiao Tong University(YG2015QN27)
文摘The application of next-generation sequencing (NGS) technology in cancer is influenced by the quality and purity of tissue samples. This issue is especially critical for patient-derived xenograft (PDX) models, which have proven to be by far the best preclinical tool for investigating human tumor biology, because the sensitivity and specificity of NGS analysis in xenograft samples would be compromised by the contamination of mouse DNA and RNA. This definitely affects downstream analyses by causing inaccurate mutation calling and gene expression estimates. The reliability of NGS data analysis for cancer xenograft samples is therefore highly dependent on whether the sequencing reads derived from the xenograft could be distinguished from those originated from the host. That is, each sequence read needs to be accurately assigned to its original species. Here, we review currently available methodologies in this field, including Xenome, Disambiguate, bamcmp and pdxBlacklist, and provide guidelines for users.
基金This work was supported by the grants from the National Natural Science Foundation of China(81672736)the National Key R&D Program of China(2018YFC0910500)+1 种基金Shanghai Municipal Science and Technology(2017SHZDZX01 and 18DZ2294200)NIH CPTAC(Cancer Proteomic Tumor Analysis Consortium)program.
文摘The implementation of cancer precision medicine requires biomarkers or signatures for predicting prognosis and therapeutic benefits.Most of current efforts in this field are paying much more attention to predictive accuracy than to molecular mechanistic interpretability.Mechanism-driven strategy has recently emerged,aiming to build signatures with both predictive power and explanatory power.Driven by this strategy,we developed a robust gene dysregulation analysis framework with machine learning algorithms,which is capable of exploring gene dysregulations underlying carcinogenesis from high-dimensional data with cooperativity and synergy between regulators and several other transcriptional regulation rules taken into consideration.We then applied the framework to a colorectal cancer(CRC)cohort from The Cancer Genome Atlas.The identified CRC-related dysregulations significantly covered known carcinogenic processes and exhibited good prognostic effect.By choosing dysregulations with greedy strategy,we built a four-dysregulation(4-DysReg)signature,which has the capability of predicting prognosis and adjuvant chemotherapy benefit.4-DysReg has the potential to explain carcinogenesis in terms of dysfunctional transcriptional regulation.These results demonstrate that our gene dysregulation analysis framework could be used to develop predictive signature with mechanistic interpretability for cancer precision medicine,and furthermore,elucidate the mechanisms of carcinogenesis.