
scLM: Automatic Detection of Consensus Gene Clusters Across Multiple Single-cell Datasets (cited by: 2)

Abstract: In gene expression profiling studies, including single-cell RNA sequencing (scRNA-seq) analyses, the identification and characterization of co-expressed genes provide critical information on cell identity and function. Gene co-expression clustering in scRNA-seq data presents certain challenges. We show that commonly used methods for single-cell data cannot identify co-expressed genes accurately, and that their results fall substantially short of what is biologically expected of co-expressed genes. Herein, we present the single-cell Latent-variable Model (scLM), a gene co-clustering algorithm tailored to single-cell data that performs well at detecting gene clusters with significant biological context. Importantly, scLM can cluster multiple single-cell datasets simultaneously (i.e., consensus clustering), enabling users to leverage single-cell data from multiple sources for novel comparative analysis. scLM takes raw count data as input and preserves biological variation without being influenced by batch effects from multiple datasets. Results from both simulated and experimental data demonstrate that scLM outperforms existing methods with considerably improved accuracy. To illustrate the biological insights that scLM provides, we apply it to our in-house and public experimental scRNA-seq datasets. scLM identifies novel functional gene modules and refines cell states, which facilitates mechanism discovery and the understanding of complex biosystems such as cancers. A user-friendly R package with all the key features of the scLM method is available at https://github.com/QSong-github/scLM.
Source: Genomics, Proteomics & Bioinformatics (基因组蛋白质组与生物信息学报(英文版); indexed in SCIE, CAS, CSCD), 2021, Issue 2, pp. 330-341 (12 pages)
Funding: The Cancer Genomics, Tumor Tissue Repository, and Bioinformatics Shared Resources under the NCI Cancer Center Support Grant to the Comprehensive Cancer Center of Wake Forest University Health Sciences, USA (Grant No. P30CA012197).
