期刊文献+

系列RNA-seq数据的表达模式聚类方法研究

Gene Expression Pattern Clustering for Serial RNA-seq Libraries
原文传递
导出
摘要 目的转录组测序技术为研究特定组织细胞生理状态和分子水平变化提供有力方法。为了建立分子水平变化和组织细胞生理功能之间的关系并排除随机因素的干扰,需要建立基于系列RNA-seq数据的表达模式分析方法。方法本文提出了一种整合的方法(geneexpressionclustermethod,GECluster)用于对系列样本模式聚类。整合曲线拟合以及信息熵建立模型并提取特征属性,最后按照上面模型提供的特征属性对基因进行层次聚类分析。结果表达趋势一致的基因被很好地聚到一个类别中,功能富集分析发现这些基因具有很强的功能相关性,并与文献报道相吻合。结论GECluster可以更灵活客观对多样本系列RNA—seq数据挖掘共表达基因,为后期功能分析提供了更有效的研究方案。 Objective Transcriptome sequencingplays an increasingly important role in biologi- cal science, which provides the most direct evidence of the relationship between physiological state and molecular changes. This study examined a new method that could be used to analyze the biologi- cal information based on serial analysis of gene expression (SAGE) . Methods ~ (GEClus- ter) was employed to examine a serial of samples by means of cluster analysis. The model was estab- lished for identification of gene features by using polynomial fitting and Shannon' s entropy. Hierar- chical clustering algorithm was performed according to the gene features. Then functional enrichment analysis (FEA) was used for specific category obtained by GECluster. Results The genes that have similar expression pattern were found to fall in the same category. FEA showed these genes had strong associations in terms of their function, which was in line with the reports by some litera- tures. Conclusion GECluster provides an objective and flexible method for identification of co-ex- pressed gene based on RNA-seq data.
出处 《医学分子生物学杂志》 CAS 2013年第1期41-46,共6页 Journal of Medical Molecular Biology
基金 国家自然科学基金(N0.61170154,91129710),高等学校博士学科点专项科研基金(No.20102307120027,20102307110022)
关键词 转录组测序 曲线拟合 信息熵 聚类分析 RNA-seq polynomial fitting Shannon' s entropy hierarchical clustering
  • 相关文献

参考文献18

  • 1DERISI J,PENLAND L,BROWN P 0,et al. Use of a cD-NA microarray to analyse gene expression patterns in hu-man cancer [ J]. Nat Genet, 1996 ,14(4) ;457-460.
  • 2KATZ Y,WANG E T,AIR0LDI E M,et al. Analysis anddesign of RNA sequencing experiments for identifying iso-form regulation [ J ] . Nat Methods,2010,7 ( 12 ) : 1009-1015.
  • 3ROYCHOWDHURY S,IYER M K, ROBINSON D R,etal. Personalized oncology through integrative high-through-put sequencing:a pilot study [ J]. Sci Transl Med,2011,3(111):lllral21.
  • 4MILLS J D, JANITZ M. Alternative splicing ofmRNA inthe molecular pathology of neurodegenerative diseases[J] . Neurobiol Aging,2011,33(5) :el 1 -e24.
  • 5MALONE J H,OLIVER B. Microarrays,deep sequencingand the true measure of the transcriptome [ J]. BMC Biol,2011,9:34.
  • 6OZSOLAK F,MILOS P M. RNA sequencing: advances,challenges and opportunities [ J]. Nat Rev Genet,2011,12(2) :87-98.
  • 7PAREEK C S,SMOCZYNSKI R,TRETYN A,et al. Se-quencing technologies and genome sequencing [ J ]. J Ap-pl Genet,2011 ,52(4) :413-435.
  • 8GARBER M,GRABHERR M G, GUTTMAN M,et al.Computational methods for transcriptome annotation andqaantification using RNA-seq [ J] . Nat Methods,2011,8(6):469477.
  • 9FANG Z,CUJ X. Design and validation issues in RNA-seqexperiments [ J]. Brief Bioinform,2011,12(3) :280-287.
  • 10LOOMIS W F,SHAULSKY G. Developmental changes intranscriptional profiles [ J ]. Dev Growth Differ, 2011,53(4):567-575.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部