Analysis methods for cSNPs and differentially expressed genes in transcriptome sequencing data (cited by: 1)

RNA-Seq based analysis on cSNP and gene expression level
Abstract
Objective: To establish a method for analyzing coding-region single nucleotide polymorphisms (cSNPs) and differentially expressed genes in transcriptome sequencing (RNA-Seq) data, and to screen for SNP loci that may alter protein function and for genes that are differentially expressed between cells of different phenotypes.
Methods: RNA-Seq was performed on the gastric cancer cell lines MKN28 and SGC7901 grown under normal culture conditions. The sequencing data were aligned to the reference genome, and statistics were compiled on the number of reads, the number of genes detected, the number of genes upregulated in MKN28 and in SGC7901, the number of SNPs, and the alternative splicing patterns. Online software and databases, combined with computer programming, were used to screen the SNPs in the transcriptome data of the two cell lines and to predict their function. The GO clustering results for the differentially expressed genes of the two cell lines were analyzed and compared.
Results: SNPs in 709 genes belonging to 8 categories were screened and subjected to functional prediction, and 6 SNP loci predicted to alter protein function were identified. Analysis of the differentially expressed genes yielded the expression profile of serine/threonine protein kinases in the two cell lines. Some of the analytical results were verified by Western blotting and PCR.
Conclusion: A method for analyzing cSNP data from transcriptome sequencing was established; it can efficiently screen and analyze large volumes of SNP data. Clustering analysis followed by comparison identified a set of protein kinase genes that are highly expressed in MKN28 and weakly expressed in SGC7901. These results provide a basis for follow-up experiments.
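The abstract does not name the software or scripts used for SNP screening, so the following is only a minimal sketch of the kind of first-pass filter such a pipeline might apply: classifying a coding SNP by its effect on the codon and keeping the non-synonymous ones. The function name `classify_csnp`, the input layout, and the three example variants are hypothetical and are not taken from the paper. Predicting whether an amino acid change actually alters protein function, which the study did with online tools, is not reproduced here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_csnp(ref_codon, pos, alt_base):
    """Classify a single-base change inside a codon.

    ref_codon: reference codon on the coding strand, e.g. "GAA"
    pos:       0-based position of the SNP within the codon (0, 1 or 2)
    alt_base:  the alternative base observed at that position
    """
    alt_codon = ref_codon[:pos] + alt_base + ref_codon[pos + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"      # gain of a stop codon
    if ref_aa == "*":
        return "stop-loss"     # loss of a stop codon
    return "missense"          # amino acid substitution


# Hypothetical toy variants: (label, reference codon, position, alternative base).
variants = [
    ("snp_a", "GAA", 0, "A"),  # GAA (Glu) -> AAA (Lys)
    ("snp_b", "GGT", 2, "C"),  # GGT (Gly) -> GGC (Gly)
    ("snp_c", "TGG", 2, "A"),  # TGG (Trp) -> TGA (stop)
]

# Keep only SNPs that change the encoded protein; these are the candidates
# that would be passed on to a functional-prediction step.
candidates = [
    (label, classify_csnp(codon, pos, alt))
    for label, codon, pos, alt in variants
    if classify_csnp(codon, pos, alt) != "synonymous"
]
print(candidates)
```

In a real analysis the reference codon and SNP position would come from the read alignment and a gene annotation rather than from a hand-written list.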
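Source: Journal of Shanghai Jiao Tong University (Medical Science), 2014, Issue 2, pp. 129-133 (5 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (81171939)
Keywords: cSNP (coding-region single nucleotide polymorphism); transcriptome; RNA-Seq; gene expression difference; gastric cancer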
