基因芯片筛选差异表达基因方法比较被引量：4

Comparison of statistical methods for detecting differential expression in microarray data

下载PDF

导出

摘要使用计算机模拟数据和真实的芯片数据,对8种筛选差异表达基因的方法进行了比较分析,旨在比较不同方法对基因芯片数据的筛选效果。模拟数据分析表明,所使用的8种方法对均匀分布的差异表达基因有很好的识别、检出作用。算法方面,SAM和Wilcoxon秩和检验方法较好;数据分布方面,正态分布的识别效果较好,卡方分布和指数分布的识别效果较差。杨树cDNA芯片分析表明,SAM、Samroc和回归模型方法相近,而Wilcoxon秩和检验方法与它们有较大差异。 DNA microarray is a new tool in biotechnology, which allows simultaneously monitoring thousands of gene expression in cells. The goal of differential gene expression analysis is to detect genes with significant change of gene ex- pression levels arising from experimental conditions. Although various statistical methods have been suggested to confirm differential gene expression, only a few studies compared performance of the statistical methods. This paper presented comparison of statistical methods for finding differentially expressed genes （DEGs） from the microarray data. Using simu- lated and real datasets （Populus cDNA microarray data）, we compared eight methods of identifying differential gene ex- pression. The simulated datasets included four differential distributions （normal distribution, uniform distribution, Z2 distri- bution, and exponential distribution）. The results of simulated datasets analysis showed that the eight methods were more preferable with the microarray data of uniform distribution than normal distribution. They were not preferable with the ~2 distribution and exponential distribution. Of these eight methods, SAM （Significance Analysis of Microarrays） and Wil- coxon rank sum test performed well in most cases. The results of real cDNA microarray data of Populus showed that there was much similarity of SAM, Samroc, and regression modeling approach. Wilcoxon rank sum test was different from them. Samroc and regression modeling approach were similar in the eight methods. For both simulated and real datasets, SAM, Samroc, and regression modeling approach performed better than other methods.

作者单文娟童春发施季森

机构地区南京林业大学国家林业局、江苏省林木遗传和基因工程重点实验室

出处《遗传》 CAS CSCD 北大核心 2008年第12期1640-1646,共7页 Hereditas（Beijing)

基金江苏省自然科学基金“重要模式树种(杨树和杉木功能基因组学研究)”项目(编号:BK2003213)资助~~

关键词基因芯片杨树差异表达 microarray Populus differential expression

分类号 Q75 [生物学—分子生物学]

引文网络
相关文献

参考文献31

1Brent R. Genomic biology. Cell, 2000, 100(1): 169-183.
2Baldi P, Long AD. A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes. Bioinformatics, 2001, 17(6): 509-519.
3Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW. On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data. J Comput Biol, 2001, 8(1): 37-52.
4Lonnstedt I, Speed TP. Replicated microarray data. Star Sin, 2002, 12: 31-46.
5Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol, 2004, 3: Article 3.
6Tusher VG, Tibshirani R, Chu G, Significance analysis of microarrays applied to transcriptional responses to ionizing radiation. Proc Natl Acad Sci USA, 2001, 98:5116-5121.
7Dudoit S, Yang YH, Speed TP, Callow MJ. Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Stat Sin, 2002, 12: 111-139.
8Pan W, Lin J, Le C. A mixture model approach to detecting differentially expressed genes with microarray data. Funct Integr Genomics, 2003, 3(3): 117-124.
9Nykter M, Aho T, Ahdesmaiki M, Ruusuvuori P, Lehmussola A, Yli-Harja O. Simulation of microarray data with realistic characteristics. BMC Bioinformatics, 2006, 7: 349.
10Kim SY, Lee JW, Sohn IS. Comparison of various statistical methods for identifying differential gene expression in replicated microarray data. Stat Methods Med Res, 2006, 15(1): 3-20.

同被引文献123

1蒋定锋,高峻,赵耐青.乳腺癌基因芯片数据分析[J].复旦学报（医学版）,2005,32(2):169-172. 被引量：2
2孙薇,贺福初.差异蛋白质组学研究技术新进展[J].化学通报,2005,68(6):401-407. 被引量：12
3王洪宝,王启贵,李辉.利用基因芯片技术研究两品种鸡脂肪组织差异表达基因[J].生物工程学报,2005,21(6):979-982. 被引量：7
4荆志伟,王忠,王永炎,高思华.基因芯片数据分析方法研究进展[J].生物技术通讯,2007,18(1):144-148. 被引量：5
5Yu H, Chen X, Hong Y Y, Wang Y, Xu P, Ke S D, Liu H Y, Zhu J K, Oliver D J, Xiang C B. Activated expression of an Arabidopsis HD-START protein confers drought tolerance with improved root system and reduced stomatal density. The Plant Cell, 2008, 20: 1134-1151.
6Zhu T. Global analysis of gene expression using GeneChip microarrays. Current Opinion in Plant Biology, 2003, 6:418-425.
7Jung C , Lyou S H, Yeu S Y, Kim M A, Rhee S, Kim M, Lee J S, Choi Y D, Cheong J J. Microarray-based screening of jasmonateresponsive genes in Arabidopsis thaliana. Plant Cell Reports, 2007, 26:1053-1063.
8Guo P G, BaumM, Li R H, Grando S, Varshney R K, Granet, A, Ceccarelli S, Valkoun J. Transcriptional analysis of barley genes in response to drought stress at the reproductive growth stage using affymetrix Barley 1 genechip. Journal of Guangzhou University: Naturaf Science Edition, 2007, 6(5): 36-41.
9Seki M, Ishida J, Narusaka M, Fujita M, Nanjo T, Umezawa T, Kamiya A, Nakajima M, Enju A, Sakurai T, Satou M, Akiyama K J, Yamaguehi-Shinozaki K, Caminci P, Kawai J, Hayashizaki Y, Shinozaki K. Monitoring the expression pattern of around 7000 Arabidopsis genes under ABA treatments using a full-length eDNA mieroarray. Functional andlntegrative Genomics, 2002, 2: 282-291.
10Kawasaki S, Borehert C, Deyholos M, Wang H, Brazille S, Kawai K, Galbraith D, Bohnert H J. Gene expression profiles during the initial phase of salt stress in rice. The Plant Cell, 2001, 13: 889-905.

引证文献4

1宋雯雯,李文滨,韩雪,高慕娟,王继安.干旱胁迫下大豆幼苗根系基因的表达谱分析[J].中国农业科学,2010,43(22):4579-4586. 被引量：9
2栾德琴,常国斌,盛中伟,黄正洋,周伟,龚琳琳,陈丹艳,刘岩,王克华,窦套存,陈国宏.如皋鸡不同时期肌肉组织生长相关基因的表达谱分析[J].畜牧兽医学报,2012,43(1):14-21. 被引量：4
3刘正龙,王洪平,杨艳梅,罗玉军.基因表达差异谱数据的显著性分析方法[J].数理医药学杂志,2015,28(2):161-163.
4王锦霞,常乘,马洁,吴松锋,庄举娟,朱云平.基于质谱技术筛选差异表达蛋白的统计学策略研究进展[J].中国科学：生命科学,2015,45(4):347-358.

二级引证文献13

1罗延青,俎峰,李劲峰,赵凯琴,王敬乔,符明联.甘、芥种间杂交后代DH株系K0959 PEG干旱胁迫差异表达研究[J].南方农业学报,2011,42(12):1454-1457. 被引量：3
2肖翔,官春云,尹明智,李栒,官梅.基因芯片技术在农业中应用的研究进展[J].中国农学通报,2012,28(33):187-193. 被引量：10
3张旭,马红,刘娣,孙亚蒙,汪亮.小鼠黏着斑相关蛋白在不同组织中的表达分析[J].畜牧与兽医,2013,45(6):59-61. 被引量：2
4范锋贵,金秀锋,叶石,任万杰,张帆,鞠丽萍,王宏礼,张晓科.晋麦47幼苗叶片水分胁迫差异表达蛋白的鉴定与分析[J].麦类作物学报,2013,33(3):566-572. 被引量：3
5王洪志,常国斌,马腾,翟飞,夏明秀,刘璐,陈静,徐璐,陈国宏.基于时序表达谱芯片挖掘鸡脂肪酸代谢关键调控基因[J].畜牧兽医学报,2013,44(12):1882-1890. 被引量：1
6吴奎,梁宏伟,李忠,王丹,王春枝,邹桂伟.尼罗罗非鱼雌雄鱼肌肉组织差异表达基因的筛选[J].水产学报,2014,38(3):316-324. 被引量：3
7白鹏,冉春艳,谢小玉.干旱胁迫对油菜蕾薹期生理特性及农艺性状的影响[J].中国农业科学,2014,47(18):3566-3576. 被引量：76
8李培培,王接弟,孔广超.PEG胁迫下玉米基因表达谱分析[J].基因组学与应用生物学,2014,33(2):365-373. 被引量：5
9韩泽刚,赵曾强,何兰兰,柴蒙亮,李会会,张薇.枯萎病菌诱导感、抗陆地棉品种的转录因子表达变化[J].作物学报,2015,41(2):228-239. 被引量：3
10韩泽刚,赵曾强,何兰兰,柴蒙亮,李会会,张薇.应用Solexa测序技术分析棉花不同抗病品种的数字表达谱[J].核农学报,2015,29(4):651-662. 被引量：6

1雷文英,刘娜,张龙.Visual Basic处理浮点DSP芯片数据的方法[J].石油仪器,2010,24(4):69-71.
2产业利好密码芯片冲刺在即——国民技术成功承办“2011密码芯片分析与测评技术论坛”[J].信息安全与通信保密,2011,9(11):33-33.
3“2011密码芯片分析和测评技术论坛”深圳召开[J].中国集成电路,2011,20(12):9-9.
4魏大木,陶宏才,李伟,李斌.数据挖掘在基因芯片中的探索[J].中国科技博览,2010(9):252-252.
5李沐雨,王星.高维异方差高斯混合罚模型聚类[J].数学的实践与认识,2013,43(5):163-170.
6张扬.C4X系列打印机93C46芯片数据意义的初探[J].家电维修,2006(2):29-30.
7杜丽,庞振凌,周索,曾晓慧,包满珠.香樟胚性愈伤组织遗传转化体系建立[J].林业科学,2008,44(4):54-59. 被引量：12
8张小川,于旭庭,张宜浩.一种改进的向量空间模型的文本表示算法[J].重庆理工大学学报（自然科学）,2017,31(1):87-92. 被引量：8
9张文军,商鸿生.连续性空间分布型模型与应用研究[J].应用生态学报,1992,3(2):169-172. 被引量：3
10秦汉.如何恢复iPhone丢失的数据[J].计算机与网络,2015,41(17):43-43.

遗传

2008年第12期

浏览历史

内容加载中请稍等...

基因芯片筛选差异表达基因方法比较被引量：4

参考文献31

同被引文献123

引证文献4

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

基因芯片筛选差异表达基因方法比较 被引量：4

参考文献31

同被引文献123

引证文献4

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

基因芯片筛选差异表达基因方法比较被引量：4