RobustDEA:一种快速鲁棒的RNA-Seq数据寻找差异表达基因方法

RobustDEA:A fast and robust method for finding differentially expressed genes on RNA-Seq data

下载PDF

导出

摘要第二代高通量RNA-Seq测序技术已成为转录组分析的标准技术手段.寻找差异表达基因作为RNA-Seq测序数据分析中最基本任务之一,提出了大量的分析方法.但是这些不同方法检测出的差异基因往往存在结果不一致性,并且综述性评估已经证明单一方法无法在所有数据集中一直保持优势.因此,提出了一种快速鲁棒的RNA-Seq数据寻找差异表达基因方法RobustDEA,通过自动加权方式结合多种寻找差异表达基因方法,其权值可快速的数据集中学习获得,能有效的体现不同数据集的特点,从而使得RobustDEA方法在不同数据集上都可获得稳定的结果.通过包含qRT-PCR验证的人类大脑数据集和多个老鼠数据集的评估,相比于单个差异表达基因方法和其他结合方法,RobustDEA方法都能获得最准确的预测结果,且表现出很好的鲁棒性能.此外,与PANDOR结合方法相比,RobustDEA方法能大幅度提高计算效率. The next-generation high-throughput RNA-Seq sequencing technology has become the standard and important technique for transcriptome analysis.Finding differentially expressed genes is one of the most basic tasks in RNA-Seq data analysis,and a large number of statistic methods have been proposed.However,the differential genes detected by these methods are often inconsistent.Some systematic evaluation experiments have proved that no single method can maintain its advantages in all RNA-Seq datasets.Therefore,we propose a fast and robust method for finding differentially expressed genes in RAN-Seq data.RobustDEA combines multiple methods by weighting,and its weights can be quickly learned from the dataset.Because these weights reflect the characteristics of the dataset,RobustDEA is able to obtain stable results on various RNA-Seq datasets.A human brain dataset with qRT-PCR validation,mouse and rat RNA-Seq datasets are used to evaluate our proposed method.Compared dataset with any single method and other combined methods,RobustDEA obtains the most accurate results and shows better robustness.In addition,RobustDEA can significantly improve computational efficiency compared with PANDOR.

作者张礼王嘉瑞吴东洋 ZHANG Li;WANG Jiarui;WU Dongyang(College of Computer Science and Technology, Nanjing Forestry University, Nanjing 210016, China)

机构地区南京林业大学信息科学与技术学院

出处《江苏科技大学学报（自然科学版）》 CAS 北大核心 2021年第6期51-58,共8页 Journal of Jiangsu University of Science and Technology:Natural Science Edition

基金国家自然科学青年基金资助项目(61802193) 江苏省自然科学基金资助项目(BK20170934) 南京林业大学青年科技创新基金资助项目(CX2017031) 南京林业大学大学生创新训练计划项目(2018NFUSPITP452) 汕尾市省级科技创新战略专项资金资助项目(2018D2002)。

关键词转录组分析 RNA-SEQ 差异表达基因 transcriptome analysis RNA-Seq differentially expressed genes

分类号 TP398 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1LiZHANG Songcan CHEN Xuejun LIU.Detecting differential expression from RNA-seq data with expression measurement uncertainty[J].Frontiers of Computer Science,2015,9(4):652-663. 被引量：3

二级参考文献35

1Mortazavi A, Williams A, McCue K, Schaeffer L, Wold B. Map- ping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods, 2008, 5(7): 621-628.
2Marioni J, Mason C, Mane S, Stephens M, Gilad Y. RNA-seq: an as- sessment of technical reproducibility and comparison with gene ex- pression arrays. Genome Research, 2008, 18:1509-1517.
3Marguerat S, Bahler J. RNA-seq: from technology to biology. Cellular and Molecular Life Sciences, 2010, 67(4): 569-579.
4Rapaport F, Khanin R, Liang Y, Pirun M, Krek A, Zumbo P, Mason C E, Socci N D, Betel D. Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data. Genome Biology, 2013, 14(9): R95.
5Zhang Z H, Jbaveri D J, Marshall V M, Bauer D C, Edson J, Narayanan R K, Zhao Q. A comparative study of techniques for differential expres- sion analysis on RNA-Seq data. PLoS ONE, 2014, 9:e103207.
6Ozsolak F, Milos E RNA sequencing: advances, challenges and oppor- tunities. Nature Reviews Genetics, 2011, 12(2): 87-98.
7Soneson C, Delorenzi M. A comparison of methods for differential ex- pression analysis of RNA-seq data. BMC Bioinformatics, 2013, 14(1): 9.
8Kvam V, Lu P, Si Y. A comparison of statistical methods for detecting differentially expressed genes from Rna-Seq data. American Journal of Botany, 2012, 99(2): 248-256.
9Seyednasrollah F, Laiho A, Elo L L. Comparison of software packages for detecting differential expression in RNA-seq studies. Briefings in bioinformatics, 2013, bbt086.
10Anders S, McCarthy D J, Chen Y, Okoniewski M, Smyth G K, Hu- ber W, Robinson M D. Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nature Protocols, 2013, 8(9): 1765-1786.

共引文献2

1张礼,刘学军,陈松灿.基于多样本RNA-Seq数据的表达水平估计方法[J].计算机科学与探索,2016,10(2):210-219. 被引量：1
2Yuan LI,Yuhai ZHAO,Guoren WANG,Xiaofeng ZHU,Xiang ZHANG,Zhanghui WANG,Jun PANG.Finding susceptible and protective interaction patterns in large-scale genetic association study[J].Frontiers of Computer Science,2017,11(3):541-554.

1杨雅娟.大数据背景下电子商务平台创新模式探析[J].美化生活,2021(9):220-221.
2李婷.大数据赋能教育管理[J].数据,2019(3):57-59.
3许莹.保持优势持续创新[J].现代制造,2021(18):1-1.
4王春霞,王晶,曹子健,胡宝,许子洁,陈立群.玉米根毛单细胞类型转录组分析[J].江苏农业科学,2022,50(3):49-58.
5邓君令.碳中和背景下基于BP神经网络的电费成本管理[J].商业会计,2021(12):82-86.
6杨海柠.基于机器学习的图像识别技术与应用探析[J].中国宽带,2022(1):77-78. 被引量：1
7刘艳平.小学信息技术微课教学的实践研究[J].新作文（教研）,2022(1):0113-0114.
8薛英.BIM技术在建筑工程造价管理中的应用探讨[J].科技视界,2022(1):117-118. 被引量：8
9王建勋,康凤,孙玲,颜霞,黄丽丽.杨凌糖丝菌Hhs.015拮抗苹果树腐烂病菌转录组分析及抗菌机制探究[J].西北农业学报,2022,31(1):105-116. 被引量：1
10邓涛.很可能领先的半步双座型歼20与下一代空战系统[J].航空知识,2022(1):34-37.

江苏科技大学学报（自然科学版）

2021年第6期

浏览历史

内容加载中请稍等...

RobustDEA:一种快速鲁棒的RNA-Seq数据寻找差异表达基因方法

参考文献1

二级参考文献35

共引文献2

相关作者

相关机构

相关主题

浏览历史