Examining the practical limits of batch effect-correction algorithms:When should you care about batch effects? 被引量：1

Examining the practical limits of batch effect-correction algorithms:When should you care about batch effects?

导出

摘要 Batch effects are technical sources of variation and can confound analysis.While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm(BECA),we hold the viewpoint that the notion of best is context-dependent.Moreover,alternative questions beyond the simplistic notion of "best" are also interesting:are BECAs robust against various degrees of confounding and if so,what is the limit?Using two different methods for simulating class(phenotype) and batch effects and taking various representative datasets across both genomics(RNA-Seq) and proteomics platforms,we demonstrate that under situations where sample classes and batch factors are moderately confounded,most BECAs are remarkably robust and only weakly affected by upstream normalization procedures.This observation is consistently supported across the multitude of test datasets.BECAs do have limits:When sample classes and batch factors are strongly confounded,BECA performance declines,with variable performance in precision,recall and also batch correction.We also report that while conventional normalization methods have minimal impact on batch effect correction,they do not affect downstream statistical feature selection,and in strongly confounded scenarios,may even outperform BECAs.In other words,removing batch effects is no guarantee of optimal functional analysis.Overall,this study suggests that simplistic performance ranking exercises are quite trivial,and all BECAs are compromises in some context or another. Batch effects are technical sources of variation and can confound analysis.While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm(BECA),we hold the viewpoint that the notion of best is context-dependent.Moreover,alternative questions beyond the simplistic notion of "best" are also interesting:are BECAs robust against various degrees of confounding and if so,what is the limit?Using two different methods for simulating class(phenotype) and batch effects and taking various representative datasets across both genomics(RNA-Seq) and proteomics platforms,we demonstrate that under situations where sample classes and batch factors are moderately confounded,most BECAs are remarkably robust and only weakly affected by upstream normalization procedures.This observation is consistently supported across the multitude of test datasets.BECAs do have limits:When sample classes and batch factors are strongly confounded,BECA performance declines,with variable performance in precision,recall and also batch correction.We also report that while conventional normalization methods have minimal impact on batch effect correction,they do not affect downstream statistical feature selection,and in strongly confounded scenarios,may even outperform BECAs.In other words,removing batch effects is no guarantee of optimal functional analysis.Overall,this study suggests that simplistic performance ranking exercises are quite trivial,and all BECAs are compromises in some context or another.

作者 Longjian Zhou Andrew Chi-Hau Sue Wilson Wen Bin Goh

机构地区 School of Pharmaceutical Science and Technology School of Biological Sciences

出处《Journal of Genetics and Genomics》 SCIE CAS CSCD 2019年第9期433-443,共11页 遗传学报（英文版）

基金 support from the National Research Foundation of Singapore NRF-NSFC(Grant No.NRF2018NRF-NSFC003SB-006)

关键词 BATCH effects BIOINFORMATICS Feature selection NORMALIZATION STATISTICS Batch effects Bioinformatics Feature selection Normalization Statistics

分类号 O17 [理学—基础数学]

引文网络
相关文献

同被引文献1

1李飒,赵毅强.基因表达数据批次效应去除方法的研究进展[J].南京农业大学学报,2019,42(3):389-397. 被引量：1

引证文献1

1刘淏晟,张博文.转录组分析中批次效应的检测与矫正[J].北京师范大学学报（自然科学版）,2023,59(4):564-574.

1Jacob E. Wulff,Matthew W. Mitchell.A Comparison of Various Normalization Methods for LC/MS Metabolomics Data[J].Advances in Bioscience and Biotechnology,2018,9(8):339-351.
2Wenbin Liu.Analysis of environmental factors about cerebral stroke[J].Health,2013,5(12):1946-1948. 被引量：3
3I. A. Sadkovsky,O. Golubnitschaja,M. A. Mandrik,M. A. Studneva,H. Abe,H. Schroeder,E. N. Antonova,F. Betsou,T. A. Bodrova,K. Payne,S. V. Suchkov.PPPM (Predictive, Preventive and Personalized Medicine) as a New Model of the National and International Healthcare Services and Thus a Promising Strategy to Prevent a Disease: From Basics to Practice[J].International Journal of Clinical Medicine,2014,5(14):855-870.
4Adalgisa Ieda Maiworm,Milena B.Monteiro,Sebastiao D.Santos-Filho,Agnaldo J.Lopes,Leandro Azeredo,Sotiris Missailidis,Pedro J.Marin,Mario Bernardo-Filho.Cystic fibrosis and the relevance of the whole-body vibration exercises in oscillating platforms: a short review[J].Health,2011,3(10):656-662.
5Yuriy V. Voronenko,Ozar P. Mintser,Dmytro D. Ivanov,Larysa Yu. Babintseva.Objective Assessment in Continual Medical Education (CME) Medical Objective Assessment in System Control[J].Journal of Integrative Medicine（双语）,2018,7(2):7-11.
6Jihui Tu,Bin Yang.A New Correction Algorithm of the Eccentric Ultrasonovision Time Image in the Casing Hole[J].Applied Mathematics,2014,5(10):1427-1431.
7Hans-Kristian Knutson,Anders Holmqvist,Niklas Andersson,Bernt Nilsson.Robust Multi-Objective Optimization of Chromatographic Rare Earth Element Separation[J].Advances in Chemical Engineering and Science,2017,7(4):477-493. 被引量：1
8Gyu Jin Heo,So Young Nam,Soo-Kyung Lee.Factors associated with obesity among Korean adolescents[J].Health,2013,5(8):1328-1334. 被引量：1
9Mohamed Gamal El-Ziney.Molecular and Probiotic Characterizations of <i>Lactobacillus reuteri</i>DSM 12246 and Impact of pH on Biomass and Metabolic Profile in Batch-Culture[J].Advances in Microbiology,2018,8(1):18-30.
10Ian Humphery-Smith.Importance of Bacteriophage in Combating Hospital-Acquired Infection (HAI)[J].Pharmacology & Pharmacy,2014,5(13):1192-1201.

Journal of Genetics and Genomics

2019年第9期

浏览历史

内容加载中请稍等...

Examining the practical limits of batch effect-correction algorithms:When should you care about batch effects? 被引量：1

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史