期刊文献+

SeqSQC: A Bioconductor Package for Evaluating the Sample Quality of Next-generation Sequencing Data

SeqSQC: A Bioconductor Package for Evaluating the Sample Quality of Next-generation Sequencing Data
原文传递
导出
摘要 As next-generation sequencing (NGS) technology has become widely used to identify genetic causal variants for various diseases and traits,a number of packages for checking NGS data quality have sprung up in public domains. In addition to the quality of sequencing data,sample quality issues,such as gender mismatch,abnormal inbreeding coefficient,cryptic relatedness,and population outliers,can also have fundamental impact on downstream analysis. However,there is a lack of tools specialized in identifying problematic samples from NGS data,often due to the limitation of sample size and variant counts. We developed SeqSQC,a Bioconductor package,to automate and accelerate sample cleaning in NGS data of any scale. SeqSQC is designed for efficient data storage and access,and equipped with interactive plots for intuitive data visualization to expedite the identification of problematic samples. SeqSQC is available at http://bioconductor. org/packages/SeqSQC. As next-generation sequencing(NGS) technology has become widely used to identify genetic causal variants for various diseases and traits, a number of packages for checking NGS data quality have sprung up in public domains. In addition to the quality of sequencing data, sample quality issues, such as gender mismatch, abnormal inbreeding coefficient, cryptic relatedness, and population outliers, can also have fundamental impact on downstream analysis. However, there is a lack of tools specialized in identifying problematic samples from NGS data, often due to the limitation of sample size and variant counts. We developed SeqSQC, a Bioconductor package, to automate and accelerate sample cleaning in NGS data of any scale. SeqSQC is designed for efficient data storage and access, and equipped with interactive plots for intuitive data visualization to expedite the identification of problematic samples. SeqSQC is available at http://bioconductor.org/packages/SeqSQC.
出处 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2019年第2期211-218,共8页 基因组蛋白质组与生物信息学报(英文版)
基金 supported by the National Cancer Institute (NCI), the National Institutes of Health (NIH), USA (Grant Nos. CA162218 awarded to SL and HZ, CA105274 awarded to LHK, and CA195565 awarded to LHK and CBA) supported by the NCI (Grant No. P30CA016056 awarded to Roswell Park Comprehensive Cancer Center involving the use of DBBR, Genomic, Bioinformatics, and Biostatistics Shared Resources) supported by the Breast Cancer Research Foundation, USA
关键词 Next-generation SEQUENCING QUALITY assessment 1000 GENOMES Project Whole-exome SEQUENCING BIOCONDUCTOR PACKAGE Next-generation sequencing Quality assessment 1000 Genomes Project Whole-exome sequencing Bioconductor package
分类号 Q [生物学]
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部