Funding: Supported by the National Natural Science Foundation of China (32270677).
Abstract: The data output from microbiome research is growing at an accelerating rate, yet mining the data quickly and efficiently remains difficult. An effective data structure for representing and managing the data is still lacking, as are flexible and composable analysis methods. In response to these two issues, we designed and developed the MicrobiotaProcess package. It provides a comprehensive data structure, MPSE, that integrates the primary and intermediate data, which improves the integration and exploration of the downstream data. Around this data structure, the downstream analysis tasks are decomposed, and a set of functions is designed under a tidy framework. Each function independently performs a simple task, and the functions can be combined to perform complex tasks.
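The design idea in this abstract, one container that holds both primary and intermediate data plus small functions that compose, can be illustrated with a minimal Python sketch. It is not the MicrobiotaProcess API (the package is written in R). The `Experiment` class, the `relative_abundance`, `richness` and `pipe` functions, and the toy counts are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass, field
from functools import reduce


@dataclass
class Experiment:
    """Hypothetical container: primary counts and intermediate results together."""
    counts: dict                                   # sample -> {taxon: count}
    results: dict = field(default_factory=dict)    # name -> derived data


def relative_abundance(exp):
    """Simple task 1: convert each sample's counts to proportions."""
    rel = {
        sample: {taxon: c / sum(taxa.values()) for taxon, c in taxa.items()}
        for sample, taxa in exp.counts.items()
    }
    return Experiment(exp.counts, {**exp.results, "rel_abundance": rel})


def richness(exp):
    """Simple task 2: count the taxa observed (count > 0) in each sample."""
    rich = {
        sample: sum(1 for c in taxa.values() if c > 0)
        for sample, taxa in exp.counts.items()
    }
    return Experiment(exp.counts, {**exp.results, "richness": rich})


def pipe(exp, *steps):
    """Apply the steps left to right, threading the container through each one."""
    return reduce(lambda acc, step: step(acc), steps, exp)


# Toy data, invented for this sketch.
exp = Experiment({"s1": {"A": 3, "B": 1, "C": 0}, "s2": {"A": 0, "B": 5, "C": 5}})
out = pipe(exp, relative_abundance, richness)
print(out.results["richness"])
```

Because every step takes the container and returns a new one with one more result attached, steps can be reordered or dropped without touching the others, which is the composability the abstract describes.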
Funding: This work was supported by a startup fund from Southern Medical University.
Abstract: Functional enrichment analysis is pivotal for interpreting high-throughput omics data in life science. It is crucial for this type of tool to use the latest annotation databases for as many organisms as possible. To meet these requirements, we present an updated version of our popular Bioconductor package, clusterProfiler 4.0. This package has been enhanced considerably compared with its original version published 9 years ago. The new version provides a universal interface for functional enrichment analysis in thousands of organisms, based on internally supported ontologies and pathways as well as annotation data provided by users or derived from online databases. It also extends the dplyr and ggplot2 packages to offer tidy interfaces for data operation and visualization. Other new features include gene set enrichment analysis and comparison of enrichment results from multiple gene lists. We anticipate that clusterProfiler 4.0 will be applied to a wide range of scenarios across diverse organisms.
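As a rough illustration of what a functional enrichment test computes, the sketch below implements over-representation analysis as a one-sided hypergeometric test, a common formulation of this kind of analysis. It is written in plain Python and does not use or reproduce the clusterProfiler interface. The function names `hypergeom_pvalue` and `enrich`, the gene identifiers and the gene sets are all hypothetical, and no multiple-testing correction is applied.

```python
from math import comb


def hypergeom_pvalue(k, M, n, N):
    """P(X >= k) when drawing N genes from M, of which n belong to the gene set."""
    total = comb(M, N)
    tail = sum(comb(n, i) * comb(M - n, N - i) for i in range(k, min(n, N) + 1))
    return tail / total


def enrich(query, gene_sets, background):
    """Return (set name, overlap size, p-value) rows sorted by p-value."""
    background = set(background)
    query = set(query) & background          # ignore genes outside the background
    rows = []
    for name, members in gene_sets.items():
        members = set(members) & background
        k = len(query & members)
        p = hypergeom_pvalue(k, len(background), len(members), len(query))
        rows.append((name, k, p))
    return sorted(rows, key=lambda row: row[2])


# Toy data, invented for this sketch.
background = [f"g{i}" for i in range(1, 21)]
gene_sets = {
    "setA": ["g1", "g2", "g3", "g4", "g5"],
    "setB": ["g11", "g12", "g13", "g14", "g15"],
}
rows = enrich(["g1", "g2", "g3", "g12"], gene_sets, background)
for name, overlap, p in rows:
    print(f"{name}\toverlap={overlap}\tp={p:.4f}")
```

In practice the raw p-values would be adjusted for multiple testing across all gene sets. Comparing results from several gene lists, as the abstract mentions, amounts to running the same test once per list.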