Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric ...Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric Ozone Assessment Report(TOAR)created,contains one of the world's largest collections of near-surface air quality measurements and considers FAIR data principles as an integral part.A special feature of our data service is the on-demand processing and product generation of several air quality metrics directly from the underlying database.In this paper,we show that the necessary data harmonization for establishing such online analysis services goes much deeper than the obvious issues of common data formats,variable names,and measurement units,and we explore how the generation of FAIR Digital Objects(FDO)in combination with automatically generateddocumentation may support Canonical Analysis Workflows for airquality and related data.展开更多
文摘Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric Ozone Assessment Report(TOAR)created,contains one of the world's largest collections of near-surface air quality measurements and considers FAIR data principles as an integral part.A special feature of our data service is the on-demand processing and product generation of several air quality metrics directly from the underlying database.In this paper,we show that the necessary data harmonization for establishing such online analysis services goes much deeper than the obvious issues of common data formats,variable names,and measurement units,and we explore how the generation of FAIR Digital Objects(FDO)in combination with automatically generateddocumentation may support Canonical Analysis Workflows for airquality and related data.