One idea of the Canonical Workflow Framework for Research(CWFR)is to improve the reusability and automation in research.In this paper,we aim to deliver a concrete view on the application of CWFRs to a use case of the ...One idea of the Canonical Workflow Framework for Research(CWFR)is to improve the reusability and automation in research.In this paper,we aim to deliver a concrete view on the application of CWFRs to a use case of the arts and humanities to enrich further discussions on the practical realization of canonical workflows and the benefits that come with it.This use case involves context dependent data transformation and feature extraction,ingests into multiple repositories as well as a"human-in-the-loop"workflow step,which introduces a certain complexity into the mapping to a canonical workflow.展开更多
InCanonicalWorkflowFramework forResearch(CWFR)"packages"arerelevantin twodifferentdirections.In data science,workflows are in general being executed on a set of files which have been aggregated for specific ...InCanonicalWorkflowFramework forResearch(CWFR)"packages"arerelevantin twodifferentdirections.In data science,workflows are in general being executed on a set of files which have been aggregated for specific purposes,such as for training a model in deep learning.We call this type of"package"a data collection and its aggregation and metadata description is motivated by research interests.The other type of"packages"relevant for CWFR are supposed to represent workflows in a self-describing and self-contained way for later execution.In this paper,we will review different packaging technologies and investigate their usability in the context of CWFR.For this purpose,we draw on an exemplary use case and show how packaging technologies can support its realization.We conclude that packaging technologies of different flavors help on providing inputs and outputs for workflow steps in a machine-readable way,as well as on representing a workflow and all its artifacts in a self-describing and self-contained way.展开更多
文摘One idea of the Canonical Workflow Framework for Research(CWFR)is to improve the reusability and automation in research.In this paper,we aim to deliver a concrete view on the application of CWFRs to a use case of the arts and humanities to enrich further discussions on the practical realization of canonical workflows and the benefits that come with it.This use case involves context dependent data transformation and feature extraction,ingests into multiple repositories as well as a"human-in-the-loop"workflow step,which introduces a certain complexity into the mapping to a canonical workflow.
文摘InCanonicalWorkflowFramework forResearch(CWFR)"packages"arerelevantin twodifferentdirections.In data science,workflows are in general being executed on a set of files which have been aggregated for specific purposes,such as for training a model in deep learning.We call this type of"package"a data collection and its aggregation and metadata description is motivated by research interests.The other type of"packages"relevant for CWFR are supposed to represent workflows in a self-describing and self-contained way for later execution.In this paper,we will review different packaging technologies and investigate their usability in the context of CWFR.For this purpose,we draw on an exemplary use case and show how packaging technologies can support its realization.We conclude that packaging technologies of different flavors help on providing inputs and outputs for workflow steps in a machine-readable way,as well as on representing a workflow and all its artifacts in a self-describing and self-contained way.