Metadata,data about other digital objects,play an important role in FAIR with a direct relation to all FAIR principles.In this paper we present and discuss the FAIR Data Point(FDP),a software architecture aiming to de...Metadata,data about other digital objects,play an important role in FAIR with a direct relation to all FAIR principles.In this paper we present and discuss the FAIR Data Point(FDP),a software architecture aiming to define a common approach to publish semantically-rich and machine-actionable metadata according to the FAIR principles.We present the core components and features of the FDP,its approach to metadata provision,the criteria to evaluate whether an application adheres to the FDP specifications and the service to register,index and allow users to search for metadata content of available FDPs.展开更多
While the FAIR Principles do not specify a technical solution for'FAIRness',it was clear from the outset of the FAIR initiative that it would be useful to have commodity software and tooling that would simplif...While the FAIR Principles do not specify a technical solution for'FAIRness',it was clear from the outset of the FAIR initiative that it would be useful to have commodity software and tooling that would simplify the creation of FAIR-compliant resources.The FAIR Data Point is a metadata repository that follows the DCAT(2)schema,and utilizes the Linked Data Platform to manage the hierarchical metadata layers as LDP Containers.There has been a recent flurry of development activity around the FAIR Data Point that has significantly improved its power and ease-of-use.Here we describe five specific tools—an installer,a loader,two Webbased interfaces,and an indexer-aimed at maximizing the uptake and utility of the FAIR Data Point.展开更多
The FAIR guiding principles aim to enhance the Findability,Accessibility,Interoperability and Reusability of digital resources such as data,for both humans and machines.The process of making data FAIR(“FAIRification...The FAIR guiding principles aim to enhance the Findability,Accessibility,Interoperability and Reusability of digital resources such as data,for both humans and machines.The process of making data FAIR(“FAIRification”)can be described in multiple steps.In this paper,we describe a generic step-by-step FAIRification workflow to be performed in a multidisciplinary team guided by FAIR data stewards.The FAIRification workflow should be applicable to any type of data and has been developed and used for“Bring Your Own Data”(BYOD)workshops,as well as for the FAIRification of e.g.,rare diseases resources.The steps are:1)identify the FAIRification objective,2)analyze data,3)analyze metadata,4)define semantic model for data(4a)and metadata(4b),5)make data(5a)and metadata(5b)linkable,6)host FAIR data,and 7)assess FAIR data.For each step we describe how the data are processed,what expertise is required,which procedures and tools can be used,and which FAIR principles they relate to.展开更多
The FAIR principles,an acronym for Findable,Accessible,Interoperable and Reusable,are recognised worldwide as key elements for good practice in all data management processes.To understand how the Brazilian scientific ...The FAIR principles,an acronym for Findable,Accessible,Interoperable and Reusable,are recognised worldwide as key elements for good practice in all data management processes.To understand how the Brazilian scientific community is adhering to these principles,this article reports Brazilian adherence to the GO FAIR initiative through the creation of the GO FAIR Brazil Office and the manner in which they create their implementation networks.To contextualise this understanding,we provide a brief presentation of open data policies in Brazilian research and government,and finally,we describe a model that has been adopted for the GO FAIR Brazil implementation networks.The Brazilian Institute of Information in Science and Technology is responsible for the GO FAIR Brazil Office,which operates in all fields of knowledge and supports thematic implementation networks.Today,GO FAIR Brazil-Health is the first active implementation network in operation,which works in all health domains,serving as a model for other fields like agriculture,nuclear energy,and digital humanities,which are in the process of adherence negotiation.This report demonstrates the strong interest and effort from the Brazilian scientific communities in implementing the FAIR principles in their research data management practices.展开更多
In recent years,as newer technologies have evolved around the healthcare ecosystem,more and more data have been generated.Advanced analytics could power the data collected from numerous sources,both from healthcare in...In recent years,as newer technologies have evolved around the healthcare ecosystem,more and more data have been generated.Advanced analytics could power the data collected from numerous sources,both from healthcare institutions,or generated by individuals themselves via apps and devices,and lead to innovations in treatment and diagnosis of diseases;improve the care given to the patient;and empower citizens to participate in the decision-making process regarding their own health and well-being.However,the sensitive nature of the health data prohibits healthcare organizations from sharing the data.The Personal Health Train(PHT)is a novel approach,aiming to establish a distributed data analytics infrastructure enabling the(re)use of distributed healthcare data,while data owners stay in control of their own data.The main principle of the PHT is that data remain in their original location,and analytical tasks visit data sources and execute the tasks.The PHT provides a distributed,flexible approach to use data in a network of participants,incorporating the FAIR principles.It facilitates the responsible use of sensitive and/or personal data by adopting international principles and regulations.This paper presents the concepts and main components of the PHT and demonstrates how it complies with FAIR principles.展开更多
In recent years,implementations enabling Distributed Analytics(DA)have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data.Th...In recent years,implementations enabling Distributed Analytics(DA)have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data.These concepts propose privacy-enhancing alternatives to data centralisation approaches,which have restricted applicability in case of sensitive data due to ethical,legal or social aspects.Nevertheless,the immanent problem of DA-enabling architectures is the black-box-alike behaviour of the highly distributed components originating from the lack of semantically enriched descriptions,particularly the absence of basic metadata for data sets or analysis tasks.To approach the mentioned problems,we propose a metadata schema for DA infrastructures,which provides a vocabulary to enrich the involved entities with descriptive semantics.We initially perform a requirement analysis with domain experts to reveal necessary metadata items,which represents the foundation of our schema.Afterwards,we transform the obtained domain expert knowledge into user stories and derive the most significant semantic content.In the final step,we enable machine-readability via RDF(S)and SHACL serialisations.We deploy our schema in a proof-of-concept monitoring dashboard to validate its contribution to the transparency of DA architectures.Additionally,we evaluate the schema’s compliance with the FAIR principles.The evaluation shows that the schema succeeds in increasing transparency while being compliant with most of the FAIR principles.Because a common metadata model is critical for enhancing the compatibility between multiple DA infrastructures,our work lowers data access and analysis barriers.It represents an initial and infrastructure-independent foundation for the FAIRification of DA and the underlying scientific data management.展开更多
The FAIR principles have been widely cited,endorsed and adopted by a broad range of stakeholders since their publication in 2016.By intention,the 15 FAIR guiding principles do not dictate specific technological implem...The FAIR principles have been widely cited,endorsed and adopted by a broad range of stakeholders since their publication in 2016.By intention,the 15 FAIR guiding principles do not dictate specific technological implementations,but provide guidance for improving Findability,Accessibility,Interoperability and Reusability of digital resources.This has likely contributed to the broad adoption of the FAIR principles,because individual stakeholder communities can implement their own FAIR solutions.However,it has also resulted in inconsistent interpretations that carry the risk of leading to incompatible implementations.Thus,while the FAIR principles are formulated on a high level and may be interpreted and implemented in different ways,for true interoperability we need to support convergence in implementation choices that are widely accessible and(re)-usable.We introduce the concept of FAIR implementation considerations to assist accelerated global participation and convergence towards accessible,robust,widespread and consistent FAIR implementations.Any self-identified stakeholder community may either choose to reuse solutions from existing implementations,or when they spot a gap,accept the challenge to create the needed solution,which,ideally,can be used again by other communities in the future.Here,we provide interpretations and implementation considerations(choices and challenges)for each FAIR principle.展开更多
In order to provide responsible access to health data by reconciling benefits of data sharing with privacy rights and ethical and regulatory requirements,Findable,Accessible,Interoperable and Reusable(FAIR)metadata sh...In order to provide responsible access to health data by reconciling benefits of data sharing with privacy rights and ethical and regulatory requirements,Findable,Accessible,Interoperable and Reusable(FAIR)metadata should be developed.According to the H2020 Program Guidelines on FAIR Data,data should be"as open as possible and as closed as necessary","open"in order to foster the reusability and to accelerate research,but at the same time they should be"closed"to safeguard the privacy of the subjects.Additional provisions on the protection of natural persons with regard to the processing of personal data have been endorsed by the European General Data Protection Regulation(GDPR),Reg(EU)2016/679,that came into force in May 2018.This work aims to solve accessibility problems related to the protection of personal data in the digital era and to achieve a responsible access to and responsible use of health data.We strongly suggest associating each data set with FAIR metadata describing both the type of data collected and the accessibility conditions by considering data protection obligations and ethical and regulatory requirements.Finally,an existing FAIR infrastructure component has been used as an example to explain how FAIR metadata could facilitate data sharing while ensuring protection of individuals.展开更多
Since their publication in 2016 we have seen a rapid adoption of the FAIR principles in many scientific disciplines where the inherent value of research data and,therefore,the importance of good data management and da...Since their publication in 2016 we have seen a rapid adoption of the FAIR principles in many scientific disciplines where the inherent value of research data and,therefore,the importance of good data management and data stewardship,is recognized.This has led to many communities asking“What is FAIR?”and“How FAIR are we currently?”,questions which were addressed respectively by a publication revisiting the principles and the emergence of FAIR metrics.However,early adopters of the FAIR principles have already run into the next question:“How can we become(more)FAIR?”This question is more difficult to answer,as the principles do not prescribe any specific standard or implementation.Moreover,there does not yet exist a mature ecosystem of tools,platforms and standards to support human and machine agents to manage,produce,publish and consume FAIR data in a user-friendly and efficient(i.e.,“easy”)way.In this paper we will show,however,that there are already many emerging examples of FAIR tools under development.This paper puts forward the position that we are likely already in a creolization phase where FAIR tools and technologies are merging and combining,before converging in a subsequent phase to solutions that make FAIR feasible in daily practice.展开更多
The industry sector is a very large producer and consumer of data,and many companies traditionally focused on production or manufacturing are now relying on the analysis of large amounts of data to develop new product...The industry sector is a very large producer and consumer of data,and many companies traditionally focused on production or manufacturing are now relying on the analysis of large amounts of data to develop new products and services.As many of the data sources needed are distributed and outside the company,FAIR data will have a major impact,both by reducing the existing internal data silos and by enabling the efficient integration with external(public and commercial)data.Many companies are still in the early phases of internal data”FAIRification”,providing opportunities for SMEs and academics to apply and develop their expertise on FAIR data in collaborations and public-private partnerships.For a global Internet of FAIR Data&Services to thrive,also involving industry,professional tools and services are essential.FAIR metrics and certifications on individuals,data,organizations,and software,must ensure that data producers and consumers have independent quality metrics on their data.In this opinion article we reflect on some industry specific challenges of FAIR implementation to be dealt with when choices are made regarding”Industry GOing FAIR”.展开更多
文摘Metadata,data about other digital objects,play an important role in FAIR with a direct relation to all FAIR principles.In this paper we present and discuss the FAIR Data Point(FDP),a software architecture aiming to define a common approach to publish semantically-rich and machine-actionable metadata according to the FAIR principles.We present the core components and features of the FDP,its approach to metadata provision,the criteria to evaluate whether an application adheres to the FDP specifications and the service to register,index and allow users to search for metadata content of available FDPs.
基金supported by Czech Technical University in Prague grant No.SGS20/209/OHK3/3T/18.LOBSS,RK and KB are partially funded by funding from the Horizon2020 projects FAIRsFAIR grant No.831558.
文摘While the FAIR Principles do not specify a technical solution for'FAIRness',it was clear from the outset of the FAIR initiative that it would be useful to have commodity software and tooling that would simplify the creation of FAIR-compliant resources.The FAIR Data Point is a metadata repository that follows the DCAT(2)schema,and utilizes the Linked Data Platform to manage the hierarchical metadata layers as LDP Containers.There has been a recent flurry of development activity around the FAIR Data Point that has significantly improved its power and ease-of-use.Here we describe five specific tools—an installer,a loader,two Webbased interfaces,and an indexer-aimed at maximizing the uptake and utility of the FAIR Data Point.
基金The work of A.Jacobsen,R.Kaliyaperumal,M.Roos and M.Thompson is supported by funding from the European Union’s Horizon 2020 research and innovation program under the EJP RD COFUND-EJP N°825575The work of A.Jacobsen,R.Kaliyaperumal,M.Roos and M.Thompson is supported by funding from ELIXIR EXCELERATE,H2020 grant agreement number 676559.M.Roos and M.Thompson received funding from NWO(VWData 400.17.605)H2020-EU 824087.The work of B.Mons and L.O.Bonino da Silva Santos is funded by the H2020-EU 824068 and the GO FAIR ISCO grant of the Dutch Ministry of Science and Culture.
文摘The FAIR guiding principles aim to enhance the Findability,Accessibility,Interoperability and Reusability of digital resources such as data,for both humans and machines.The process of making data FAIR(“FAIRification”)can be described in multiple steps.In this paper,we describe a generic step-by-step FAIRification workflow to be performed in a multidisciplinary team guided by FAIR data stewards.The FAIRification workflow should be applicable to any type of data and has been developed and used for“Bring Your Own Data”(BYOD)workshops,as well as for the FAIRification of e.g.,rare diseases resources.The steps are:1)identify the FAIRification objective,2)analyze data,3)analyze metadata,4)define semantic model for data(4a)and metadata(4b),5)make data(5a)and metadata(5b)linkable,6)host FAIR data,and 7)assess FAIR data.For each step we describe how the data are processed,what expertise is required,which procedures and tools can be used,and which FAIR principles they relate to.
文摘The FAIR principles,an acronym for Findable,Accessible,Interoperable and Reusable,are recognised worldwide as key elements for good practice in all data management processes.To understand how the Brazilian scientific community is adhering to these principles,this article reports Brazilian adherence to the GO FAIR initiative through the creation of the GO FAIR Brazil Office and the manner in which they create their implementation networks.To contextualise this understanding,we provide a brief presentation of open data policies in Brazilian research and government,and finally,we describe a model that has been adopted for the GO FAIR Brazil implementation networks.The Brazilian Institute of Information in Science and Technology is responsible for the GO FAIR Brazil Office,which operates in all fields of knowledge and supports thematic implementation networks.Today,GO FAIR Brazil-Health is the first active implementation network in operation,which works in all health domains,serving as a model for other fields like agriculture,nuclear energy,and digital humanities,which are in the process of adherence negotiation.This report demonstrates the strong interest and effort from the Brazilian scientific communities in implementing the FAIR principles in their research data management practices.
文摘In recent years,as newer technologies have evolved around the healthcare ecosystem,more and more data have been generated.Advanced analytics could power the data collected from numerous sources,both from healthcare institutions,or generated by individuals themselves via apps and devices,and lead to innovations in treatment and diagnosis of diseases;improve the care given to the patient;and empower citizens to participate in the decision-making process regarding their own health and well-being.However,the sensitive nature of the health data prohibits healthcare organizations from sharing the data.The Personal Health Train(PHT)is a novel approach,aiming to establish a distributed data analytics infrastructure enabling the(re)use of distributed healthcare data,while data owners stay in control of their own data.The main principle of the PHT is that data remain in their original location,and analytical tasks visit data sources and execute the tasks.The PHT provides a distributed,flexible approach to use data in a network of participants,incorporating the FAIR principles.It facilitates the responsible use of sensitive and/or personal data by adopting international principles and regulations.This paper presents the concepts and main components of the PHT and demonstrates how it complies with FAIR principles.
基金this work was supported by the German Ministry for Research and Education(BMBF)as part of the SMITH consortium(SW,LN,YUY,SD and OB,grant no.01ZZ1803K)
文摘In recent years,implementations enabling Distributed Analytics(DA)have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data.These concepts propose privacy-enhancing alternatives to data centralisation approaches,which have restricted applicability in case of sensitive data due to ethical,legal or social aspects.Nevertheless,the immanent problem of DA-enabling architectures is the black-box-alike behaviour of the highly distributed components originating from the lack of semantically enriched descriptions,particularly the absence of basic metadata for data sets or analysis tasks.To approach the mentioned problems,we propose a metadata schema for DA infrastructures,which provides a vocabulary to enrich the involved entities with descriptive semantics.We initially perform a requirement analysis with domain experts to reveal necessary metadata items,which represents the foundation of our schema.Afterwards,we transform the obtained domain expert knowledge into user stories and derive the most significant semantic content.In the final step,we enable machine-readability via RDF(S)and SHACL serialisations.We deploy our schema in a proof-of-concept monitoring dashboard to validate its contribution to the transparency of DA architectures.Additionally,we evaluate the schema’s compliance with the FAIR principles.The evaluation shows that the schema succeeds in increasing transparency while being compliant with most of the FAIR principles.Because a common metadata model is critical for enhancing the compatibility between multiple DA infrastructures,our work lowers data access and analysis barriers.It represents an initial and infrastructure-independent foundation for the FAIRification of DA and the underlying scientific data management.
基金The work of A.Jacobsen,C.Evelo,M.Thompson,R.Cornet,R.Kaliyaperuma and M.Roos is supported by funding from the European Union’s Horizon 2020 research and innovation program under the EJP RD COFUND-EJP N°825575.The work of A.Jacobsen,C.Evelo,C.Goble,M.Thompson,N.Juty,R.Hooft,M.Roos,S-A.Sansone,P.McQuilton,P.Rocca-Serra and D.Batista is supported by funding from ELIXIR EXCELERATE,H2020 grant agreement number 676559.R.Hooft was further funded by NL NWO NRGWI.obrug.2018.009.N.Juty and C.Goble were funded by CORBEL(H2020 grant agreement 654248)N.Juty,C.Goble,S-A.Sansone,P.McQuilton,P.Rocca-Serra and D.Batista were funded by FAIRplus(IMI grant agreement 802750)+13 种基金N.Juty,C.Goble,M.Thompson,M.Roos,S-A.Sansone,P.McQuilton,P.Rocca-Serra and D.Batista were funded by EOSClife H2020-EU(grant agreement number 824087)C.Goble was funded by DMMCore(BBSRC BB/M013189/)M.Thompson,M.Roos received funding from NWO(VWData 400.17.605)S-A.Sansone,P.McQuilton,P.Rocca-Serra and D.Batista have been funded by grants awarded to S-A.Sansone from the UK BBSRC and Research Councils(BB/L024101/1,BB/L005069/1)EU(H2020-EU 634107H2020-EU 654241,IMI(IMPRiND 116060)NIH Data Common Fund,and from the Wellcome Trust(ISA-InterMine 212930/Z/18/ZFAIRsharing 208381/A/17/Z)The work of A.Waagmeester has been funded by grant award number GM089820 from the National Institutes of Health.M.Kersloot was funded by the European Regional Development Fund(KVW-00163).The work of N.Meyers was funded by the National Science Foundation(OAC 1839030)The work of M.D.Wilkinson is funded by Isaac Peral/Marie Curie cofund with the Universidad Politecnica de Madrid and the Ministerio de Economia y Competitividad grant number TIN2014-55993-RMThe work of B.Magagna,E.Schultes,L.da Silva Santos and K.Jeffery is funded by the H2020-EU 824068The work of B.Magagna,E.Schultes and L.da Silva Santos is funded by the GO FAIR ISCO grant of the Dutch Ministry of Science and CultureThe work of G.Guizzardi is supported by the OCEAN Project(FUB).M.Courtot received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No.802750.R.Cornet was further funded by FAIR4Health(H2020-EU grant agreement number 824666)K.Jeffery received funding from EPOS-IP H2020-EU agreement 676564 and ENVRIplus H2020-EU agreement 654182.
文摘The FAIR principles have been widely cited,endorsed and adopted by a broad range of stakeholders since their publication in 2016.By intention,the 15 FAIR guiding principles do not dictate specific technological implementations,but provide guidance for improving Findability,Accessibility,Interoperability and Reusability of digital resources.This has likely contributed to the broad adoption of the FAIR principles,because individual stakeholder communities can implement their own FAIR solutions.However,it has also resulted in inconsistent interpretations that carry the risk of leading to incompatible implementations.Thus,while the FAIR principles are formulated on a high level and may be interpreted and implemented in different ways,for true interoperability we need to support convergence in implementation choices that are widely accessible and(re)-usable.We introduce the concept of FAIR implementation considerations to assist accelerated global participation and convergence towards accessible,robust,widespread and consistent FAIR implementations.Any self-identified stakeholder community may either choose to reuse solutions from existing implementations,or when they spot a gap,accept the challenge to create the needed solution,which,ideally,can be used again by other communities in the future.Here,we provide interpretations and implementation considerations(choices and challenges)for each FAIR principle.
基金Part of this work is funded by the NWA program(project VWData-400.17.605)by the Netherlands Organization for Scientific Research(NWO)by the European Joint Program Rare Diseases(grant agreement#825575)and ELIXIR-EXCELERATE(H2020-INFRADEV-1-2015-12).
文摘In order to provide responsible access to health data by reconciling benefits of data sharing with privacy rights and ethical and regulatory requirements,Findable,Accessible,Interoperable and Reusable(FAIR)metadata should be developed.According to the H2020 Program Guidelines on FAIR Data,data should be"as open as possible and as closed as necessary","open"in order to foster the reusability and to accelerate research,but at the same time they should be"closed"to safeguard the privacy of the subjects.Additional provisions on the protection of natural persons with regard to the processing of personal data have been endorsed by the European General Data Protection Regulation(GDPR),Reg(EU)2016/679,that came into force in May 2018.This work aims to solve accessibility problems related to the protection of personal data in the digital era and to achieve a responsible access to and responsible use of health data.We strongly suggest associating each data set with FAIR metadata describing both the type of data collected and the accessibility conditions by considering data protection obligations and ethical and regulatory requirements.Finally,an existing FAIR infrastructure component has been used as an example to explain how FAIR metadata could facilitate data sharing while ensuring protection of individuals.
基金Part of this work is funded by the NWA program(project VWData-400.17.605)by the Netherlands Organization for Scientific Research(NWO)+1 种基金by the European Joint Program Rare Diseases(grant agreement#825575)ELIXIR-EXCELERATE(H2020-INFRADEV-1-2015-12).
文摘Since their publication in 2016 we have seen a rapid adoption of the FAIR principles in many scientific disciplines where the inherent value of research data and,therefore,the importance of good data management and data stewardship,is recognized.This has led to many communities asking“What is FAIR?”and“How FAIR are we currently?”,questions which were addressed respectively by a publication revisiting the principles and the emergence of FAIR metrics.However,early adopters of the FAIR principles have already run into the next question:“How can we become(more)FAIR?”This question is more difficult to answer,as the principles do not prescribe any specific standard or implementation.Moreover,there does not yet exist a mature ecosystem of tools,platforms and standards to support human and machine agents to manage,produce,publish and consume FAIR data in a user-friendly and efficient(i.e.,“easy”)way.In this paper we will show,however,that there are already many emerging examples of FAIR tools under development.This paper puts forward the position that we are likely already in a creolization phase where FAIR tools and technologies are merging and combining,before converging in a subsequent phase to solutions that make FAIR feasible in daily practice.
文摘The industry sector is a very large producer and consumer of data,and many companies traditionally focused on production or manufacturing are now relying on the analysis of large amounts of data to develop new products and services.As many of the data sources needed are distributed and outside the company,FAIR data will have a major impact,both by reducing the existing internal data silos and by enabling the efficient integration with external(public and commercial)data.Many companies are still in the early phases of internal data”FAIRification”,providing opportunities for SMEs and academics to apply and develop their expertise on FAIR data in collaborations and public-private partnerships.For a global Internet of FAIR Data&Services to thrive,also involving industry,professional tools and services are essential.FAIR metrics and certifications on individuals,data,organizations,and software,must ensure that data producers and consumers have independent quality metrics on their data.In this opinion article we reflect on some industry specific challenges of FAIR implementation to be dealt with when choices are made regarding”Industry GOing FAIR”.