This paper explores for the first time the contents,structure and relationships across institutions and disciplines of a global Big Earth Data cyber-infrastructure:the Global Earth Observation System of System(GEOSS)....This paper explores for the first time the contents,structure and relationships across institutions and disciplines of a global Big Earth Data cyber-infrastructure:the Global Earth Observation System of System(GEOSS).The analysis builds on 1.8 million metadata records harvested in GEOSS.Because this set includes almost all the major large data collections in GEOSS,the analysis represents more than 80%of all the data made available through this global system.We explore two major aspects:the collaborative networks and the thematic coverage in GEOSS.The first connects the contributing organisations through the more than 200,000 keywords used in the systems,and then explores who is citing whom,a proxy for of institutional thickness.The thematic coverage is analysed through neural network algorithms,first on the keywords,and then on the corpus of 653 million lemmatised lower case words built from the titles and abstracts of all 1.8 million metadata records.The findings not only give a good overview of the GEOSS data universe,but offer immediate priorities on how to increase the usability of GEOSS through improved data management,and the opportunity to augment the metadata with high level concept that synthetise well the contents of the data-set.展开更多
The GEOSS Platform is a key contribution to the goal of building the Global Earth Observation System of Systems(GEOSS).It enables a harmonized discovery and access of Earth observation data,shared online by heterogene...The GEOSS Platform is a key contribution to the goal of building the Global Earth Observation System of Systems(GEOSS).It enables a harmonized discovery and access of Earth observation data,shared online by heterogeneous organizations worldwide.This work analyzes both what is made available in the GEOSS Platform by the data providers and how users are utilizing it including multiyear trends,updating a previous analysis published in 2017.The present statistics derive from a 2021 EOValue report funded by the European Commission.The offer of GEOSS Platform data has been the object of various analyses,including data provider characterization,data sharing trends,and data characterization(comprising metadata quality analysis,thematic analysis,responsible party identification,spatial–temporal coverage).GEOSS data demand has also been the object of several analyses,including data consumer characterization,utilization trends,and requested data characterization(comprising thematic analysis,spatial–temporal coverage,and popularity).Among thefindings,a large amount of shared data,mostly from satellite sources,emerges with an issue of low metadata quality and related discovery match.Moreover,the trend in usage is decreasing.Therefore,the progressive disconnection of the GEOSS platform from its data Providers and Users and other possible causes are also reported.展开更多
文摘This paper explores for the first time the contents,structure and relationships across institutions and disciplines of a global Big Earth Data cyber-infrastructure:the Global Earth Observation System of System(GEOSS).The analysis builds on 1.8 million metadata records harvested in GEOSS.Because this set includes almost all the major large data collections in GEOSS,the analysis represents more than 80%of all the data made available through this global system.We explore two major aspects:the collaborative networks and the thematic coverage in GEOSS.The first connects the contributing organisations through the more than 200,000 keywords used in the systems,and then explores who is citing whom,a proxy for of institutional thickness.The thematic coverage is analysed through neural network algorithms,first on the keywords,and then on the corpus of 653 million lemmatised lower case words built from the titles and abstracts of all 1.8 million metadata records.The findings not only give a good overview of the GEOSS data universe,but offer immediate priorities on how to increase the usability of GEOSS through improved data management,and the opportunity to augment the metadata with high level concept that synthetise well the contents of the data-set.
基金funded by EOValue project funds from European Commission Directorate-General for Research and InnovationDAB4EDGE project funds from European Space Agency[ESA grant agreement 4000123005/18/IT/CGD]DAB4GPP project funds from European Space Agency[ESA grant agreement 4000138128/22/I/AG].
文摘The GEOSS Platform is a key contribution to the goal of building the Global Earth Observation System of Systems(GEOSS).It enables a harmonized discovery and access of Earth observation data,shared online by heterogeneous organizations worldwide.This work analyzes both what is made available in the GEOSS Platform by the data providers and how users are utilizing it including multiyear trends,updating a previous analysis published in 2017.The present statistics derive from a 2021 EOValue report funded by the European Commission.The offer of GEOSS Platform data has been the object of various analyses,including data provider characterization,data sharing trends,and data characterization(comprising metadata quality analysis,thematic analysis,responsible party identification,spatial–temporal coverage).GEOSS data demand has also been the object of several analyses,including data consumer characterization,utilization trends,and requested data characterization(comprising thematic analysis,spatial–temporal coverage,and popularity).Among thefindings,a large amount of shared data,mostly from satellite sources,emerges with an issue of low metadata quality and related discovery match.Moreover,the trend in usage is decreasing.Therefore,the progressive disconnection of the GEOSS platform from its data Providers and Users and other possible causes are also reported.