An adequate compute and storage infrastructure supporting the full exploitation of Copernicus and Earth Observation datasets is currently not available in Europe.This paper presents the cross-disciplinary open-source ...An adequate compute and storage infrastructure supporting the full exploitation of Copernicus and Earth Observation datasets is currently not available in Europe.This paper presents the cross-disciplinary open-source technologies being leveraged in the C-SCALE project to develop an open federation of compute and data providers as an alternative to monolithic infrastructures for processing and analysing Copernicus and Earth Observation data.Three critical aspects of the federation and the chosen technologies are elaborated upon:(1)federated data discovery,(2)federated access and(3)software distribution.With these technologies the open federation aims to provide homogenous access to resources,thereby enabling its users to generate meaningful results quickly and easily.This will be achieved by abstracting the complexity of infrastructure resource access provisioning and orchestration,including discovery of data across distributed archives,away from the end-users.Which is needed because end-users wish to focus on analysing ready-to-use data products and models rather than spending their time on the setup and maintenance of complex and heterogeneous IT infrastructures.The open federation will support processing and analysing the vast amounts of Copernicus and Earth Observation data that are critical for the implementation of the Destination Earth resp.Digital Twins vision for a high precision digital model of the Earth to model,monitor and simulate natural phenomena and related human activities.展开更多
The Geosciences Network(GEON)project has been developing cyberinfrastructure for data sharing in the Earth Science community based on a serviceoriented architecture.The layered architecture consists of Core,Middleware...The Geosciences Network(GEON)project has been developing cyberinfrastructure for data sharing in the Earth Science community based on a serviceoriented architecture.The layered architecture consists of Core,Middleware,and Applications services.Core services provide system-level functions(e.g.user authentication),Middleware services provide generic capabilities(e.g.catalog search),and Application services provide functions that users directly interact with,including applications that are specific to Earth Sciences.The GEON‘service stack’includes a standardized set of these services and the corresponding software modules.The GEON Portal provides Web-based access to these services via a set of portlets.This service-oriented approach has enabled GEON to expand to new partner sites and leverage GEON services for other projects.To facilitate interoperation in a distributed geoinformatics environment,GEON is focusing on standards for distributed search across federated catalogs.展开更多
One way to speed up the execution of sequential programs is to divide them into concurrent segments and execute such segments in a parallel manner over a distributed computing environment.We argue that the execution s...One way to speed up the execution of sequential programs is to divide them into concurrent segments and execute such segments in a parallel manner over a distributed computing environment.We argue that the execution speedup primarily depends on the concurrency degree between the identified segments as well as communication overhead between the segments.To guarantee the best speedup,we have to obtain the maximum possible concurrency degree between the identified segments,taking communication overhead into consideration.Existing code distributor and multi-threading approaches do not fulfill such requirements;hence,they cannot provide expected distributability gains in advance.To overcome such limitations,we propose a novel approach for verifying the distributability of sequential object-oriented programs.The proposed approach enables users to see the maximum speedup gains before the actual distributability implementations,as it computes an objective function which is used to measure different distribution values from the same program,taking into consideration both remote and sequential calls.Experimental results showed that the proposed approach successfully determines the distributability of different real-life software applications compared with their real-life sequential and distributed implementations.展开更多
基金the C-SCALE project(https://c-scale.eu/),which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 101017529。
文摘An adequate compute and storage infrastructure supporting the full exploitation of Copernicus and Earth Observation datasets is currently not available in Europe.This paper presents the cross-disciplinary open-source technologies being leveraged in the C-SCALE project to develop an open federation of compute and data providers as an alternative to monolithic infrastructures for processing and analysing Copernicus and Earth Observation data.Three critical aspects of the federation and the chosen technologies are elaborated upon:(1)federated data discovery,(2)federated access and(3)software distribution.With these technologies the open federation aims to provide homogenous access to resources,thereby enabling its users to generate meaningful results quickly and easily.This will be achieved by abstracting the complexity of infrastructure resource access provisioning and orchestration,including discovery of data across distributed archives,away from the end-users.Which is needed because end-users wish to focus on analysing ready-to-use data products and models rather than spending their time on the setup and maintenance of complex and heterogeneous IT infrastructures.The open federation will support processing and analysing the vast amounts of Copernicus and Earth Observation data that are critical for the implementation of the Destination Earth resp.Digital Twins vision for a high precision digital model of the Earth to model,monitor and simulate natural phenomena and related human activities.
基金This research has been funded by the US National Science Foundation via grants 0225673 and 0744229.
文摘The Geosciences Network(GEON)project has been developing cyberinfrastructure for data sharing in the Earth Science community based on a serviceoriented architecture.The layered architecture consists of Core,Middleware,and Applications services.Core services provide system-level functions(e.g.user authentication),Middleware services provide generic capabilities(e.g.catalog search),and Application services provide functions that users directly interact with,including applications that are specific to Earth Sciences.The GEON‘service stack’includes a standardized set of these services and the corresponding software modules.The GEON Portal provides Web-based access to these services via a set of portlets.This service-oriented approach has enabled GEON to expand to new partner sites and leverage GEON services for other projects.To facilitate interoperation in a distributed geoinformatics environment,GEON is focusing on standards for distributed search across federated catalogs.
文摘One way to speed up the execution of sequential programs is to divide them into concurrent segments and execute such segments in a parallel manner over a distributed computing environment.We argue that the execution speedup primarily depends on the concurrency degree between the identified segments as well as communication overhead between the segments.To guarantee the best speedup,we have to obtain the maximum possible concurrency degree between the identified segments,taking communication overhead into consideration.Existing code distributor and multi-threading approaches do not fulfill such requirements;hence,they cannot provide expected distributability gains in advance.To overcome such limitations,we propose a novel approach for verifying the distributability of sequential object-oriented programs.The proposed approach enables users to see the maximum speedup gains before the actual distributability implementations,as it computes an objective function which is used to measure different distribution values from the same program,taking into consideration both remote and sequential calls.Experimental results showed that the proposed approach successfully determines the distributability of different real-life software applications compared with their real-life sequential and distributed implementations.