In this paper, a Graph-based semantic Data Model (GDM) is proposed with the primary objective of bridging the gap between the human perception of an enterprise and the needs of computing infrastructure to organize i...In this paper, a Graph-based semantic Data Model (GDM) is proposed with the primary objective of bridging the gap between the human perception of an enterprise and the needs of computing infrastructure to organize information in some particular manner for efficient storage and retrieval. The Graph Data Model (GDM) has been proposed as an alternative data model to combine the advantages of the relational model with the positive features of semantic data models. The proposed GDM offers a structural representation for interacting to the designer, making it always easy to comprehend the complex relations amongst basic data items. GDM allows an entire database to be viewed as a Graph (V, E) in a layered organization. Here, a graph is created in a bottom up fashion where V represents the basic instances of data or a functionally abstracted module, called primary semantic group (PSG) and secondary semantic group (SSG). An edge in the model implies the relationship among the secondary semantic groups. The contents of the lowest layer are the semantically grouped data values in the form of primary semantic groups. The SSGs are nothing but the higher-level abstraction and are created by the method of encapsulation of various PSGs, SSGs and basic data elements. This encapsulation methodology to provide a higher-level abstraction continues generating various secondary semantic groups until the designer thinks that it is sufficient to declare the actual problem domain. GDM, thus, uses standard abstractions available in a semantic data model with a structural representation in terms of a graph. The operations on the data model are formalized in the proposed graph algebra. A Graph Query Language (GQL) is also developed, maintaining similarity with the widely accepted user-friendly SQL. Finally, the paper also presents the methodology to make this GDM compatible with the distributed environment, and a corresponding query processing technique for distributed environment is also suggested for the sake of completeness.展开更多
Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines(that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and s...Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines(that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and stewardship has the potential to remarkably enhance the framework for the reuse of research data. In this way, FAIR is aiding digital transformation. The ‘FAIRification’ of data increases the interoperability and(re)usability of data, so that new and robust analytical tools, such as machine learning(ML) models, can access the data to deduce meaningful insights, extract actionable information, and identify hidden patterns. This article aims to build a FAIR ML model pipeline using the generic FAIRification workflow to make the whole ML analytics process FAIR. Accordingly, FAIR input data was modelled using a FAIR ML model. The output data from the FAIR ML model was also made FAIR. For this, a hybrid hierarchical k-means (HHK) clustering ML algorithm was applied to group the data into homogeneous subgroups and ascertain the underlying structure of the data using a Nigerian-based FAIR dataset that contains data on economic factors, healthcare facilities, and coronavirus occurrences in all the 36 states of Nigeria. The model showed that research data and the ML pipeline can be FAIRified, shared, and reused by following the proposed FAIRification workflow and implementing technical architecture.展开更多
The benefits of the use of modeling and simulation in engineering are acknowledged widely.It has proven its advantages e.g.,in virtual prototyping i.e.,simulation aided design and testing as well as in training and R&...The benefits of the use of modeling and simulation in engineering are acknowledged widely.It has proven its advantages e.g.,in virtual prototyping i.e.,simulation aided design and testing as well as in training and R&D.It is recognized to be a tool for modern decision making.However,there are still reasons that slow down the wider utilization of modeling and simulation in companies.Modeling and simulation tools are separate and are not an integrated part of the other engineering information management in the company networks.They do not integrate well enough into the used CAD,PLM/PDM and control systems.The co-use of the simulation tools themselves is poor and the whole modeling process is considered often to be too laborious.In this article we introduce an integration solution for modeling and simulation based on the semantic data modeling approach.Semantic data modeling and ontology mapping techniques have been used in database system integration,but the novelty of this work is in utilizing these techniques in the domain of modeling and simulation.The benefits and drawbacks of the chosen approach are discussed.Furthermore,we describe real industrial project cases where this new approach has been applied.展开更多
文摘In this paper, a Graph-based semantic Data Model (GDM) is proposed with the primary objective of bridging the gap between the human perception of an enterprise and the needs of computing infrastructure to organize information in some particular manner for efficient storage and retrieval. The Graph Data Model (GDM) has been proposed as an alternative data model to combine the advantages of the relational model with the positive features of semantic data models. The proposed GDM offers a structural representation for interacting to the designer, making it always easy to comprehend the complex relations amongst basic data items. GDM allows an entire database to be viewed as a Graph (V, E) in a layered organization. Here, a graph is created in a bottom up fashion where V represents the basic instances of data or a functionally abstracted module, called primary semantic group (PSG) and secondary semantic group (SSG). An edge in the model implies the relationship among the secondary semantic groups. The contents of the lowest layer are the semantically grouped data values in the form of primary semantic groups. The SSGs are nothing but the higher-level abstraction and are created by the method of encapsulation of various PSGs, SSGs and basic data elements. This encapsulation methodology to provide a higher-level abstraction continues generating various secondary semantic groups until the designer thinks that it is sufficient to declare the actual problem domain. GDM, thus, uses standard abstractions available in a semantic data model with a structural representation in terms of a graph. The operations on the data model are formalized in the proposed graph algebra. A Graph Query Language (GQL) is also developed, maintaining similarity with the widely accepted user-friendly SQL. Finally, the paper also presents the methodology to make this GDM compatible with the distributed environment, and a corresponding query processing technique for distributed environment is also suggested for the sake of completeness.
基金VODAN-Africathe Philips Foundation+2 种基金the Dutch Development Bank FMOCORDAIDthe GO FAIR Foundation for supporting this research
文摘Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines(that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and stewardship has the potential to remarkably enhance the framework for the reuse of research data. In this way, FAIR is aiding digital transformation. The ‘FAIRification’ of data increases the interoperability and(re)usability of data, so that new and robust analytical tools, such as machine learning(ML) models, can access the data to deduce meaningful insights, extract actionable information, and identify hidden patterns. This article aims to build a FAIR ML model pipeline using the generic FAIRification workflow to make the whole ML analytics process FAIR. Accordingly, FAIR input data was modelled using a FAIR ML model. The output data from the FAIR ML model was also made FAIR. For this, a hybrid hierarchical k-means (HHK) clustering ML algorithm was applied to group the data into homogeneous subgroups and ascertain the underlying structure of the data using a Nigerian-based FAIR dataset that contains data on economic factors, healthcare facilities, and coronavirus occurrences in all the 36 states of Nigeria. The model showed that research data and the ML pipeline can be FAIRified, shared, and reused by following the proposed FAIRification workflow and implementing technical architecture.
文摘The benefits of the use of modeling and simulation in engineering are acknowledged widely.It has proven its advantages e.g.,in virtual prototyping i.e.,simulation aided design and testing as well as in training and R&D.It is recognized to be a tool for modern decision making.However,there are still reasons that slow down the wider utilization of modeling and simulation in companies.Modeling and simulation tools are separate and are not an integrated part of the other engineering information management in the company networks.They do not integrate well enough into the used CAD,PLM/PDM and control systems.The co-use of the simulation tools themselves is poor and the whole modeling process is considered often to be too laborious.In this article we introduce an integration solution for modeling and simulation based on the semantic data modeling approach.Semantic data modeling and ontology mapping techniques have been used in database system integration,but the novelty of this work is in utilizing these techniques in the domain of modeling and simulation.The benefits and drawbacks of the chosen approach are discussed.Furthermore,we describe real industrial project cases where this new approach has been applied.