In the big data environment, constructing massive S&T literature data resources requires intelligent technical assistance. Focusing on a comparison of domestic and foreign knowledge organization systems and their applications, this article analyzes and summarizes the gaps between related research and applications at home and abroad. It presents a knowledge organization system framework for S&T literature data resources. Starting from the basic elements of a knowledge organization system, it also proposes and designs terminology-based analysis methods and technologies for S&T literature. Based on this framework, it proposes ideas and develops a corresponding software tool to carry out relevant experiments. It closes with an overview of theories and technical methods for future research.
Information Technology (IT) has become an essential element in supporting business strategies that allow firms to survive and rapidly adapt to changes in the competitive environment. This paper examines the impact of the use of information systems (IS) and strategic organization knowledge (SOK) on firm performance in 150 Brazilian companies. The study uses partial least squares structural equation modeling (PLS-SEM) and establishes models to express the relationships among the constructs examined. The study finds that the direct influence of IS use on performance is moderately significant. However, when mediated by orientation strategy, the total effect of IS use on firm performance is shown to be highly significant. The model explains 54% of the variability of firm performance and confirms IS use as a fundamental resource for supporting strategic business processes.
Purpose: This paper compares the paradigmatic differences between knowledge organization (KO) in library and information science and knowledge representation (KR) in AI to show the convergence in KO and KR methods and applications. Methodology: A literature review and comparative analysis of the KO and KR paradigms is the primary method used in this paper. Findings: A key difference between KO and KR lies in their purpose: KO organizes knowledge into a certain structure in order to standardize and/or normalize the vocabulary of concepts and relations, while KR is problem-solving oriented. Differences between KO and KR are discussed in terms of goals, methods, and functions. Research limitations: This is only preliminary research, with a case study as proof of concept. Practical implications: The paper articulates the opportunities in applying KR and other AI methods and techniques to enhance the functions of KO. Originality/value: Ontologies and linked data, as evidence of the convergence of the KO and KR paradigms, provide theoretical and methodological support for innovating KO in the AI era.
Purpose: This study aims to construct an ontology to model the semantics of social media streams, in particular trending topics and public issues. Design/methodology/approach: Our knowledge base included 10 public events and 10 topics from Weibo, which were collected through keyword search and a crawler program. We used a semi-automatic approach to model and annotate the semantics in social media, and adapted the multi-layered ontology to refine the design based on previous research. We then used named entity recognition (NER) to extract entities to instantiate the ontology. Relationships were extracted based on co-occurrence measures. Finally, we manually conducted a post-filtering evaluation and edited the extracted entities and relationships. Findings: An initial assessment demonstrated that our multi-layered ontology supports various types of queries and analyses in the public issue knowledge base (PIKB), which can serve as an effective tool to query, understand and trace public issues. Research limitations: Manual involvement cannot scale to meet the requirements of sustainable development. Since the extracted relationships are based entirely on the co-occurrence of entities, rich semantic relationships, such as how much the key players have been involved, could not be fully reflected. In addition, user evaluation is necessary for further ontology assessment. Practical implications: The PIKB can be used by regular Web users and policy makers to query, understand, and make sense of public events and topics. The methodology and reusable ontology model are useful for institutions that are interested in making use of social media data. Originality/value: In this study, a multi-layered ontology is applied to model the evolving semantics of public events and trending topics in social media, and the semi-automatic approach makes it possible to extract entities and relationships from large amounts of unstructured short texts of user generated content (UGC) from social media.
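The co-occurrence step mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not specify which co-occurrence measure was used, so this example assumes the simplest one (a raw count of posts in which two entities appear together). The function names, the `min_count` threshold, and the sample entities are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(posts):
    """Count how many posts each unordered entity pair appears in together."""
    counts = Counter()
    for entities in posts:
        # Deduplicate and sort so ("A", "B") and ("B", "A") share one key.
        for pair in combinations(sorted(set(entities)), 2):
            counts[pair] += 1
    return counts


def extract_relations(posts, min_count=2):
    """Keep entity pairs that co-occur in at least `min_count` posts (assumed threshold)."""
    return {pair: n for pair, n in cooccurrence_counts(posts).items() if n >= min_count}


# Hypothetical NER output: one list of entity strings per post.
posts = [
    ["Beijing", "smog", "Ministry of Environment"],
    ["Beijing", "smog"],
    ["smog", "Ministry of Environment"],
]
relations = extract_relations(posts)
```

A count-based measure like this records only that two entities appear together, which matches the limitation the abstract itself notes: it cannot express how, or how strongly, the entities are related.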
Purpose: In response to the different requirements of research group users, we established the knowledge-based subject group integration platforms of the Shanghai Institute of Ceramics, Chinese Academy of Sciences (abbreviated as SIC CAS hereinafter), which were designed and constructed to better meet the needs of CAS research groups for development, collaboration and communication. Design/methodology/approach: We first identified the requirements of users via a preliminary investigation, and then chose CASIIP, iLibrary and XKE technology as the building tools compatible with the major demands of users. These steps helped us complete the layout design of the SIC CAS integration platforms, as well as their knowledge organization and integration. Findings: According to the needs of users, we applied three types of platform construction technologies to five SIC integration platforms, and formulated standard norms for the further construction process, which could provide a useful reference for the sustainable development of similar platforms across CAS institutes. Research limitations: To make the SIC integration platforms more intelligent and more functional, we need to enlarge the scale of the platforms and upgrade the building tools used for platform construction. Practical implications: The aim of the SIC sub-project integration platforms is to construct a content-sensitive environment that can embed knowledge services and knowledge applications seamlessly into scientific activities, so the platforms are expected to be a useful tool to help researchers better understand recent developments in their research field and form collaborations with their peers. Originality/value: The SIC integration platforms are the only pilot construction that used three different platform technologies in the first batch of knowledge-based subject group integration platforms of the Chinese Academy of Sciences. The construction is user-centered throughout the whole process: technology selection, content construction and the sustainable development of the platforms are all based on user requirements. During this process, we have not only established sustainable mechanisms for both personalized feedback and security management of the institutional knowledge of SIC CAS, but also formed a service team for the sustainable development of the SIC integration platforms.
Purpose: The aim of this study is to develop and implement a quality-control system to ensure authority control of the different knowledge units for the Scientific & Technological Knowledge Organization Systems (STKOS). Design/methodology/approach: First, we analyzed quality-control requirements based on the construction of the STKOS Metathesaurus. Then we designed a quality-control framework, the task management and transfer mechanism, and a service model. Afterwards, we carried out experiments to check the rules and algorithms used in the system. Finally, the system was developed, and gradually optimized during its service. Findings: The quality-control system supports collaborative knowledge construction, as well as consistency checks of knowledge units with different granularity levels, including terminologies, relationships, and concepts. The system can be flexibly configured. Research limitations: The system is oriented to an English-language knowledge organization system, and may not perform well with Chinese-language systems and ontologies. Practical implications: The system can be used to support the construction of a single knowledge organization system, as well as data warehousing and interoperable knowledge organization systems. Originality/value: The STKOS quality-control system not only focuses on content building for the knowledge system, but also supports collaborative task management.
To improve the informatization and intelligence of the energy Internet industry and enhance the capability of knowledge services, it is necessary to organize the energy Internet body of knowledge from the existing knowledge resources of the State Grid, which are large in scale, drawn from multiple sources, and heterogeneous. At the same time, the business fields of the State Grid cover a wide range. There are many sub-fields under each business field, and the relationships between fields are diverse and complex. The key to establishing the energy Internet body of knowledge is how to fuse the heterogeneous knowledge resources from multiple sources, extract the knowledge contents from them, and organize the different relationships. This paper considers transforming the original knowledge resources of the State Grid into a unified and well-organized knowledge system described in the OWL language to meet the requirements of heterogeneous resource integration, multi-source resource organization, and knowledge service provision. For the State Grid knowledge resources, which are mainly in XML format, this paper proposes a Knowledge Automatic Fusion and Organization method based on an XSD Directed Graph. In the first stage of the method, the XSD documents corresponding to the XML resources are transformed into a directed graph, and a graph neural network detects hidden knowledge inside the structure to add semantic information to the graph. In the second stage, for other structured knowledge resources (e.g., databases, spreadsheets), the knowledge contents and relationships are analyzed manually to establish mappings from structured resources to graph structures. Using these mappings, the original knowledge resources are transformed into graph structures and merged with the directed graphs obtained in the first stage to achieve the fusion of heterogeneous knowledge resources. Expert knowledge is then introduced into the heterogeneous knowledge fusion to further extend the directed graph. In the third stage, the expanded directed graph is converted to the body of knowledge in the form of OWL. This paper takes the State Grid's knowledge resources in the field of human resources as an example and establishes the ontology of the human resources training field in a unified manner, providing an initial demonstration of the effectiveness of the proposed method.
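As a rough illustration of the first stage described above (XSD documents turned into a directed graph), the following sketch walks inline nested `xs:element` declarations and records parent-to-child edges. It is an assumption-laden simplification, not the paper's method: it ignores `ref=` and named `type=` references, and it does not attempt the graph neural network step. The function name `xsd_to_digraph` and the sample schema are hypothetical.

```python
import xml.etree.ElementTree as ET

# Clark-notation prefix for the XML Schema namespace.
XS = "{http://www.w3.org/2001/XMLSchema}"


def xsd_to_digraph(xsd_text):
    """Return {element_name: [child_element_names]} for inline nested xs:element declarations."""
    root = ET.fromstring(xsd_text)
    graph = {}

    def visit(node, parent_name):
        for child in node:
            name = child.get("name")
            if child.tag == XS + "element" and name:
                graph.setdefault(name, [])
                if parent_name is not None:
                    graph[parent_name].append(name)  # edge: parent -> child
                visit(child, name)
            else:
                # Pass through wrappers such as xs:complexType and xs:sequence.
                visit(child, parent_name)

    visit(root, None)
    return graph


# Hypothetical schema for a training course record.
SAMPLE_XSD = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="Course">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="Title" type="xs:string"/>
        <xs:element name="Trainer">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="Name" type="xs:string"/>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""

graph = xsd_to_digraph(SAMPLE_XSD)
```

An adjacency dictionary keeps the sketch dependency-free. Graphs produced from other structured sources in the second stage could be merged into the same dictionary before any conversion to OWL.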
Purpose: To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products. Design/methodology/approach: Data collection and analysis were conducted at three time periods, in 2015–16, 2017 and 2019. The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints. A comparative study of the LOD KOS collected from the terminology services Linked Open Vocabularies (LOV) and BioPortal was also performed. Findings: The study proposes a set of Functional, Impactful and Transformable (FIT) metrics for LOD KOS as value vocabularies. The FAIR principles, with additional recommendations, are presented for LOD KOS as open data. Research limitations: The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS. Practical implications: Assessments performed with the FAIR and FIT metrics support the creation and delivery of user-friendly, discoverable and interoperable LOD KOS datasets, which can be used for innovative applications, act as a knowledge base, become a foundation for semantic analysis and entity extraction, and enhance research in science and the humanities. Originality/value: Our research provides best practice guidelines for LOD KOS as value vocabularies.
Purpose: This paper reports on the publication of the second edition of the Integrative Levels Classification (ILC2), a freely-faceted knowledge organization system (KOS), and reviews the main changes that have been introduced as compared to its first edition (ILC1). Design/methodology/approach: The most relevant changes are illustrated, with special reference to those of interest to general classification theory, by means of examples of notation for individual classes and combinations of them. Findings: Changes introduced in ILC2 include: the names and order of some main classes; the development of subclasses for various phenomena, especially quantities and algebraic structures; the order of facet categories and the new category of Disorder; notation for special facets; and the distinction of the semantic function of facets (attributes) from their syntactic function. The system can be freely accessed online through a PHP browser as well as in SKOS format. Research limitations: Only a selection of changed classes is discussed, for reasons of space. Practical implications: ILC1 has previously been applied to the BARTOC directory of KOSs. An update of BARTOC data to ILC2 and the application of ILC2 to further information systems are envisaged. Possible methods for reclassifying BARTOC with ILC2 are discussed. Originality: ILC is a newly developed classification system, based on phenomena instead of traditional disciplines and featuring various innovative devices. This paper is an original account of its most recent evolution.
Purpose: This paper describes the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating how it works and highlighting the main challenges that still have to be faced in this domain. Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. This methodology has an operational and empirical orientation: the proposed checks do not assume any theoretical distribution when determining the threshold parameters that identify potential outliers, inconsistencies, and errors in the data. Findings: We show that the proposed cross-sectional and multiannual checks help to identify outliers and extreme observations and to detect ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information. Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated. Practical implications: Considering the quality of the available data and information is important for enhancing data quality-aware empirical investigations and for highlighting problems and areas in which to invest in order to improve the coverage and interoperability of data in future data collection initiatives. Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases available for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
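The two kinds of checks named in the abstract above can be sketched in a few lines. The abstract does not give the actual ETER rules or thresholds, so everything here is an assumption made for illustration: the field names, the 5% tolerance, the factor-of-two stability limit, and both function names are hypothetical.

```python
def cross_sectional_flag(record, parts, total, tolerance=0.05):
    """Flag a record whose parts do not add up to the reported total.

    `tolerance` is an assumed relative tolerance, not a value from the paper.
    """
    expected = sum(record[p] for p in parts)
    reported = record[total]
    if reported == 0:
        return expected != 0
    return abs(expected - reported) / reported > tolerance


def multiannual_flags(series, max_ratio=2.0):
    """Return the years whose value changed by more than `max_ratio` times
    relative to the previous year (an assumed, empirically chosen limit)."""
    flagged = []
    years = sorted(series)
    for prev, curr in zip(years, years[1:]):
        a, b = series[prev], series[curr]
        if a > 0 and b > 0 and max(a, b) / min(a, b) > max_ratio:
            flagged.append(curr)
    return flagged


# Hypothetical institution record and enrolment time series.
record = {"students_bachelor": 800, "students_master": 300, "students_total": 1000}
inconsistent = cross_sectional_flag(
    record, ["students_bachelor", "students_master"], "students_total"
)
jumps = multiannual_flags({2011: 1000, 2012: 1050, 2013: 2600, 2014: 2550})
```

Both checks compare observed values with each other rather than with a fitted distribution, which is the property the abstract emphasizes. In practice the limits would be tuned on the data rather than fixed in advance.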
The China-US Million Book Digital Library Project (Million Book Project) is an international cooperation program between China and the US. One million digitized books are not, however, considered the ultimate goal of the project, but a first step towards universal access to human knowledge. In particular, there are four challenges concerning new ways to analyze, process, operate, visualize and interact with the digital media resources in this library. To tackle these challenges, the North China Centre of the Million Book Project (at the Chinese Academy of Sciences) has initiated several innovative research projects in areas such as multimedia content analysis and retrieval, bilingual services, multimodal information presentation, and knowledge-based organization and services. In this keynote speech, we briefly review our work in these areas, and argue that through technological cooperation on these innovative research topics, the project will develop a top-level digital library platform for the million book library.
This paper aims to design an integrated framework of a DSS with multiple ESs to enhance decision makers' ability to handle more complex cases. The basic integrated framework and the knowledge organization and management of the multiple ESs system (MESS) are presented. The following aspects are discussed in detail: combining opinions from multiple ESs, problem-based model integration, and inconsistency processing among multiple ESs. Finally, a summary and some ideas for the future are given.
Standards to describe soil properties are well established, with many ISO specifications and a few international thesauri available for specific applications. In addition, in recent years the European directive on "Infrastructure for Spatial Information in the European Community (INSPIRE)" has brought together most of the existing standards into a well-defined model. However, the adoption of these standards has so far not reached the level of semantic interoperability, as defined in the paper, that would facilitate the building of data services that reuse and combine data from different sources. This paper reviews standards for describing soil data and reports on the work done within the EC-funded agINFRA project to apply Linked Data technologies to existing standards and data in order to improve the interoperability of soil datasets. The main result of this work is twofold. First, an RDF vocabulary for soil concepts based on the UML INSPIRE model was published. Second, a KOS (Knowledge Organization System) for soil data was published and mapped to existing relevant KOS, based on the analysis of the SISI database of the CREA of Italy. This work also has a methodological value, in that it proposes and applies a methodology to standardize metadata used in local scientific databases, a very common situation in the scientific domain. Finally, this work aims to contribute towards a wider adoption of the INSPIRE directive by providing an RDF version of it.
Objective: Identification of critical process parameters (CPPs) is an important step in implementing the quality by design (QbD) concept. There are many CPP identification methods, such as the risk analysis method, the sensitivity analysis method, the multiple linear regression method, and the standard partial regression coefficient (SPRC) method. The SPRC method can consider multiple process critical quality attributes (CQAs) simultaneously, but the determination of the number of CPPs is subjective. Therefore, a new CPP identification method is still required. Methods: The manufacturing process of Astragali Radix extract, which comprised water reflux extraction, concentration, and ethanol precipitation, was used as an example. First, the multiple process CQAs were determined to be the yields of pigment, dry matter, sugars, and active ingredients. Second, the potential CPPs were determined by a knowledge organization method. Plackett-Burman designed experiments were then performed. A weighted determination coefficient (R²_w) method was presented to identify CPPs. In this method, the importance of the different CQAs was considered. Process parameters were removed one by one according to their importance index. The decrease in R²_w was used to characterize the importance of the removed parameter. If the decrease in R²_w was less than a preset threshold, the removed parameter was not a CPP. Results: For the manufacturing process of Astragali Radix extract, the potential CPPs determined by the knowledge organization method were water consumption, reflux extraction time, extraction frequency, ethanol content, ethanol consumption, and concentration endpoint. Reflux extraction time, the first ethanol consumption, the second ethanol consumption, and the second ethanol precipitation refrigeration temperature were found to be CPPs using the weighted determination coefficient method with a threshold of 10%. Conclusion: Using the weighted determination coefficient method, CPPs can be determined with all the CQAs considered according to their importance. The determination of the number of CPPs is more objective than with the SPRC method.
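The abstract above does not state the formula for the weighted determination coefficient. One plausible reading, given here only as a hedged sketch, is a weighted average of the per-CQA determination coefficients. The symbols $w_j$, $R_j^2$, $S$, $\Delta_k$ and $\tau$ are assumptions introduced for illustration, and whether the 10% threshold refers to an absolute or a relative decrease is not specified in the abstract.

```latex
% Assumed definition: m CQAs, weight w_j for CQA j, and R_j^2(S) the
% determination coefficient of the regression model for CQA j fitted on
% the parameter set S.
R_w^2(S) = \sum_{j=1}^{m} w_j \, R_j^2(S), \qquad \sum_{j=1}^{m} w_j = 1

% Importance of parameter k: the decrease in R_w^2 when k is removed.
\Delta_k = R_w^2(S) - R_w^2\bigl(S \setminus \{k\}\bigr)

% Decision rule with preset threshold \tau (10% in the abstract):
k \text{ is not a CPP if } \Delta_k < \tau
```

Under this reading, a parameter whose removal barely lowers the weighted fit across all CQAs is dropped, while one whose removal costs more than the threshold is kept as a CPP.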
Funding (S&T literature knowledge organization framework study): Supported by the National Social Science Fund of China (No. 18BTQ054).
Funding (social media public issue ontology study): Supported by the Beijing Thinker Workshop (Grant No. XK201211001).
Funding (SIC CAS integration platforms study): Supported by the Chinese Academy of Sciences (Grant No. Y11006).
Funding (STKOS quality-control system study): Supported by the Ministry of Science and Technology of China (Grant No. 2011BAH10B02).
Funding: This work is a research result of "Research and Application of Dynamic Knowledge Map Technology for Energy Internet" (1200/2021-66002A), a science and technology project of the State Grid Corporation of China.
Abstract: To improve the informatization and intelligence of the energy Internet industry and enhance its knowledge service capability, it is necessary to organize the energy Internet body of knowledge from the existing knowledge resources of the State Grid, which are large in scale, multi-source, and heterogeneous. At the same time, the business fields of the State Grid cover a wide range. There are many sub-fields under each business field, and the relationships between fields are diverse and complex. The key to establishing the energy Internet body of knowledge is how to fuse the heterogeneous knowledge resources from multiple sources, extract the knowledge contents from them, and organize the different relationships. This paper considers transforming the original knowledge resources of the State Grid into a unified and well-organized knowledge system described in the OWL language to meet the requirements of heterogeneous resource integration, multi-source resource organization, and knowledge service provision. For the State Grid knowledge resources, which are mainly in XML format, this paper proposes a Knowledge Automatic Fusion and Organization method based on an XSD Directed Graph. In the first stage, the XSD documents corresponding to the XML resources are transformed into a directed graph, and a graph neural network detects hidden knowledge inside the structure to add semantic information to the graph. In the second stage, for other structured knowledge resources (e.g., databases, spreadsheets), the knowledge contents and relationships are analyzed manually to establish mappings from structured resources to graph structures; using these mappings, the original knowledge resources are transformed into graph structures and merged with the directed graph obtained in the first stage to achieve the fusion of heterogeneous knowledge resources. Expert knowledge is also introduced during fusion to further extend the directed graph. In the third stage, the expanded directed graph is converted into the body of knowledge in OWL form. Taking the State Grid's human resources knowledge resources as an example, this paper establishes the ontology of the human resources training field in a unified manner, providing an initial demonstration of the effectiveness of the proposed method.
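The first stage described in this abstract (turning an XSD document into a directed graph) can be illustrated with a minimal sketch. It is not the authors' implementation: the sample schema, the function name `xsd_to_digraph`, and the choice to draw an edge from each named element to the elements nested inside it are all assumptions made for illustration, and the graph neural network step is not shown.

```python
import xml.etree.ElementTree as ET

# Namespace prefix used by ElementTree for XML Schema tags.
XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema for a training record; not taken from the paper.
SAMPLE_XSD = """
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="Course">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="Title" type="xs:string"/>
        <xs:element name="Trainer">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="Name" type="xs:string"/>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
"""


def xsd_to_digraph(xsd_text):
    """Return an adjacency list {element name: [directly nested element names]}."""
    root = ET.fromstring(xsd_text.strip())
    edges = {}

    def walk(node, parent_name):
        for child in node:
            if child.tag == XS + "element" and child.get("name"):
                name = child.get("name")
                edges.setdefault(name, [])
                if parent_name is not None:
                    # Directed edge: parent element contains this element.
                    edges[parent_name].append(name)
                walk(child, name)
            else:
                # Skip structural wrappers such as complexType and sequence.
                walk(child, parent_name)

    walk(root, None)
    return edges


graph = xsd_to_digraph(SAMPLE_XSD)
for source, targets in graph.items():
    print(source, "->", targets)
```

An adjacency list keeps the sketch dependency-free; graphs built from other structured sources could be merged into the same dictionary before conversion to OWL.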
Funding: College of Communication and Information (CCI) Research and Creative Activity Fund, Kent State University.
Abstract: Purpose: To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products. Design/methodology/approach: Data collection and analysis were conducted at three time periods: 2015–16, 2017 and 2019. The sample data used in the comprehensive data analysis comprise all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints. A comparative study of the LOD KOS collected from the terminology services Linked Open Vocabularies (LOV) and BioPortal was also performed. Findings: The study proposes a set of Functional, Impactful and Transformable (FIT) metrics for LOD KOS as value vocabularies. The FAIR principles, with additional recommendations, are presented for LOD KOS as open data. Research limitations: The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS. Practical implications: Assessments performed with FAIR and FIT metrics support the creation and delivery of user-friendly, discoverable and interoperable LOD KOS datasets which can be used for innovative applications, act as a knowledge base, become a foundation of semantic analysis and entity extraction, and enhance research in science and the humanities. Originality/value: Our research provides best practice guidelines for LOD KOS as value vocabularies.
Abstract: Purpose: This paper reports the publication of the second edition of the Integrative Levels Classification (ILC2), a freely-faceted knowledge organization system (KOS), and reviews the main changes that have been introduced as compared to its first edition (ILC1). Design/methodology/approach: The most relevant changes are illustrated, with special reference to those of interest to general classification theory, by means of examples of notation for individual classes and combinations of them. Findings: Changes introduced in ILC2 include: the names and order of some main classes; the development of subclasses for various phenomena, especially quantities and algebraic structures; the order of facet categories and the new category of Disorder; notation for special facets; and the distinction of the semantic function of facets (attributes) from their syntactic function. The system can be freely accessed online through a PHP browser as well as in SKOS format. Research limitations: Only a selection of changed classes is discussed for space reasons. Practical implications: ILC1 has previously been applied to the BARTOC directory of KOSs. An update of BARTOC data to ILC2 and the application of ILC2 to further information systems are envisaged. Possible methods for reclassifying BARTOC with ILC2 are discussed. Originality: ILC is a newly developed classification system, based on phenomena instead of traditional disciplines and featuring various innovative devices. This paper is an original account of its most recent evolution.
Funding: Support of the European Commission ETER Project (No. 934533-2017-AO8-CH) and the H2020 RISIS 2 project (No. 824091).
Abstract: Purpose: This paper concerns the definition of data quality procedures for knowledge organizations such as Higher Education Institutions. The main purpose is to present the flexible approach developed for monitoring the data quality of the European Tertiary Education Register (ETER) database, illustrating its functioning and highlighting the main challenges that still have to be faced in this domain. Design/methodology/approach: The proposed data quality methodology is based on two kinds of checks, one to assess the consistency of cross-sectional data and the other to evaluate the stability of multiannual data. This methodology has an operational and empirical orientation. This means that the proposed checks do not assume any theoretical distribution for the determination of the threshold parameters that identify potential outliers, inconsistencies, and errors in the data. Findings: We show that the proposed cross-sectional checks and multiannual checks are helpful to identify outliers and extreme observations and to detect ontological inconsistencies not described in the available metadata. For this reason, they may be a useful complement to the processing of the available information. Research limitations: The coverage of the study is limited to European Higher Education Institutions. The cross-sectional and multiannual checks are not yet completely integrated. Practical implications: Considering the quality of the available data and information is important to enhance data quality-aware empirical investigations, highlighting problems and areas in which to invest to improve the coverage and interoperability of data in future data collection initiatives. Originality/value: The data-driven quality checks proposed in this paper may be useful as a reference for building and monitoring the data quality of new databases, or of existing databases available for other countries or systems characterized by high heterogeneity and complexity of the units of analysis, without relying on pre-specified theoretical distributions.
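The two kinds of checks named in this abstract can be sketched as follows. This is an illustrative reading, not the ETER procedure: the field names (`undergraduate`, `graduate`, `total_students`), the 1% tolerance, and the use of a median-plus-MAD threshold as a data-driven cutoff are all assumptions. The only property taken from the abstract is that the threshold is derived from the data rather than from a theoretical distribution.

```python
import statistics


def cross_sectional_flags(records, tolerance=0.01):
    """Flag records whose reported parts do not add up to the reported total."""
    flagged = []
    for rec in records:
        parts = rec["undergraduate"] + rec["graduate"]
        total = rec["total_students"]
        if total == 0:
            if parts != 0:
                flagged.append(rec["id"])
        elif abs(parts - total) / total > tolerance:
            flagged.append(rec["id"])
    return flagged


def multiannual_flags(series_by_unit, k=5.0):
    """Flag units with a year-to-year relative change above an empirical threshold.

    The threshold is median + k * MAD of all observed relative changes,
    so it depends only on the data at hand (k is a hypothetical tuning knob).
    """
    changes = []
    for unit, series in series_by_unit.items():
        for prev, curr in zip(series, series[1:]):
            if prev > 0:
                changes.append((abs(curr - prev) / prev, unit))
    if not changes:
        return [], None
    values = [change for change, _ in changes]
    median = statistics.median(values)
    mad = statistics.median(abs(v - median) for v in values)
    threshold = median + k * mad
    flagged = sorted({unit for change, unit in changes if change > threshold})
    return flagged, threshold


# Hypothetical data for illustration only.
records = [
    {"id": "X", "undergraduate": 60, "graduate": 40, "total_students": 100},
    {"id": "Y", "undergraduate": 60, "graduate": 40, "total_students": 150},
]
enrolment = {"A": [100, 102, 101], "B": [200, 198, 204], "C": [50, 51, 150]}

inconsistent = cross_sectional_flags(records)
unstable, threshold = multiannual_flags(enrolment)
print("cross-sectional flags:", inconsistent)
print("multiannual flags:", unstable)
```

Flagged units are candidates for manual review rather than confirmed errors, since a large change may reflect a real event such as a merger.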
Abstract: The China-US Million Book Digital Library Project (Million Book Project) is an international cooperation program between China and the US. However, one million digitized books are not considered the ultimate goal of the project, but a first step towards universal access to human knowledge. In particular, there are four challenges concerning new ways to analyze, process, operate, visualize and interact with the digital media resources in this library. To tackle these challenges, the North China Centre of the Million Book Project (at the Chinese Academy of Sciences) has initiated several innovative research projects in areas such as multimedia content analysis and retrieval, bilingual services, multimodal information presentation, and knowledge-based organization and services. In this keynote speech, we briefly review our work in these areas, and argue that through technological cooperation on these innovative research topics, the project will develop a top-level digital library platform for the million book library.
Abstract: This paper aims to design an integrated framework of a DSS with multiple ESs to enhance decision makers' ability in more complex cases. The basic integrated framework and the knowledge organization and management of the multiple ESs system (MESS) are presented. The following aspects are discussed in particular: combining opinions from multiple ESs, problem-based model integration, and inconsistency processing among multiple ESs. Finally, a summary and some ideas for the future are given.
Funding: The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement No. 283770.
Abstract: Standards to describe soil properties are well established, with many ISO specifications and a few international thesauri available for specific applications. In addition, in recent years, the European directive on "Infrastructure for Spatial Information in the European Community (INSPIRE)" has brought together most of the existing standards into a well-defined model. However, the adoption of these standards so far has not reached the level of semantic interoperability, defined in the paper, which would facilitate the building of data services that reuse and combine data from different sources. This paper reviews standards for describing soil data and reports on the work done within the EC-funded agINFRA project to apply Linked Data technologies to existing standards and data in order to improve the interoperability of soil datasets. The main result of this work is twofold. First, an RDF vocabulary for soil concepts based on the UML INSPIRE model was published. Second, a KOS (Knowledge Organization System) for soil data was published and mapped to existing relevant KOS, based on the analysis of the SISI database of the CREA of Italy. This work also has a methodological value, in that it proposes and applies a methodology to standardize metadata used in local scientific databases, a very common situation in the scientific domain. Finally, this work aims at contributing towards a wider adoption of the INSPIRE directive, by providing an RDF version of it.
Funding: Support of the National Project for Standardization of Chinese Materia Medica (ZYBZH-C-GD-04) and the Student Research Training Program of the College of Pharmaceutical Sciences of Zhejiang University (Y36023).
Abstract: Objective: Identification of critical process parameters (CPPs) is an important step in implementing the quality by design (QbD) concept. There are many CPP identification methods, such as the risk analysis method, the sensitivity analysis method, the multiple linear regression method, and the standard partial regression coefficient (SPRC) method. The SPRC method can consider multiple process critical quality attributes (CQAs) simultaneously, but the determination of the number of CPPs is subjective. Therefore, a new CPP identification method is still required. Methods: The manufacturing process of Astragali Radix extract, which comprised water reflux extraction, concentration, and ethanol precipitation, was used as an example. First, the multiple process CQAs were determined to be the yield of pigment, dry matter, sugars, and active ingredients. Second, the potential CPPs were determined by a knowledge organization method. Plackett-Burman designed experiments were then performed. A weighted determination coefficient (R²_w) method was presented to identify CPPs. In this method, the importance of different CQAs was considered. Process parameters were removed one by one according to their importance index. The decrease in R²_w was used to characterize the importance of the removed parameter. If the decrease in R²_w was less than a preset threshold, the removed parameter was not a CPP. Results: During the manufacturing process of Astragali Radix extract, the potential CPPs determined by the knowledge organization method were water consumption, reflux extraction time, extraction frequency, ethanol content, ethanol consumption, and concentration endpoint. Reflux extraction time, the first ethanol consumption, the second ethanol consumption, and the second ethanol precipitation refrigeration temperature were found to be CPPs using the weighted determination coefficient method with a threshold of 10%. Conclusion: Using the weighted determination coefficient method, CPPs can be determined with all the CQAs considered based on their importance. The determination of the number of CPPs is more objective compared with the SPRC method.
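The idea behind the weighted determination coefficient can be sketched in a simplified form. The abstract does not give the formula or the removal order, so the following rests on assumptions: R²_w is taken to be a weighted sum of the per-CQA R² values from ordinary least-squares fits, each parameter is dropped from the full model on its own, and a small full factorial design with noise-free synthetic responses stands in for the Plackett-Burman experiments. The names `x1` to `x3` and the weights are hypothetical.

```python
import itertools

import numpy as np


def r_squared(X, y):
    """Coefficient of determination of a least-squares linear fit with intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    residual = y - A @ coef
    ss_res = float(residual @ residual)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot


def weighted_r2(X, Y, weights):
    """Weighted sum of R^2 over the CQA columns of Y (assumed definition)."""
    return sum(w * r_squared(X, Y[:, j]) for j, w in enumerate(weights))


def identify_cpps(X, Y, weights, names, threshold=0.10):
    """Label a parameter as a CPP if removing it lowers R^2_w by at least threshold."""
    full = weighted_r2(X, Y, weights)
    decreases = {}
    for i, name in enumerate(names):
        reduced = np.delete(X, i, axis=1)
        decreases[name] = full - weighted_r2(reduced, Y, weights)
    cpps = [name for name in names if decreases[name] >= threshold]
    return cpps, decreases


# Synthetic two-level design: 3 parameters, 8 runs (illustration only).
X = np.array(list(itertools.product([-1.0, 1.0], repeat=3)))
Y = np.column_stack([
    3.0 * X[:, 0] + 0.1 * X[:, 2],  # CQA 1 depends mostly on x1
    2.0 * X[:, 1],                  # CQA 2 depends only on x2
])
names = ["x1", "x2", "x3"]
weights = [0.5, 0.5]  # hypothetical importance of the two CQAs

cpps, decreases = identify_cpps(X, Y, weights, names)
print("CPPs:", cpps)
print("decrease in weighted R^2:", decreases)
```

With the 10% threshold, `x3` is not labeled a CPP here because removing it barely changes the weighted fit, which mirrors the decision rule stated in the abstract.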