Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among ...Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among all tokens in a sentence. We proposed an annotation scheme based on feature structure theory for enriching biomedical domain corpora with token semantic association (TSA). There are 227 documents of the BioNLP GE ST training data annotated to form TSA corpus in which each annotated item shows a token semantic association that appears as a triple. The annotation of token semantic association has the potential to significantly advance biomedical text mining by providing rich token semantic information for NLP systems especially for the sophisticated IE systems, such as bio-event extraction.展开更多
A context memory model and an approach for context query and association discovery are proposed. The context query is based on a resource description framework (RDF) dataset and SPARQL language. To discover collabor...A context memory model and an approach for context query and association discovery are proposed. The context query is based on a resource description framework (RDF) dataset and SPARQL language. To discover collaborative associations, an approach of transforming RDF named graphs into "context graph" is proposed. First, the definitions of the importance of the nodes and the weight assignment for the "context graph" are given. Secondly, the implementation of a spread activation algorithm based on "context graph" is proposed. An infrastructure is also built up in the collaborative context space (CCS) system to support context memory and knowledge discovery in a collaborative environment.展开更多
Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/m...Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/methodology/approach: A heterogeneous researcher network has been constructed by combining multiple relations of academic researchers. Vertex attributes and their similarities were considered and calculated. An approach has been proposed and tested to detect research community in research organizations based on this multi-relation researcher network.Findings: Detection of topologically well-connected, semantically coherent and meaningful research community was achieved.Research limitations: The sample size of evaluation experiments was relatively small. In the present study, a limited number of 72 researchers were analyzed for constructing researcher network and detecting research community. Therefore, a large sample size is required to give more information and reliable results.Practical implications: The proposed multi-relation researcher network and approaches for discovering research communities of similar research interests will contribute to collective innovation behavior such as brainstorming and to promote interdisciplinary cooperation.Originality/value: Recent researches on community detection devote most efforts to singlerelation researcher networks and put the main focus on the topological structure of networks.In reality, there exist multi-relation social networks. Vertex attribute also plays an important role in community detection. The present study combined multiple single-relational researcher networks into a multi-relational network and proposed a structure-attribute clustering method for detecting research community in research organizations.展开更多
Linked data is a decentralized space of interlinked Resource Description Framework(RDF) graphs that are published,accessed,and manipulated by a multitude of Web agents.Here,we present a multi-agent framework for minin...Linked data is a decentralized space of interlinked Resource Description Framework(RDF) graphs that are published,accessed,and manipulated by a multitude of Web agents.Here,we present a multi-agent framework for mining hypothetical semantic relations from linked data,in which the discovery,management,and validation of relations can be carried out independently by different agents.These agents collaborate in relation mining by publishing and exchanging inter-dependent knowledge elements,e.g.,hypotheses,evidence,and proofs,giving rise to an evidentiary network that connects and ranks diverse knowledge elements.Simulation results show that the framework is scalable in a multi-agent environment.Real-world applications show that the framework is suitable for interdisciplinary and collaborative relation discovery tasks in social domains.展开更多
基金Supported by the National Natural Science Foundation of China(61202304,61173095,61173062,61202193)
文摘Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among all tokens in a sentence. We proposed an annotation scheme based on feature structure theory for enriching biomedical domain corpora with token semantic association (TSA). There are 227 documents of the BioNLP GE ST training data annotated to form TSA corpus in which each annotated item shows a token semantic association that appears as a triple. The annotation of token semantic association has the potential to significantly advance biomedical text mining by providing rich token semantic information for NLP systems especially for the sophisticated IE systems, such as bio-event extraction.
基金The National Natural Science Foundation of China(No.90412009).
文摘A context memory model and an approach for context query and association discovery are proposed. The context query is based on a resource description framework (RDF) dataset and SPARQL language. To discover collaborative associations, an approach of transforming RDF named graphs into "context graph" is proposed. First, the definitions of the importance of the nodes and the weight assignment for the "context graph" are given. Secondly, the implementation of a spread activation algorithm based on "context graph" is proposed. An infrastructure is also built up in the collaborative context space (CCS) system to support context memory and knowledge discovery in a collaborative environment.
基金supported by the National Natural Science Foundation of China(Grant No.:71203164)
文摘Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/methodology/approach: A heterogeneous researcher network has been constructed by combining multiple relations of academic researchers. Vertex attributes and their similarities were considered and calculated. An approach has been proposed and tested to detect research community in research organizations based on this multi-relation researcher network.Findings: Detection of topologically well-connected, semantically coherent and meaningful research community was achieved.Research limitations: The sample size of evaluation experiments was relatively small. In the present study, a limited number of 72 researchers were analyzed for constructing researcher network and detecting research community. Therefore, a large sample size is required to give more information and reliable results.Practical implications: The proposed multi-relation researcher network and approaches for discovering research communities of similar research interests will contribute to collective innovation behavior such as brainstorming and to promote interdisciplinary cooperation.Originality/value: Recent researches on community detection devote most efforts to singlerelation researcher networks and put the main focus on the topological structure of networks.In reality, there exist multi-relation social networks. Vertex attribute also plays an important role in community detection. The present study combined multiple single-relational researcher networks into a multi-relational network and proposed a structure-attribute clustering method for detecting research community in research organizations.
基金supported by the National Natural Science Foundation of China (Nos.61070156 and 61100183)the Natural Science Foundation of Zhejiang Province,China (No.Y1110477)
文摘Linked data is a decentralized space of interlinked Resource Description Framework(RDF) graphs that are published,accessed,and manipulated by a multitude of Web agents.Here,we present a multi-agent framework for mining hypothetical semantic relations from linked data,in which the discovery,management,and validation of relations can be carried out independently by different agents.These agents collaborate in relation mining by publishing and exchanging inter-dependent knowledge elements,e.g.,hypotheses,evidence,and proofs,giving rise to an evidentiary network that connects and ranks diverse knowledge elements.Simulation results show that the framework is scalable in a multi-agent environment.Real-world applications show that the framework is suitable for interdisciplinary and collaborative relation discovery tasks in social domains.