In this paper,to deal with the heterogeneity in federated learning(FL)systems,a knowledge distillation(KD)driven training framework for FL is proposed,where each user can select its neural network model on demand and ...In this paper,to deal with the heterogeneity in federated learning(FL)systems,a knowledge distillation(KD)driven training framework for FL is proposed,where each user can select its neural network model on demand and distill knowledge from a big teacher model using its own private dataset.To overcome the challenge of train the big teacher model in resource limited user devices,the digital twin(DT)is exploit in the way that the teacher model can be trained at DT located in the server with enough computing resources.Then,during model distillation,each user can update the parameters of its model at either the physical entity or the digital agent.The joint problem of model selection and training offloading and resource allocation for users is formulated as a mixed integer programming(MIP)problem.To solve the problem,Q-learning and optimization are jointly used,where Q-learning selects models for users and determines whether to train locally or on the server,and optimization is used to allocate resources for users based on the output of Q-learning.Simulation results show the proposed DT-assisted KD framework and joint optimization method can significantly improve the average accuracy of users while reducing the total delay.展开更多
In order to improve the efficiency of ontology construction from heterogeneous knowledge sources, a semantic-based approach is presented. The ontology will be constructed with the application of cluster technique in a...In order to improve the efficiency of ontology construction from heterogeneous knowledge sources, a semantic-based approach is presented. The ontology will be constructed with the application of cluster technique in an incremental way. Firstly, terms will be extracted from knowledge sources and congregate a term set after pretreat-ment. Then the concept set will be built via semantic-based clustering according to semanteme of terms provided by WordNet. Next, a concept tree is constructed in terms of mapping rules between semant^me relationships and concept relationships. The semi-automatic approach can avoid non-consistence due to knowledge engineers having different understanding of the same concept and the obtained ontology is easily to be expanded.展开更多
Using a sample of 252 R & D teams in Guangzhou, Foshan, Shenzhen, the researcher empirically examines the relationship between knowledge heterogeneity and knowledge innovation performance, the mediating role of kn...Using a sample of 252 R & D teams in Guangzhou, Foshan, Shenzhen, the researcher empirically examines the relationship between knowledge heterogeneity and knowledge innovation performance, the mediating role of knowledge share. Results indicate that knowledge heterogeneity is positively related to knowledge share, the same with knowledge share and knowledge innovation performance. This paper analyzes the results comprehensively and makes recommendations from multiple perspectives including building the knowledge heterogeneous steams, advocating the collaborative spirit, building a knowledge shared platform, improving the organizational structure, and grooming the communication.展开更多
To break through the restrictions of traditional organizational forms,systems,and mechanisms and quickly respond to the innovative development requirements of CASC,the innovation team has gradually become a crucial or...To break through the restrictions of traditional organizational forms,systems,and mechanisms and quickly respond to the innovative development requirements of CASC,the innovation team has gradually become a crucial organizational form within CASC.One of the biggest differences between the innovation team and traditional orga-nizational structure lies in knowledge heterogeneity.Existing studies present different conclusions on the relationship between knowledge heterogeneity and innovation performance,which should be analyzed according to specific situ-ations.Therefore,this paper takes the innovation team of CASC as the research object to conduct an empirical study on 186 team members,propose conceptual models and hypotheses,and study the relationship among knowledge heterogeneity,knowledge sharing,and innovation performance.The research results indicate that the two dimensions of knowledge heterogeneity—explicit knowledge heterogeneity and implicit knowledge heterogeneity—are beneficial to innovation performance when they are to a great extent.Knowledge sharing plays a partially mediating role between knowledge heterogeneity and collaborative innovation performance.It reveals the influence of knowledge heterogene-ity on innovation performance in the innovation team of CASC,aiming to provide a certain reference for the establish-ment and development of CASC’s innovation team.展开更多
Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay ...Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network has been used to resolve the "curse of dimensionality" issue of the Q-table in reinforcement learning. When a robot is walking in an unknown environment, it collects experience data which is used for training a neural network;such a process is called experience replay.Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the neural network. The simulation results show that in comparison with the existing methods, our method can converge to an optimal action strategy with less time and can explore a path in an unknown environment with fewer steps and larger average reward.展开更多
Computational techniques have been adopted in medi-cal and biological systems for a long time. There is no doubt that the development and application of computational methods will render great help in better understan...Computational techniques have been adopted in medi-cal and biological systems for a long time. There is no doubt that the development and application of computational methods will render great help in better understanding biomedical and biological functions. Large amounts of datasets have been produced by biomedical and biological experiments and simulations. In order for researchers to gain knowledge from origi- nal data, nontrivial transformation is necessary, which is regarded as a critical link in the chain of knowledge acquisition, sharing, and reuse. Challenges that have been encountered include: how to efficiently and effectively represent human knowledge in formal computing models, how to take advantage of semantic text mining techniques rather than traditional syntactic text mining, and how to handle security issues during the knowledge sharing and reuse. This paper summarizes the state-of-the-art in these research directions. We aim to provide readers with an introduction of major computing themes to be applied to the medical and biological research.展开更多
Knowledge graph(KG)fact prediction aims to complete a KG by determining the truthfulness of predicted triples.Reinforcement learning(RL)-based approaches have been widely used for fact prediction.However,the existing ...Knowledge graph(KG)fact prediction aims to complete a KG by determining the truthfulness of predicted triples.Reinforcement learning(RL)-based approaches have been widely used for fact prediction.However,the existing approaches largely suffer from unreliable calculations on rule confidences owing to a limited number of obtained reasoning paths,thereby resulting in unreliable decisions on prediction triples.Hence,we propose a new RL-based approach named EvoPath in this study.EvoPath features a new reward mechanism based on entity heterogeneity,facilitating an agent to obtain effective reasoning paths during random walks.EvoPath also incorporates a new postwalking mechanism to leverage easily overlooked but valuable reasoning paths during RL.Both mechanisms provide sufficient reasoning paths to facilitate the reliable calculations of rule confidences,enabling EvoPath to make precise judgments about the truthfulness of prediction triples.Experiments demonstrate that EvoPath can achieve more accurate fact predictions than existing approaches.展开更多
The drastic growth of coastal observation sensors results in copious data that provide weather information.The intricacies in sensor-generated big data are heterogeneity and interpretation,driving high-end Information...The drastic growth of coastal observation sensors results in copious data that provide weather information.The intricacies in sensor-generated big data are heterogeneity and interpretation,driving high-end Information Retrieval(IR)systems.The Semantic Web(SW)can solve this issue by integrating data into a single platform for information exchange and knowledge retrieval.This paper focuses on exploiting the SWbase systemto provide interoperability through ontologies by combining the data concepts with ontology classes.This paper presents a 4-phase weather data model:data processing,ontology creation,SW processing,and query engine.The developed Oceanographic Weather Ontology helps to enhance data analysis,discovery,IR,and decision making.In addition to that,it also evaluates the developed ontology with other state-of-the-art ontologies.The proposed ontology’s quality has improved by 39.28%in terms of completeness,and structural complexity has decreased by 45.29%,11%and 37.7%in Precision and Accuracy.Indian Meteorological Satellite INSAT-3D’s ocean data is a typical example of testing the proposed model.The experimental result shows the effectiveness of the proposed data model and its advantages in machine understanding and IR.展开更多
The most important problem in the security of wireless sensor network (WSN) is to distribute keys for the sensor nodes and to establish a secure channel in an insecure environment. Since the sensor node has limited re...The most important problem in the security of wireless sensor network (WSN) is to distribute keys for the sensor nodes and to establish a secure channel in an insecure environment. Since the sensor node has limited resources, for instance, low battery life and low computational power, the key distribution scheme must be designed in an efficient manner. Recently many studies added a few high-level nodes into the network, called the heterogeneous sensor network (HSN). Most of these studies considered an application for two-level HSN instead of multi-level one. In this paper, we propose some definitions for multi-level HSN, and design a novel key management strategy based on the polynomial hash tree (PHT) method by using deployment knowledge. Our proposed strategy has lower computation and communication overheads but higher connectivity and resilience.展开更多
Entity Linking(EL)aims to automatically link the mentions in unstructured documents to corresponding entities in a knowledge base(KB),which has recently been dominated by global models.Although many global EL methods ...Entity Linking(EL)aims to automatically link the mentions in unstructured documents to corresponding entities in a knowledge base(KB),which has recently been dominated by global models.Although many global EL methods attempt to model the topical coherence among all linked entities,most of them failed in exploiting the correlations among manifold knowledge helpful for linking,such as the semantics of mentions and their candidates,the neighborhood information of candidate entities in KB and the fine-grained type information of entities.As we will show in the paper,interactions among these types of information are very useful for better characterizing the topic features of entities and more accurately estimating the topical coherence among all the referred entities within the same document.In this paper,we present a novel HEterogeneous Graph-based Entity Linker(HEGEL)for global entity linking,which builds an informative heterogeneous graph for every document to collect various linking clues.Then HEGEL utilizes a novel heterogeneous graph neural network(HGNN)to integrate the different types of manifold information and model the interactions among them.Experiments on the standard benchmark datasets demonstrate that HEGEL can well capture the global coherence and outperforms the prior state-of-the-art EL methods.展开更多
Entity set expansion(ESE)aims to expand an entity seed set to obtain more entities which have common properties.ESE is important for many applications such as dictionary con-struction and query suggestion.Traditional ...Entity set expansion(ESE)aims to expand an entity seed set to obtain more entities which have common properties.ESE is important for many applications such as dictionary con-struction and query suggestion.Traditional ESE methods relied heavily on the text and Web information of entities.Recently,some ESE methods employed knowledge graphs(KGs)to extend entities.However,they failed to effectively and fficiently utilize the rich semantics contained in a KG and ignored the text information of entities in Wikipedia.In this paper,we model a KG as a heterogeneous information network(HIN)containing multiple types of objects and relations.Fine-grained multi-type meta paths are proposed to capture the hidden relation among seed entities in a KG and thus to retrieve candidate entities.Then we rank the entities according to the meta path based structural similarity.Furthermore,to utilize the text description of entities in Wikipedia,we propose an extended model CoMeSE++which combines both structural information revealed by a KG and text information in Wikipedia for ESE.Extensive experiments on real-world datasets demonstrate that our model achieves better performance by combining structural and textual information of entities.展开更多
知识追踪任务旨在通过建模学生历史学习序列追踪学生认知水平,进而预测学生未来的答题表现.该文提出一个融合异构图神经网络的时间卷积知识追踪模型(Temporal Convolutional Knowledge Tracing Model with Heterogeneous Graph Neural N...知识追踪任务旨在通过建模学生历史学习序列追踪学生认知水平,进而预测学生未来的答题表现.该文提出一个融合异构图神经网络的时间卷积知识追踪模型(Temporal Convolutional Knowledge Tracing Model with Heterogeneous Graph Neural Network,HG-TCKT),将知识追踪任务重述为基于异构图神经网络的时序边分类问题.具体来说,首先将学习记录构建成包含3种节点类型(学生,习题和技能),2种边类型(学生-习题和习题-技能)的异构图数据,异构图描述了学生交互记录中实体类型之间的丰富关系,使用异构图神经网络缓解交互稀疏的问题,引入异构互注意力机制捕捉不同类型节点间的交互关系,提取不同类型节点的高阶特征.将学生节点和习题节点表征拼接,构造边(学生-习题)的表征.最后,使用时间卷积网络捕捉学生历史交互序列的时序依赖关系从而进行预测.在2个真实教育数据集进行实验证明,HG-TCKT相比当前主流知识追踪方法有更好的预测效果.展开更多
基金supported by the National Key Research and Development Program of China (2020YFB1807700)the National Natural Science Foundation of China (NSFC)under Grant No.62071356the Chongqing Key Laboratory of Mobile Communications Technology under Grant cqupt-mct202202。
文摘In this paper,to deal with the heterogeneity in federated learning(FL)systems,a knowledge distillation(KD)driven training framework for FL is proposed,where each user can select its neural network model on demand and distill knowledge from a big teacher model using its own private dataset.To overcome the challenge of train the big teacher model in resource limited user devices,the digital twin(DT)is exploit in the way that the teacher model can be trained at DT located in the server with enough computing resources.Then,during model distillation,each user can update the parameters of its model at either the physical entity or the digital agent.The joint problem of model selection and training offloading and resource allocation for users is formulated as a mixed integer programming(MIP)problem.To solve the problem,Q-learning and optimization are jointly used,where Q-learning selects models for users and determines whether to train locally or on the server,and optimization is used to allocate resources for users based on the output of Q-learning.Simulation results show the proposed DT-assisted KD framework and joint optimization method can significantly improve the average accuracy of users while reducing the total delay.
文摘In order to improve the efficiency of ontology construction from heterogeneous knowledge sources, a semantic-based approach is presented. The ontology will be constructed with the application of cluster technique in an incremental way. Firstly, terms will be extracted from knowledge sources and congregate a term set after pretreat-ment. Then the concept set will be built via semantic-based clustering according to semanteme of terms provided by WordNet. Next, a concept tree is constructed in terms of mapping rules between semant^me relationships and concept relationships. The semi-automatic approach can avoid non-consistence due to knowledge engineers having different understanding of the same concept and the obtained ontology is easily to be expanded.
文摘Using a sample of 252 R & D teams in Guangzhou, Foshan, Shenzhen, the researcher empirically examines the relationship between knowledge heterogeneity and knowledge innovation performance, the mediating role of knowledge share. Results indicate that knowledge heterogeneity is positively related to knowledge share, the same with knowledge share and knowledge innovation performance. This paper analyzes the results comprehensively and makes recommendations from multiple perspectives including building the knowledge heterogeneous steams, advocating the collaborative spirit, building a knowledge shared platform, improving the organizational structure, and grooming the communication.
文摘To break through the restrictions of traditional organizational forms,systems,and mechanisms and quickly respond to the innovative development requirements of CASC,the innovation team has gradually become a crucial organizational form within CASC.One of the biggest differences between the innovation team and traditional orga-nizational structure lies in knowledge heterogeneity.Existing studies present different conclusions on the relationship between knowledge heterogeneity and innovation performance,which should be analyzed according to specific situ-ations.Therefore,this paper takes the innovation team of CASC as the research object to conduct an empirical study on 186 team members,propose conceptual models and hypotheses,and study the relationship among knowledge heterogeneity,knowledge sharing,and innovation performance.The research results indicate that the two dimensions of knowledge heterogeneity—explicit knowledge heterogeneity and implicit knowledge heterogeneity—are beneficial to innovation performance when they are to a great extent.Knowledge sharing plays a partially mediating role between knowledge heterogeneity and collaborative innovation performance.It reveals the influence of knowledge heterogene-ity on innovation performance in the innovation team of CASC,aiming to provide a certain reference for the establish-ment and development of CASC’s innovation team.
基金supported by the National Natural Science Foundation of China(61751210,61572441)。
文摘Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network has been used to resolve the "curse of dimensionality" issue of the Q-table in reinforcement learning. When a robot is walking in an unknown environment, it collects experience data which is used for training a neural network;such a process is called experience replay.Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the neural network. The simulation results show that in comparison with the existing methods, our method can converge to an optimal action strategy with less time and can explore a path in an unknown environment with fewer steps and larger average reward.
文摘Computational techniques have been adopted in medi-cal and biological systems for a long time. There is no doubt that the development and application of computational methods will render great help in better understanding biomedical and biological functions. Large amounts of datasets have been produced by biomedical and biological experiments and simulations. In order for researchers to gain knowledge from origi- nal data, nontrivial transformation is necessary, which is regarded as a critical link in the chain of knowledge acquisition, sharing, and reuse. Challenges that have been encountered include: how to efficiently and effectively represent human knowledge in formal computing models, how to take advantage of semantic text mining techniques rather than traditional syntactic text mining, and how to handle security issues during the knowledge sharing and reuse. This paper summarizes the state-of-the-art in these research directions. We aim to provide readers with an introduction of major computing themes to be applied to the medical and biological research.
基金the National Natural Science Foundation of China,Nos.62272480 and 62072470and the National Science Foundation of Hunan Province,Nos.2021JJ30881 and 2020JJ4758.
文摘Knowledge graph(KG)fact prediction aims to complete a KG by determining the truthfulness of predicted triples.Reinforcement learning(RL)-based approaches have been widely used for fact prediction.However,the existing approaches largely suffer from unreliable calculations on rule confidences owing to a limited number of obtained reasoning paths,thereby resulting in unreliable decisions on prediction triples.Hence,we propose a new RL-based approach named EvoPath in this study.EvoPath features a new reward mechanism based on entity heterogeneity,facilitating an agent to obtain effective reasoning paths during random walks.EvoPath also incorporates a new postwalking mechanism to leverage easily overlooked but valuable reasoning paths during RL.Both mechanisms provide sufficient reasoning paths to facilitate the reliable calculations of rule confidences,enabling EvoPath to make precise judgments about the truthfulness of prediction triples.Experiments demonstrate that EvoPath can achieve more accurate fact predictions than existing approaches.
基金This work is financially supported by the Ministry of Earth Science(MoES),Government of India,(Grant.No.MoES/36/OOIS/Extra/45/2015),URL:https://www.moes.gov.in。
文摘The drastic growth of coastal observation sensors results in copious data that provide weather information.The intricacies in sensor-generated big data are heterogeneity and interpretation,driving high-end Information Retrieval(IR)systems.The Semantic Web(SW)can solve this issue by integrating data into a single platform for information exchange and knowledge retrieval.This paper focuses on exploiting the SWbase systemto provide interoperability through ontologies by combining the data concepts with ontology classes.This paper presents a 4-phase weather data model:data processing,ontology creation,SW processing,and query engine.The developed Oceanographic Weather Ontology helps to enhance data analysis,discovery,IR,and decision making.In addition to that,it also evaluates the developed ontology with other state-of-the-art ontologies.The proposed ontology’s quality has improved by 39.28%in terms of completeness,and structural complexity has decreased by 45.29%,11%and 37.7%in Precision and Accuracy.Indian Meteorological Satellite INSAT-3D’s ocean data is a typical example of testing the proposed model.The experimental result shows the effectiveness of the proposed data model and its advantages in machine understanding and IR.
文摘The most important problem in the security of wireless sensor network (WSN) is to distribute keys for the sensor nodes and to establish a secure channel in an insecure environment. Since the sensor node has limited resources, for instance, low battery life and low computational power, the key distribution scheme must be designed in an efficient manner. Recently many studies added a few high-level nodes into the network, called the heterogeneous sensor network (HSN). Most of these studies considered an application for two-level HSN instead of multi-level one. In this paper, we propose some definitions for multi-level HSN, and design a novel key management strategy based on the polynomial hash tree (PHT) method by using deployment knowledge. Our proposed strategy has lower computation and communication overheads but higher connectivity and resilience.
基金supported in part by the National Key R&D Program of China(No.2020AAA0106600)the Key Laboratory of Science,Technology and Standard in Press Industry(Key Laboratory of Intelligent Press Media Technology)
文摘Entity Linking(EL)aims to automatically link the mentions in unstructured documents to corresponding entities in a knowledge base(KB),which has recently been dominated by global models.Although many global EL methods attempt to model the topical coherence among all linked entities,most of them failed in exploiting the correlations among manifold knowledge helpful for linking,such as the semantics of mentions and their candidates,the neighborhood information of candidate entities in KB and the fine-grained type information of entities.As we will show in the paper,interactions among these types of information are very useful for better characterizing the topic features of entities and more accurately estimating the topical coherence among all the referred entities within the same document.In this paper,we present a novel HEterogeneous Graph-based Entity Linker(HEGEL)for global entity linking,which builds an informative heterogeneous graph for every document to collect various linking clues.Then HEGEL utilizes a novel heterogeneous graph neural network(HGNN)to integrate the different types of manifold information and model the interactions among them.Experiments on the standard benchmark datasets demonstrate that HEGEL can well capture the global coherence and outperforms the prior state-of-the-art EL methods.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.61806020,61772082,61972047,61702296)the National Key Research and Development Program of China(2017YFB0803304)+1 种基金the Beijing Municipal Natural Science Foundation(4182043)the CCF-Tencent Open Fund,and the Fundamental Research Funds for the Central Universities.
文摘Entity set expansion(ESE)aims to expand an entity seed set to obtain more entities which have common properties.ESE is important for many applications such as dictionary con-struction and query suggestion.Traditional ESE methods relied heavily on the text and Web information of entities.Recently,some ESE methods employed knowledge graphs(KGs)to extend entities.However,they failed to effectively and fficiently utilize the rich semantics contained in a KG and ignored the text information of entities in Wikipedia.In this paper,we model a KG as a heterogeneous information network(HIN)containing multiple types of objects and relations.Fine-grained multi-type meta paths are proposed to capture the hidden relation among seed entities in a KG and thus to retrieve candidate entities.Then we rank the entities according to the meta path based structural similarity.Furthermore,to utilize the text description of entities in Wikipedia,we propose an extended model CoMeSE++which combines both structural information revealed by a KG and text information in Wikipedia for ESE.Extensive experiments on real-world datasets demonstrate that our model achieves better performance by combining structural and textual information of entities.
文摘知识追踪任务旨在通过建模学生历史学习序列追踪学生认知水平,进而预测学生未来的答题表现.该文提出一个融合异构图神经网络的时间卷积知识追踪模型(Temporal Convolutional Knowledge Tracing Model with Heterogeneous Graph Neural Network,HG-TCKT),将知识追踪任务重述为基于异构图神经网络的时序边分类问题.具体来说,首先将学习记录构建成包含3种节点类型(学生,习题和技能),2种边类型(学生-习题和习题-技能)的异构图数据,异构图描述了学生交互记录中实体类型之间的丰富关系,使用异构图神经网络缓解交互稀疏的问题,引入异构互注意力机制捕捉不同类型节点间的交互关系,提取不同类型节点的高阶特征.将学生节点和习题节点表征拼接,构造边(学生-习题)的表征.最后,使用时间卷积网络捕捉学生历史交互序列的时序依赖关系从而进行预测.在2个真实教育数据集进行实验证明,HG-TCKT相比当前主流知识追踪方法有更好的预测效果.