Purpose:Given the information overload of scientific literature,there is an increasing need for computable biomedical knowledge buried in free text.This study aimed to develop a novel approach to extracting and measur...Purpose:Given the information overload of scientific literature,there is an increasing need for computable biomedical knowledge buried in free text.This study aimed to develop a novel approach to extracting and measuring uncertain biomedical knowledge from scientific statements.Design/methodology/approach:Taking cardiovascular research publications in China as a sample,we extracted subject-predicate-object triples(SPO triples)as knowledge units and unknown/hedging/conflicting uncertainties as the knowledge context.We introduced information entropy(IE)as potential metric to quantify the uncertainty of epistemic status of scientific knowledge represented at subject-object pairs(SO pairs)levels.Findings:The results indicated an extraordinary growth of cardiovascular publications in China while only a modest growth of the novel SPO triples.After evaluating the uncertainty of biomedical knowledge with IE,we identified the Top 10 SO pairs with highest IE,which implied the epistemic status pluralism.Visual presentation of the SO pairs overlaid with uncertainty provided a comprehensive overview of clusters of biomedical knowledge and contending topics in cardiovascular research.Research limitations:The current methods didn’t distinguish the specificity and probabilities of uncertainty cue words.The number of sentences surrounding a given triple may also influence the value of IE.Practical implications:Our approach identified major uncertain knowledge areas such as diagnostic biomarkers,genetic polymorphism and co-existing risk factors related to cardiovascular diseases in China.These areas are suggested to be prioritized;new hypotheses need to be verified,while disputes,conflicts,and contradictions need to be settled.Originality/value:We provided a novel approach by combining natural language processing and computational linguistics with informetric methods to extract and measure uncertain knowledge from scientific statements.展开更多
Uncertain Knowledge Graphs(UKGs)are used to characterize the inherent uncertainty of knowledge and have a richer semantic structure than deterministic knowledge graphs.The research on the embedding of UKG has only rec...Uncertain Knowledge Graphs(UKGs)are used to characterize the inherent uncertainty of knowledge and have a richer semantic structure than deterministic knowledge graphs.The research on the embedding of UKG has only recently begun,Uncertain Knowledge Graph Embedding(UKGE)model has a certain effect on solving this problem.However,there are still unresolved issues.On the one hand,when reasoning the confidence of unseen relation facts,the introduced probabilistic soft logic cannot be used to combine multi-path and multi-step global information,leading to information loss.On the other hand,the existing UKG embedding model can only model symmetric relation facts,but the embedding problem of asymmetric relation facts has not be addressed.To address the above issues,a Multiplex Uncertain Knowledge Graph Embedding(MUKGE)model is proposed in this paper.First,to combine multiple information and achieve more accurate results in confidence reasoning,the Uncertain ResourceRank(URR)reasoning algorithm is introduced.Second,the asymmetry in the UKG is defined.To embed asymmetric relation facts of UKG,a multi-relation embedding model is proposed.Finally,experiments are carried out on different datasets via 4 tasks to verify the effectiveness of MUKGE.The results of experiments demonstrate that MUKGE can obtain better overall performance than the baselines,and it helps advance the research on UKG embedding.展开更多
The lasting evolution of computing environment, software engineering and interaction methods leads to cloud computing. Cloud computing changes the configuration mode of resources on the Internet and all kinds of resou...The lasting evolution of computing environment, software engineering and interaction methods leads to cloud computing. Cloud computing changes the configuration mode of resources on the Internet and all kinds of resources are virtualized and provided as services. Mass participation and online interaction with social annotations become usual in human daily life. People who own similar interests on the Internet may cluster naturally into scalable and boundless communities and collective intelligence will emerge. Human is taken as an intelligent computing factor, and uncertainty becomes a basic property in cloud computing. Virtualization, soft computing and granular computing will become essential features of cloud computing. Compared with the engineering technological problems of IaaS (Infrastructure as a service), PaaS (Platform as a Service) and SaaS (Software as a Service), collective intelligence and uncertain knowledge representation will be more important frontiers in cloud computing for researchers within the community of intelligence science.展开更多
Online automatic fault diagnosis in industrial systems is essential for guaranteeing safe, reliable and efficient operations.However, difficulties associated with computational overload, ubiquitous uncertainties and i...Online automatic fault diagnosis in industrial systems is essential for guaranteeing safe, reliable and efficient operations.However, difficulties associated with computational overload, ubiquitous uncertainties and insufficient fault samples hamper the engineering application of intelligent fault diagnosis technology. Geared towards the settlement of these problems, this paper introduces the method of dynamic uncertain causality graph, which is a new attempt to model complex behaviors of real-world systems under uncertainties. The visual representation to causality pathways and self-relied "chaining" inference mechanisms are analyzed. In particular, some solutions are investigated for the diagnostic reasoning algorithm to aim at reducing its computational complexity and improving the robustness to potential losses and imprecisions in observations. To evaluate the effectiveness and performance of this method, experiments are conducted using both synthetic calculation cases and generator faults of a nuclear power plant. The results manifest the high diagnostic accuracy and efficiency, suggesting its practical significance in large-scale industrial applications.展开更多
基金funded by the National Natural Science Foundation of China(nos.71603280,72074006,and 82070235)the Beijing Municipal Natural Science Foundation(7191013)+1 种基金Research Unit of Medical Science Research Management/Basic and Clinical Research of Metabolic Cardiovascular Diseases,Chinese Academy of Medical Sciences(2021RU003)Peking University Health Science Center and the Young Elite Scientists Sponsorship Program by China Association for Science and Technology(2017QNRC001).
文摘Purpose:Given the information overload of scientific literature,there is an increasing need for computable biomedical knowledge buried in free text.This study aimed to develop a novel approach to extracting and measuring uncertain biomedical knowledge from scientific statements.Design/methodology/approach:Taking cardiovascular research publications in China as a sample,we extracted subject-predicate-object triples(SPO triples)as knowledge units and unknown/hedging/conflicting uncertainties as the knowledge context.We introduced information entropy(IE)as potential metric to quantify the uncertainty of epistemic status of scientific knowledge represented at subject-object pairs(SO pairs)levels.Findings:The results indicated an extraordinary growth of cardiovascular publications in China while only a modest growth of the novel SPO triples.After evaluating the uncertainty of biomedical knowledge with IE,we identified the Top 10 SO pairs with highest IE,which implied the epistemic status pluralism.Visual presentation of the SO pairs overlaid with uncertainty provided a comprehensive overview of clusters of biomedical knowledge and contending topics in cardiovascular research.Research limitations:The current methods didn’t distinguish the specificity and probabilities of uncertainty cue words.The number of sentences surrounding a given triple may also influence the value of IE.Practical implications:Our approach identified major uncertain knowledge areas such as diagnostic biomarkers,genetic polymorphism and co-existing risk factors related to cardiovascular diseases in China.These areas are suggested to be prioritized;new hypotheses need to be verified,while disputes,conflicts,and contradictions need to be settled.Originality/value:We provided a novel approach by combining natural language processing and computational linguistics with informetric methods to extract and measure uncertain knowledge from scientific statements.
基金the National Key Research and Development Program of China(Nos.2020YFC2003502,2021YFF0704101)the National Natural Science Foundation of China(Grant No.62276038)+1 种基金the Natural Science Foundation of Chongqing(Nos.cstc2019jcyj-cxttX0002,cstc2021ycjh-bgzxm0013)the Key Cooperation Project of Chongqing Municipal Education Commission(HZ20210-08).
文摘Uncertain Knowledge Graphs(UKGs)are used to characterize the inherent uncertainty of knowledge and have a richer semantic structure than deterministic knowledge graphs.The research on the embedding of UKG has only recently begun,Uncertain Knowledge Graph Embedding(UKGE)model has a certain effect on solving this problem.However,there are still unresolved issues.On the one hand,when reasoning the confidence of unseen relation facts,the introduced probabilistic soft logic cannot be used to combine multi-path and multi-step global information,leading to information loss.On the other hand,the existing UKG embedding model can only model symmetric relation facts,but the embedding problem of asymmetric relation facts has not be addressed.To address the above issues,a Multiplex Uncertain Knowledge Graph Embedding(MUKGE)model is proposed in this paper.First,to combine multiple information and achieve more accurate results in confidence reasoning,the Uncertain ResourceRank(URR)reasoning algorithm is introduced.Second,the asymmetry in the UKG is defined.To embed asymmetric relation facts of UKG,a multi-relation embedding model is proposed.Finally,experiments are carried out on different datasets via 4 tasks to verify the effectiveness of MUKGE.The results of experiments demonstrate that MUKGE can obtain better overall performance than the baselines,and it helps advance the research on UKG embedding.
基金supported by National Key Basic Research Program of China (973 Program) under Grant No.2007CB310804China Post-doctoral Science Foundation under Grants No.20090460107, 201003794
文摘The lasting evolution of computing environment, software engineering and interaction methods leads to cloud computing. Cloud computing changes the configuration mode of resources on the Internet and all kinds of resources are virtualized and provided as services. Mass participation and online interaction with social annotations become usual in human daily life. People who own similar interests on the Internet may cluster naturally into scalable and boundless communities and collective intelligence will emerge. Human is taken as an intelligent computing factor, and uncertainty becomes a basic property in cloud computing. Virtualization, soft computing and granular computing will become essential features of cloud computing. Compared with the engineering technological problems of IaaS (Infrastructure as a service), PaaS (Platform as a Service) and SaaS (Software as a Service), collective intelligence and uncertain knowledge representation will be more important frontiers in cloud computing for researchers within the community of intelligence science.
基金supported by the National Natural Science Foundation of China(Nos.61050005 and 61273330)Research Foundation for the Doctoral Program of China Ministry of Education(No.20120002110037)+1 种基金the 2014 Teaching Reform Project of Shandong Normal UniversityDevelopment Project of China Guangdong Nuclear Power Group(No.CNPRI-ST10P005)
文摘Online automatic fault diagnosis in industrial systems is essential for guaranteeing safe, reliable and efficient operations.However, difficulties associated with computational overload, ubiquitous uncertainties and insufficient fault samples hamper the engineering application of intelligent fault diagnosis technology. Geared towards the settlement of these problems, this paper introduces the method of dynamic uncertain causality graph, which is a new attempt to model complex behaviors of real-world systems under uncertainties. The visual representation to causality pathways and self-relied "chaining" inference mechanisms are analyzed. In particular, some solutions are investigated for the diagnostic reasoning algorithm to aim at reducing its computational complexity and improving the robustness to potential losses and imprecisions in observations. To evaluate the effectiveness and performance of this method, experiments are conducted using both synthetic calculation cases and generator faults of a nuclear power plant. The results manifest the high diagnostic accuracy and efficiency, suggesting its practical significance in large-scale industrial applications.