Funding: This work is supported in part by the National Natural Science Foundation of China under Grant Nos. 61332006, U1401256, 61328202, and 61173029; the Fundamental Research Funds for the Central Universities of China under Grant No. N130504006; the Hong Kong RGC Project under Grant No. N_HKUST637/13; the National Basic Research 973 Program of China under Grant No. 2014CB340300; a Microsoft Research Asia Gift Grant; and a Google Faculty Award 2013.
Abstract: With the popularity of uncertain data, queries over uncertain graphs have become a hot topic in the database community. As one of the important queries, the shortest path query over an uncertain graph has attracted much attention from researchers due to its wide applications. Although there are some efficient solutions to this problem, all existing models ignore an important property of uncertain graphs: the correlation among the edges sharing the same vertex. In this paper, we apply a Markov network to model the hidden correlation in uncertain graphs and compute the shortest path. Unfortunately, calculating the shortest path and its corresponding probability over uncertain graphs modeled by Markov networks is a #P-hard problem. Thus, we propose a filtering-and-verification framework to accelerate the queries. In the filtering phase, we design a probabilistic shortest path index based on vertex cuts and blocks of a graph. We derive a series of upper bounds and prune the vertices and edges whose upper bounds on the shortest path probability are lower than the threshold. By carefully selecting the blocks and vertex cuts, the index is optimized for maximum pruning capability, so that we can filter out a large number of vertices that make no contribution to the final shortest path query results. In the verification phase, we develop an efficient sampling algorithm to determine the final query answers. Finally, we verify the efficiency and effectiveness of our solutions with extensive experiments.
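The sampling idea in the verification phase can be illustrated with a small Monte Carlo sketch. This is not the paper's algorithm: it assumes each edge exists independently with a given probability (the paper instead models correlated edges with a Markov network), and the function names and the `(u, v, weight, probability)` edge format are hypothetical choices made for this illustration.

```python
import heapq
import random
from collections import Counter


def dijkstra_path(adj, source, target):
    """Return the shortest source-target path as a tuple of vertices, or None."""
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == target:
            break
        for v, w in adj.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in done:
        return None
    # Walk the predecessor links back from the target.
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return tuple(reversed(path))


def sample_world(edges, rng):
    """Draw one possible world. ASSUMPTION: edges are independent."""
    adj = {}
    for u, v, w, p in edges:
        if rng.random() < p:
            adj.setdefault(u, []).append((v, w))
            adj.setdefault(v, []).append((u, w))
    return adj


def estimate_shortest_path_probabilities(edges, source, target,
                                         n_samples=2000, seed=0):
    """Estimate, for each path, the probability that it is the shortest path."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_samples):
        path = dijkstra_path(sample_world(edges, rng), source, target)
        if path is not None:
            counts[path] += 1
    return {path: c / n_samples for path, c in counts.items()}


if __name__ == "__main__":
    # Hypothetical toy graph: (u, v, weight, existence probability).
    toy_edges = [("s", "a", 1.0, 0.5), ("a", "t", 1.0, 1.0), ("s", "t", 5.0, 1.0)]
    for path, prob in estimate_shortest_path_probabilities(toy_edges, "s", "t").items():
        print(path, round(prob, 3))
```

Paths whose estimated probability falls below a query threshold would then be discarded. A filtering phase such as the one described in the abstract aims to shrink the graph before this comparatively expensive sampling step runs.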
Abstract: Fault diagnostics is important for the safe operation of nuclear power plants (NPPs). In recent years, data-driven approaches have been proposed and implemented to tackle the problem, e.g., neural networks, fuzzy and neurofuzzy approaches, support vector machines, K-nearest neighbor classifiers and inference methodologies. Among these methods, the dynamic uncertain causality graph (DUCG) has proved effective in many practical cases. However, the causal graph construction behind the DUCG is complicated and, in many cases, is redundant in the symptoms needed to correctly classify the fault. In this paper, we propose a method to simplify causal graph construction in an automatic way. The method consists of transforming the expert knowledge-based DUCG into a fuzzy decision tree (FDT) by extracting from the DUCG a fuzzy rule base that summarizes the symptoms used as the basis of the FDT. A genetic algorithm (GA) is then used to optimize the FDT by performing a wrapper search around the FDT: the set of symptoms selected during the iterative search is taken as the best set of symptoms for diagnosing the faults that can occur in the system. The effectiveness of the approach is shown with respect to a DUCG model initially built to diagnose 23 faults, originally using 262 symptoms, of Unit-1 in the Ningde NPP of the China Guangdong Nuclear Power Corporation. The results show that the FDT, with GA-optimized symptoms and diagnosis strategy, can drive the construction of the DUCG and lower the computational burden without loss of accuracy in diagnosis.
Funding: the National Key Research and Development Program of China (Nos. 2020YFC2003502, 2021YFF0704101); the National Natural Science Foundation of China (Grant No. 62276038); the Natural Science Foundation of Chongqing (Nos. cstc2019jcyj-cxttX0002, cstc2021ycjh-bgzxm0013); the Key Cooperation Project of Chongqing Municipal Education Commission (HZ20210-08).
Abstract: Uncertain Knowledge Graphs (UKGs) are used to characterize the inherent uncertainty of knowledge and have a richer semantic structure than deterministic knowledge graphs. Research on the embedding of UKGs has only recently begun, and the Uncertain Knowledge Graph Embedding (UKGE) model goes some way toward solving this problem. However, there are still unresolved issues. On the one hand, when reasoning about the confidence of unseen relation facts, the introduced probabilistic soft logic cannot be used to combine multi-path and multi-step global information, leading to information loss. On the other hand, the existing UKG embedding model can only model symmetric relation facts, and the embedding problem of asymmetric relation facts has not been addressed. To address the above issues, a Multiplex Uncertain Knowledge Graph Embedding (MUKGE) model is proposed in this paper. First, to combine multiple sources of information and achieve more accurate results in confidence reasoning, the Uncertain ResourceRank (URR) reasoning algorithm is introduced. Second, the asymmetry in the UKG is defined. To embed asymmetric relation facts of the UKG, a multi-relation embedding model is proposed. Finally, experiments are carried out on different datasets via 4 tasks to verify the effectiveness of MUKGE. The results of the experiments demonstrate that MUKGE obtains better overall performance than the baselines and helps advance research on UKG embedding.
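The symmetry limitation mentioned above can be made concrete with a short sketch. It is illustrative only and does not reproduce UKGE or MUKGE: it assumes a DistMult-style trilinear score as a stand-in for a symmetric scoring function, a ComplEx-style score as one well-known asymmetric alternative, and a logistic mapping from score to confidence. All function names and toy vectors are hypothetical.

```python
import math


def distmult_score(h, r, t):
    """DistMult-style score: sum_i h_i * r_i * t_i (symmetric in h and t)."""
    return sum(hi * ri * ti for hi, ri, ti in zip(h, r, t))


def complex_score(h, r, t):
    """ComplEx-style score: Re(sum_i h_i * r_i * conj(t_i)) (not symmetric)."""
    return sum((hi * ri * ti.conjugate()).real for hi, ri, ti in zip(h, r, t))


def confidence(score):
    """ASSUMPTION: map a raw score to a confidence in (0, 1) with a logistic."""
    return 1.0 / (1.0 + math.exp(-score))


if __name__ == "__main__":
    # Real-valued toy embeddings: swapping head and tail leaves the score unchanged.
    h, r, t = [0.5, -1.0], [2.0, 0.3], [1.5, 0.7]
    print(confidence(distmult_score(h, r, t)), confidence(distmult_score(t, r, h)))

    # Complex-valued toy embeddings: swapping head and tail changes the score.
    hc, rc, tc = [1 + 2j], [0.5 + 1j], [2 - 1j]
    print(confidence(complex_score(hc, rc, tc)), confidence(complex_score(tc, rc, hc)))
```

Because the real-valued trilinear product is unchanged when head and tail are swapped, such a model necessarily assigns the same confidence to (h, r, t) and (t, r, h), which is the kind of limitation that an asymmetric scoring function avoids.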