Prediction intervals(PIs)for industrial time series can provide useful guidance for workers.Given that the failure of industrial sensors may cause the missing point in inputs,the existing kernel dynamic Bayesian netwo...Prediction intervals(PIs)for industrial time series can provide useful guidance for workers.Given that the failure of industrial sensors may cause the missing point in inputs,the existing kernel dynamic Bayesian networks(KDBN),serving as an effective method for PIs construction,suffer from high computational load using the stochastic algorithm for inference.This study proposes a variational inference method for the KDBN for the purpose of fast inference,which avoids the timeconsuming stochastic sampling.The proposed algorithm contains two stages.The first stage involves the inference of the missing inputs by using a local linearization based variational inference,and based on the computed posterior distributions over the missing inputs the second stage sees a Gaussian approximation for probability over the nodes in future time slices.To verify the effectiveness of the proposed method,a synthetic dataset and a practical dataset of generation flow of blast furnace gas(BFG)are employed with different ratios of missing inputs.The experimental results indicate that the proposed method can provide reliable PIs for the generation flow of BFG and it exhibits shorter computing time than the stochastic based one.展开更多
Recently,deep learning-based semantic communication has garnered widespread attention,with numerous systems designed for transmitting diverse data sources,including text,image,and speech,etc.While efforts have been di...Recently,deep learning-based semantic communication has garnered widespread attention,with numerous systems designed for transmitting diverse data sources,including text,image,and speech,etc.While efforts have been directed toward improving system performance,many studies have concentrated on enhancing the structure of the encoder and decoder.However,this often overlooks the resulting increase in model complexity,imposing additional storage and computational burdens on smart devices.Furthermore,existing work tends to prioritize explicit semantics,neglecting the potential of implicit semantics.This paper aims to easily and effectively enhance the receiver's decoding capability without modifying the encoder and decoder structures.We propose a novel semantic communication system with variational neural inference for text transmission.Specifically,we introduce a simple but effective variational neural inferer at the receiver to infer the latent semantic information within the received text.This information is then utilized to assist in the decoding process.The simulation results show a significant enhancement in system performance and improved robustness.展开更多
Topic modeling is a mainstream and effective technology to deal with text data, with wide applications in text analysis, natural language, personalized recommendation, computer vision, etc. Among all the known topic m...Topic modeling is a mainstream and effective technology to deal with text data, with wide applications in text analysis, natural language, personalized recommendation, computer vision, etc. Among all the known topic models, supervised Latent Dirichlet Allocation (sLDA) is acknowledged as a popular and competitive supervised topic model. How- ever, the gradual increase of the scale of datasets makes sLDA more and more inefficient and time-consuming, and limits its applications in a very narrow range. To solve it, a parallel online sLDA, named PO-sLDA (Parallel and Online sLDA), is proposed in this study. It uses the stochastic variational inference as the learning method to make the training procedure more rapid and efficient, and a parallel computing mechanism implemented via the MapReduce framework is proposed to promote the capacity of cloud computing and big data processing. The online training capacity supported by PO-sLDA expands the application scope of this approach, making it instrumental for real-life applications with high real-time demand. The validation using two datasets with different sizes shows that the proposed approach has the comparative accuracy as the sLDA and can efficiently accelerate the training procedure. Moreover, its good convergence and online training capacity make it lucrative for the large-scale text data analyzing and processing.展开更多
Stochastic variational inference is an efficient Bayesian inference technology for massive datasets,which approximates posteriors by using noisy gradient estimates.Traditional stochastic variational inference can only...Stochastic variational inference is an efficient Bayesian inference technology for massive datasets,which approximates posteriors by using noisy gradient estimates.Traditional stochastic variational inference can only be performed in a centralized manner,which limits its applications in a wide range of situations where data is possessed by multiple nodes.Therefore,this paper develops a novel trust-region based stochastic variational inference algorithm for a general class of conjugate-exponential models over distributed and asynchronous networks,where the global parameters are diffused over the network by using the Metropolis rule and the local parameters are updated by using the trust-region method.Besides,a simple rule is introduced to balance the transmission frequencies between neighboring nodes such that the proposed distributed algorithm can be performed in an asynchronous manner.The utility of the proposed algorithm is tested by fitting the Bernoulli model and the Gaussian model to different datasets on a synthetic network,and experimental results demonstrate its effectiveness and advantages over existing works.展开更多
Stochastic variational inference (SVI) can learn topic models with very big corpora. It optimizes the variational objective by using the stochastic natural gradient algorithm with a decreasing learning rate. This ra...Stochastic variational inference (SVI) can learn topic models with very big corpora. It optimizes the variational objective by using the stochastic natural gradient algorithm with a decreasing learning rate. This rate is crucial for SVI; however, it is often tuned by hand in real applications. To address this, we develop a novel algorithm, which tunes the learning rate of each iteration adaptively. The proposed algorithm uses the Kullback-Leibler (KL) divergence to measure the similarity between the variational distribution with noisy update and that with batch update, and then optimizes the learning rates by minimizing the KL divergence. We apply our algorithm to two representative topic models: latent Dirichlet allocation and hierarchical Dirichlet process. Experimental results indicate that our algorithm performs better and converges faster than commonly used learning rates.展开更多
This paper is focused on the state estimation problem for nonlinear systems with unknown statistics of measurement noise.Based on the cubature Kalman filter,we propose a new nonlinear filtering algorithm that employs ...This paper is focused on the state estimation problem for nonlinear systems with unknown statistics of measurement noise.Based on the cubature Kalman filter,we propose a new nonlinear filtering algorithm that employs a skew t distribution to characterize the asymmetry of the measurement noise.The system states and the statistics of skew t noise distribution,including the shape matrix,the scale matrix,and the degree of freedom(DOF)are estimated jointly by employing variational Bayesian(VB)inference.The proposed method is validated in a target tracking example.Results of the simulation indicate that the proposed nonlinear filter can perform satisfactorily in the presence of unknown statistics of measurement noise and outperform than the existing state-of-the-art nonlinear filters.展开更多
Efficient estimation of line spectral from quantized samples is of significant importance in information theory and signal processing,e.g.,channel estimation in energy efficient massive MIMO systems and direction of a...Efficient estimation of line spectral from quantized samples is of significant importance in information theory and signal processing,e.g.,channel estimation in energy efficient massive MIMO systems and direction of arrival estimation.The goal of this paper is to recover the line spectral as well as its corresponding parameters including the model order,frequencies and amplitudes from heavily quantized samples.To this end,we propose an efficient gridless Bayesian algorithm named VALSE-EP,which is a combination of the high resolution and low complexity gridless variational line spectral estimation(VALSE)and expectation propagation(EP).The basic idea of VALSE-EP is to iteratively approximate the challenging quantized model of line spectral estimation as a sequence of simple pseudo unquantized models,where VALSE is applied.Moreover,to obtain a benchmark of the performance of the proposed algorithm,the Cram′er Rao bound(CRB)is derived.Finally,numerical experiments on both synthetic and real data are performed,demonstrating the near CRB performance of the proposed VALSE-EP for line spectral estimation from quantized samples.展开更多
A novel variational Bayesian inference based on adaptive cubature Kalman filter(VBACKF)algorithm is proposed for the problem of state estimation in a target tracking system with time-varying measurement noise and rand...A novel variational Bayesian inference based on adaptive cubature Kalman filter(VBACKF)algorithm is proposed for the problem of state estimation in a target tracking system with time-varying measurement noise and random measurement losses.Firstly,the Inverse-Wishart(IW)distribution is chosen to model the covariance matrix of time-varying measurement noise in the cubature Kalman filter framework.Secondly,the Bernoulli random variable is introduced as the judgement factor of the measurement losses,and the Beta distribution is selected as the conjugate prior distribution of measurement loss probability to ensure that the posterior distribution and prior distribution have the same function form.Finally,the joint posterior probability density function of the estimated variables is approximately decoupled by the variational Bayesian inference,and the fixed-point iteration approach is used to update the estimated variables.The simulation results show that the proposed VBACKF algorithm considers the comprehensive effects of system nonlinearity,time-varying measurement noise and unknown measurement loss probability,moreover,effectively improves the accuracy of target state estimation in complex scene.展开更多
Aiming at the problem of filtering precision degradation caused by the random outliers of process noise and measurement noise in multi-target tracking(MTT) system,a new Gaussian-Student’s t mixture distribution proba...Aiming at the problem of filtering precision degradation caused by the random outliers of process noise and measurement noise in multi-target tracking(MTT) system,a new Gaussian-Student’s t mixture distribution probability hypothesis density(PHD) robust filtering algorithm based on variational Bayesian inference(GST-vbPHD) is proposed.Firstly,since it can accurately describe the heavy-tailed characteristics of noise with outliers,Gaussian-Student’s t mixture distribution is employed to model process noise and measurement noise respectively.Then Bernoulli random variable is introduced to correct the likelihood distribution of the mixture probability,leading hierarchical Gaussian distribution constructed by the Gaussian-Student’s t mixture distribution suitable to model non-stationary noise.Finally,the approximate solutions including target weights,measurement noise covariance and state estimation error covariance are obtained according to variational Bayesian inference approach.The simulation results show that,in the heavy-tailed noise environment,the proposed algorithm leads to strong improvements over the traditional PHD filter and the Student’s t distribution PHD filter.展开更多
In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniq...In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.展开更多
The estimation of sparse underwater acoustic(UWA)channels can be regarded as an inference problem involving hidden variables within the Bayesian framework.While the classical sparse Bayesian learning(SBL),derived thro...The estimation of sparse underwater acoustic(UWA)channels can be regarded as an inference problem involving hidden variables within the Bayesian framework.While the classical sparse Bayesian learning(SBL),derived through the expectation maximization(EM)algorithm,has been widely employed for UWA channel estimation,it still differs from the real posterior expectation of channels.In this paper,we propose an approach that combines variational inference(VI)and Markov chain Monte Carlo(MCMC)methods to provide a more accurate posterior estimation.Specifically,the SBL is first re-derived with VI,allowing us to replace the posterior distribution of the hidden variables with a variational distribution.Then,we determine the full conditional probability distribution for each variable in the variational distribution and then iteratively perform random Gibbs sampling in MCMC to converge the Markov chain.The results of simulation and experiment indicate that our estimation method achieves lower mean square error and bit error rate compared to the classic SBL approach.Additionally,it demonstrates an acceptable convergence speed.展开更多
We present a new category of physics-informed neural networks called physics informed variational embedding generative adversarial network(PI-VEGAN),that effectively tackles the forward,inverse,and mixed problems of s...We present a new category of physics-informed neural networks called physics informed variational embedding generative adversarial network(PI-VEGAN),that effectively tackles the forward,inverse,and mixed problems of stochastic differential equations.In these scenarios,the governing equations are known,but only a limited number of sensor measurements of the system parameters are available.We integrate the governing physical laws into PI-VEGAN with automatic differentiation,while introducing a variational encoder for approximating the latent variables of the actual distribution of the measurements.These latent variables are integrated into the generator to facilitate accurate learning of the characteristics of the stochastic partial equations.Our model consists of three components,namely the encoder,generator,and discriminator,each of which is updated alternatively employing the stochastic gradient descent algorithm.We evaluate the effectiveness of PI-VEGAN in addressing forward,inverse,and mixed problems that require the concurrent calculation of system parameters and solutions.Numerical results demonstrate that the proposed method achieves satisfactory stability and accuracy in comparison with the previous physics-informed generative adversarial network(PI-WGAN).展开更多
Gene regulatory network(GRN)refers to the complex network formed by regulatory interactions between genes in living cells.In this paper,we consider inferring GRNs in single cells based on single-cell RNA sequencing(sc...Gene regulatory network(GRN)refers to the complex network formed by regulatory interactions between genes in living cells.In this paper,we consider inferring GRNs in single cells based on single-cell RNA sequencing(scRNA-seq)data.In scRNA-seq,single cells are often profiled from mixed populations,and their cell identities are unknown.A common practice for single-cell GRN analysis is to first cluster the cells and infer GRNs for every cluster separately.However,this two-step procedure ignores uncertainty in the clustering step and thus could lead to inaccurate estimation of the networks.Here,we consider the mixture Poisson lognormal model(MPLN)for network inference of count data from mixed populations.The precision matrices of the MPLN are the GRNs of different cell types.To avoid the intractable optimization of the MPLN’s log-likelihood,we develop an algorithm called variational mixture Poisson log-normal(VMPLN)to jointly estimate the GRNs of different cell types based on the variational inference method.We compare VMPLN with state-of-the-art single-cell regulatory network inference methods.Comprehensive simulation shows that VMPLN achieves better performance,especially in scenarios where different cell types have a high mixing degree.Benchmarking on real scRNA-seq data also demonstrates that VMPLN can provide more accurate network estimation in most cases.Finally,we apply VMPLN to a large scRNA-seq dataset from patients infected with severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)and find that VMPLN identifies critical differences in regulatory networks in immune cells between patients with moderate and severe symptoms.The source codes are available on the GitHub website(github.com/XiDsLab/SCVMPLN).展开更多
In the article, an improved variational inference (VI) framework for learning finite Beta-Liouville mixture models (BLM) is proposed for proportional data classification and clustering. Within the VI framework, so...In the article, an improved variational inference (VI) framework for learning finite Beta-Liouville mixture models (BLM) is proposed for proportional data classification and clustering. Within the VI framework, some non-linear approximation techniques are adopted to obtain the approximated variational object functions. Analytical solutions are obtained for the variational posterior distributions. Compared to the expectation maximization (EM) algorithm which is commonly used for learning mixture models, underfitting and overfitting events can be prevented. Furthermore, parameters and complexity of the mixture model (model order) can be estimated simultaneously. Experiment shows that both synthetic and real-world data sets are to demonstrate the feasibility and advantages of the proposed method.展开更多
This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous fe...This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous feature metric is first constructed to measure the similarity between different types of features.Then,a feature metric-restricted hierarchical sample generation process is established,in which sample with heterogeneous features is clustered by generating it from a similarity constraint hidden space.When estimating the model parameters and posterior probability,the corresponding variational inference algorithm is derived and implemented.To verify our model capability,we demonstrate our model on the synthetic dataset and show the superiority of the proposed method on some real datasets.Our source code is released on the website:Github.com/yexlwh/Heterogeneousclustering.展开更多
With the rapid growth of power systems measurements in terms of size and complexity,discovering statistical patterns for a large variety of real-world applications such as renewable energy prediction,demand response,e...With the rapid growth of power systems measurements in terms of size and complexity,discovering statistical patterns for a large variety of real-world applications such as renewable energy prediction,demand response,energy disaggregation,and state estimation is considered a crucial challenge.In recent years,deep learning has emerged as a novel class of machine learning algorithms that represents power systems data via a large hypothesis space that leads to the state-of-the-art performance compared to most recent data-driven algorithms.This study explores the theoretical advantages of deep representation learning in power systems research.We review deep learning methodologies presented and applied in a wide range of supervised,unsupervised,and semi-supervised applications as well as reinforcement learning tasks.We discuss various settings of problems solved by discriminative deep models including stacked autoencoders and convolutional neural networks as well as generative deep architectures such as deep belief networks and variational autoencoders.The theoretical and experimental analysis of deep neural networks in this study motivates longterm research on optimizing this cutting-edge class of models to achieve significant improvements in the future power systems research.展开更多
The paper proposes a novel probabilistic generative model for simultaneous image classification and annotation. The model considers the fact that the category information can provide valuable information for image ann...The paper proposes a novel probabilistic generative model for simultaneous image classification and annotation. The model considers the fact that the category information can provide valuable information for image annotation. Once the category of an image is ascertained, the scope of annotation words can be narrowed, and the probability of generating irrelevant annotation words can be reduced. To this end, the idea that annotates images according to class is introduced in the model. Using variational methods, the approximate inference and parameters estimation algorithms of the model are derived, and efficient approximations for classifying and annotating new images are also given. The power of our model is demonstrated on two real world datasets: a 1 600-images LabelMe dataset and a 1 791-images UIUC-Sport dataset. The experiment results show that the classification performance is on par with several state-of-the-art classification models, while the annotation performance is better than that of several state-of-the-art annotation models.展开更多
In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of ...In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of topics need to be given in advance.Moreover,theGTMreleases the assumption of‘bag of words’and considers the graph structure of the text.The combination of HDP and GTM takes advantage of both which is named as HDP–GTM.The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed.We apply the proposed model in text categorisation,comparing to three related topic models,latent Dirichlet allocation(LDA),GTM and HDP.展开更多
基金supported by the National Key Research andDevelopment Program of China(2017YFA0700300)the National Natural Sciences Foundation of China(61533005,61703071,61603069)。
文摘Prediction intervals(PIs)for industrial time series can provide useful guidance for workers.Given that the failure of industrial sensors may cause the missing point in inputs,the existing kernel dynamic Bayesian networks(KDBN),serving as an effective method for PIs construction,suffer from high computational load using the stochastic algorithm for inference.This study proposes a variational inference method for the KDBN for the purpose of fast inference,which avoids the timeconsuming stochastic sampling.The proposed algorithm contains two stages.The first stage involves the inference of the missing inputs by using a local linearization based variational inference,and based on the computed posterior distributions over the missing inputs the second stage sees a Gaussian approximation for probability over the nodes in future time slices.To verify the effectiveness of the proposed method,a synthetic dataset and a practical dataset of generation flow of blast furnace gas(BFG)are employed with different ratios of missing inputs.The experimental results indicate that the proposed method can provide reliable PIs for the generation flow of BFG and it exhibits shorter computing time than the stochastic based one.
基金supported in part by the National Science Foundation of China(NSFC)with grant no.62271514in part by the Science,Technology and Innovation Commission of Shenzhen Municipality with grant no.JCYJ20210324120002007 and ZDSYS20210623091807023in part by the State Key Laboratory of Public Big Data with grant no.PBD2023-01。
文摘Recently,deep learning-based semantic communication has garnered widespread attention,with numerous systems designed for transmitting diverse data sources,including text,image,and speech,etc.While efforts have been directed toward improving system performance,many studies have concentrated on enhancing the structure of the encoder and decoder.However,this often overlooks the resulting increase in model complexity,imposing additional storage and computational burdens on smart devices.Furthermore,existing work tends to prioritize explicit semantics,neglecting the potential of implicit semantics.This paper aims to easily and effectively enhance the receiver's decoding capability without modifying the encoder and decoder structures.We propose a novel semantic communication system with variational neural inference for text transmission.Specifically,we introduce a simple but effective variational neural inferer at the receiver to infer the latent semantic information within the received text.This information is then utilized to assist in the decoding process.The simulation results show a significant enhancement in system performance and improved robustness.
基金This work was supported in part by the National Natural Science Foundation of China under Grant Nos. 61572226 and 61876069, and the Key Scientific and Technological Research and Development Project of Jilin Province of China under Grant Nos. 20180201067GX and 20180201044GX.
文摘Topic modeling is a mainstream and effective technology to deal with text data, with wide applications in text analysis, natural language, personalized recommendation, computer vision, etc. Among all the known topic models, supervised Latent Dirichlet Allocation (sLDA) is acknowledged as a popular and competitive supervised topic model. How- ever, the gradual increase of the scale of datasets makes sLDA more and more inefficient and time-consuming, and limits its applications in a very narrow range. To solve it, a parallel online sLDA, named PO-sLDA (Parallel and Online sLDA), is proposed in this study. It uses the stochastic variational inference as the learning method to make the training procedure more rapid and efficient, and a parallel computing mechanism implemented via the MapReduce framework is proposed to promote the capacity of cloud computing and big data processing. The online training capacity supported by PO-sLDA expands the application scope of this approach, making it instrumental for real-life applications with high real-time demand. The validation using two datasets with different sizes shows that the proposed approach has the comparative accuracy as the sLDA and can efficiently accelerate the training procedure. Moreover, its good convergence and online training capacity make it lucrative for the large-scale text data analyzing and processing.
基金the National Natural Science Foundation of China under Grant Nos.61922076,61873252,61725304,and 61973324in part by Guangdong Basic and Applied Basic Research Foundation under Grant No.2021B1515020094in part by the Guangdong Provincial Key Laboratory of Computational Science under Grant No.2020B1212060032。
文摘Stochastic variational inference is an efficient Bayesian inference technology for massive datasets,which approximates posteriors by using noisy gradient estimates.Traditional stochastic variational inference can only be performed in a centralized manner,which limits its applications in a wide range of situations where data is possessed by multiple nodes.Therefore,this paper develops a novel trust-region based stochastic variational inference algorithm for a general class of conjugate-exponential models over distributed and asynchronous networks,where the global parameters are diffused over the network by using the Metropolis rule and the local parameters are updated by using the trust-region method.Besides,a simple rule is introduced to balance the transmission frequencies between neighboring nodes such that the proposed distributed algorithm can be performed in an asynchronous manner.The utility of the proposed algorithm is tested by fitting the Bernoulli model and the Gaussian model to different datasets on a synthetic network,and experimental results demonstrate its effectiveness and advantages over existing works.
基金This work was supported by the National Natural Science Foundation of China under Grant Nos. 61170092, 61133011 and 61103091.
文摘Stochastic variational inference (SVI) can learn topic models with very big corpora. It optimizes the variational objective by using the stochastic natural gradient algorithm with a decreasing learning rate. This rate is crucial for SVI; however, it is often tuned by hand in real applications. To address this, we develop a novel algorithm, which tunes the learning rate of each iteration adaptively. The proposed algorithm uses the Kullback-Leibler (KL) divergence to measure the similarity between the variational distribution with noisy update and that with batch update, and then optimizes the learning rates by minimizing the KL divergence. We apply our algorithm to two representative topic models: latent Dirichlet allocation and hierarchical Dirichlet process. Experimental results indicate that our algorithm performs better and converges faster than commonly used learning rates.
基金This work was supported in part by National Natural Science Foundation of China under Grants 62103167 and 61833007in part by the Natural Science Foundation of Jiangsu Province under Grant BK20210451.
文摘This paper is focused on the state estimation problem for nonlinear systems with unknown statistics of measurement noise.Based on the cubature Kalman filter,we propose a new nonlinear filtering algorithm that employs a skew t distribution to characterize the asymmetry of the measurement noise.The system states and the statistics of skew t noise distribution,including the shape matrix,the scale matrix,and the degree of freedom(DOF)are estimated jointly by employing variational Bayesian(VB)inference.The proposed method is validated in a target tracking example.Results of the simulation indicate that the proposed nonlinear filter can perform satisfactorily in the presence of unknown statistics of measurement noise and outperform than the existing state-of-the-art nonlinear filters.
基金supported by National Natural Science Foundation of China(No.61901415)。
文摘Efficient estimation of line spectral from quantized samples is of significant importance in information theory and signal processing,e.g.,channel estimation in energy efficient massive MIMO systems and direction of arrival estimation.The goal of this paper is to recover the line spectral as well as its corresponding parameters including the model order,frequencies and amplitudes from heavily quantized samples.To this end,we propose an efficient gridless Bayesian algorithm named VALSE-EP,which is a combination of the high resolution and low complexity gridless variational line spectral estimation(VALSE)and expectation propagation(EP).The basic idea of VALSE-EP is to iteratively approximate the challenging quantized model of line spectral estimation as a sequence of simple pseudo unquantized models,where VALSE is applied.Moreover,to obtain a benchmark of the performance of the proposed algorithm,the Cram′er Rao bound(CRB)is derived.Finally,numerical experiments on both synthetic and real data are performed,demonstrating the near CRB performance of the proposed VALSE-EP for line spectral estimation from quantized samples.
基金Supported by the National Natural Science Foundation of China(No.61976080)the Science and Technology Key Project of Science and TechnologyDepartment of Henan Province(No.212102310298)+1 种基金the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)the Innovation and Quality Improvement Project for Graduate Education of Henan University(No.SYL20010101)。
文摘A novel variational Bayesian inference based on adaptive cubature Kalman filter(VBACKF)algorithm is proposed for the problem of state estimation in a target tracking system with time-varying measurement noise and random measurement losses.Firstly,the Inverse-Wishart(IW)distribution is chosen to model the covariance matrix of time-varying measurement noise in the cubature Kalman filter framework.Secondly,the Bernoulli random variable is introduced as the judgement factor of the measurement losses,and the Beta distribution is selected as the conjugate prior distribution of measurement loss probability to ensure that the posterior distribution and prior distribution have the same function form.Finally,the joint posterior probability density function of the estimated variables is approximately decoupled by the variational Bayesian inference,and the fixed-point iteration approach is used to update the estimated variables.The simulation results show that the proposed VBACKF algorithm considers the comprehensive effects of system nonlinearity,time-varying measurement noise and unknown measurement loss probability,moreover,effectively improves the accuracy of target state estimation in complex scene.
基金Supported by the National Natural Science Foundation of China(No.61976080)the Science and Technology Key Project of Science and Technology Department of Henan Province(No.212102310298)the Innovation and Quality Improvement Project for Graduate Education of Henan University(No.SYL20010101)。
文摘Aiming at the problem of filtering precision degradation caused by the random outliers of process noise and measurement noise in multi-target tracking(MTT) system,a new Gaussian-Student’s t mixture distribution probability hypothesis density(PHD) robust filtering algorithm based on variational Bayesian inference(GST-vbPHD) is proposed.Firstly,since it can accurately describe the heavy-tailed characteristics of noise with outliers,Gaussian-Student’s t mixture distribution is employed to model process noise and measurement noise respectively.Then Bernoulli random variable is introduced to correct the likelihood distribution of the mixture probability,leading hierarchical Gaussian distribution constructed by the Gaussian-Student’s t mixture distribution suitable to model non-stationary noise.Finally,the approximate solutions including target weights,measurement noise covariance and state estimation error covariance are obtained according to variational Bayesian inference approach.The simulation results show that,in the heavy-tailed noise environment,the proposed algorithm leads to strong improvements over the traditional PHD filter and the Student’s t distribution PHD filter.
基金supported by the National Natural Science Foundation of China(No.62271274).
文摘In the tag recommendation task on academic platforms,existing methods disregard users’customized preferences in favor of extracting tags based just on the content of the articles.Besides,it uses co-occurrence techniques and tries to combine nodes’textual content for modelling.They still do not,however,directly simulate many interactions in network learning.In order to address these issues,we present a novel system that more thoroughly integrates user preferences and citation networks into article labelling recommendations.Specifically,we first employ path similarity to quantify the degree of similarity between user labelling preferences and articles in the citation network.Then,the Commuting Matrix for massive node pair paths is used to improve computational performance.Finally,the two commonalities mentioned above are combined with the interaction paper labels based on the additivity of Poisson distribution.In addition,we also consider solving the model’s parameters by applying variational inference.Experimental results demonstrate that our suggested framework agrees and significantly outperforms the state-of-the-art baseline on two real datasets by efficiently merging the three relational data.Based on the Area Under Curve(AUC)and Mean Average Precision(MAP)analysis,the performance of the suggested task is evaluated,and it is demonstrated to have a greater solving efficiency than current techniques.
基金funded by the Excellent Youth Science Fund of Heilongjiang Province(Grant No.YQ2022F001).
文摘The estimation of sparse underwater acoustic(UWA)channels can be regarded as an inference problem involving hidden variables within the Bayesian framework.While the classical sparse Bayesian learning(SBL),derived through the expectation maximization(EM)algorithm,has been widely employed for UWA channel estimation,it still differs from the real posterior expectation of channels.In this paper,we propose an approach that combines variational inference(VI)and Markov chain Monte Carlo(MCMC)methods to provide a more accurate posterior estimation.Specifically,the SBL is first re-derived with VI,allowing us to replace the posterior distribution of the hidden variables with a variational distribution.Then,we determine the full conditional probability distribution for each variable in the variational distribution and then iteratively perform random Gibbs sampling in MCMC to converge the Markov chain.The results of simulation and experiment indicate that our estimation method achieves lower mean square error and bit error rate compared to the classic SBL approach.Additionally,it demonstrates an acceptable convergence speed.
基金supported by the National Natural Science Foundation of China(Grant Nos.11771257,12271468)the Natural Science Foundation of Shandong Province(Grant Nos.ZR2021MA010,ZR2021ZD03).
文摘We present a new category of physics-informed neural networks called physics informed variational embedding generative adversarial network(PI-VEGAN),that effectively tackles the forward,inverse,and mixed problems of stochastic differential equations.In these scenarios,the governing equations are known,but only a limited number of sensor measurements of the system parameters are available.We integrate the governing physical laws into PI-VEGAN with automatic differentiation,while introducing a variational encoder for approximating the latent variables of the actual distribution of the measurements.These latent variables are integrated into the generator to facilitate accurate learning of the characteristics of the stochastic partial equations.Our model consists of three components,namely the encoder,generator,and discriminator,each of which is updated alternatively employing the stochastic gradient descent algorithm.We evaluate the effectiveness of PI-VEGAN in addressing forward,inverse,and mixed problems that require the concurrent calculation of system parameters and solutions.Numerical results demonstrate that the proposed method achieves satisfactory stability and accuracy in comparison with the previous physics-informed generative adversarial network(PI-WGAN).
基金National Key Research and Development Program of China,Grant/Award Numbers:2020YFE0204200,2020YFE0204000National Natural Science Foundation of China,Grant/Award Numbers:11971039,12371286Foundation of Shuanghu Laboratory,Grant/Award Number:SH‐2024JK01。
文摘Gene regulatory network(GRN)refers to the complex network formed by regulatory interactions between genes in living cells.In this paper,we consider inferring GRNs in single cells based on single-cell RNA sequencing(scRNA-seq)data.In scRNA-seq,single cells are often profiled from mixed populations,and their cell identities are unknown.A common practice for single-cell GRN analysis is to first cluster the cells and infer GRNs for every cluster separately.However,this two-step procedure ignores uncertainty in the clustering step and thus could lead to inaccurate estimation of the networks.Here,we consider the mixture Poisson lognormal model(MPLN)for network inference of count data from mixed populations.The precision matrices of the MPLN are the GRNs of different cell types.To avoid the intractable optimization of the MPLN’s log-likelihood,we develop an algorithm called variational mixture Poisson log-normal(VMPLN)to jointly estimate the GRNs of different cell types based on the variational inference method.We compare VMPLN with state-of-the-art single-cell regulatory network inference methods.Comprehensive simulation shows that VMPLN achieves better performance,especially in scenarios where different cell types have a high mixing degree.Benchmarking on real scRNA-seq data also demonstrates that VMPLN can provide more accurate network estimation in most cases.Finally,we apply VMPLN to a large scRNA-seq dataset from patients infected with severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)and find that VMPLN identifies critical differences in regulatory networks in immune cells between patients with moderate and severe symptoms.The source codes are available on the GitHub website(github.com/XiDsLab/SCVMPLN).
基金supported by the National Natural Science Foundation of China(61303232,61363085,61121061,60972077)the Hi-Tech Research and Development Program of China(2009AA01Z430)
文摘In the article, an improved variational inference (VI) framework for learning finite Beta-Liouville mixture models (BLM) is proposed for proportional data classification and clustering. Within the VI framework, some non-linear approximation techniques are adopted to obtain the approximated variational object functions. Analytical solutions are obtained for the variational posterior distributions. Compared to the expectation maximization (EM) algorithm which is commonly used for learning mixture models, underfitting and overfitting events can be prevented. Furthermore, parameters and complexity of the mixture model (model order) can be estimated simultaneously. Experiment shows that both synthetic and real-world data sets are to demonstrate the feasibility and advantages of the proposed method.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.62006131,62071260)the National Natural Science Foundation of Zhejiang Province(LQ21F020009,LQ18F020001).
文摘This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous feature metric is first constructed to measure the similarity between different types of features.Then,a feature metric-restricted hierarchical sample generation process is established,in which sample with heterogeneous features is clustered by generating it from a similarity constraint hidden space.When estimating the model parameters and posterior probability,the corresponding variational inference algorithm is derived and implemented.To verify our model capability,we demonstrate our model on the synthetic dataset and show the superiority of the proposed method on some real datasets.Our source code is released on the website:Github.com/yexlwh/Heterogeneousclustering.
基金supported by the Science and Technology Project of State Grid Corporation of China(No.5455HJ180018).
文摘With the rapid growth of power systems measurements in terms of size and complexity,discovering statistical patterns for a large variety of real-world applications such as renewable energy prediction,demand response,energy disaggregation,and state estimation is considered a crucial challenge.In recent years,deep learning has emerged as a novel class of machine learning algorithms that represents power systems data via a large hypothesis space that leads to the state-of-the-art performance compared to most recent data-driven algorithms.This study explores the theoretical advantages of deep representation learning in power systems research.We review deep learning methodologies presented and applied in a wide range of supervised,unsupervised,and semi-supervised applications as well as reinforcement learning tasks.We discuss various settings of problems solved by discriminative deep models including stacked autoencoders and convolutional neural networks as well as generative deep architectures such as deep belief networks and variational autoencoders.The theoretical and experimental analysis of deep neural networks in this study motivates longterm research on optimizing this cutting-edge class of models to achieve significant improvements in the future power systems research.
基金supported by the Major Research Plan of the National Natural Science Foundation of China (90920006)
文摘The paper proposes a novel probabilistic generative model for simultaneous image classification and annotation. The model considers the fact that the category information can provide valuable information for image annotation. Once the category of an image is ascertained, the scope of annotation words can be narrowed, and the probability of generating irrelevant annotation words can be reduced. To this end, the idea that annotates images according to class is introduced in the model. Using variational methods, the approximate inference and parameters estimation algorithms of the model are derived, and efficient approximations for classifying and annotating new images are also given. The power of our model is demonstrated on two real world datasets: a 1 600-images LabelMe dataset and a 1 791-images UIUC-Sport dataset. The experiment results show that the classification performance is on par with several state-of-the-art classification models, while the annotation performance is better than that of several state-of-the-art annotation models.
基金supported by NSFC under grant No.71371074the 111 Project under No.B14019.
文摘In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of topics need to be given in advance.Moreover,theGTMreleases the assumption of‘bag of words’and considers the graph structure of the text.The combination of HDP and GTM takes advantage of both which is named as HDP–GTM.The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed.We apply the proposed model in text categorisation,comparing to three related topic models,latent Dirichlet allocation(LDA),GTM and HDP.