In order to meet the real-time performance requirements,intelligent decisions in Internet of things applications must take place right here right now at the network edge.Pushing the artificial intelligence frontier to...In order to meet the real-time performance requirements,intelligent decisions in Internet of things applications must take place right here right now at the network edge.Pushing the artificial intelligence frontier to achieve edge intelligence is nontrivial due to the constrained computing resources and limited training data at the network edge.To tackle these challenges,we develop a distributionally robust optimization(DRO)-based edge learning algorithm,where the uncertainty model is constructed to foster the synergy of cloud knowledge and local training.Specifically,the cloud transferred knowledge is in the form of a Dirichlet process prior distribution for the edge model parameters,and the edge device further constructs an uncertainty set centered around the empirical distribution of its local samples.The edge learning DRO problem,subject to these two distributional uncertainty constraints,is recast as a single-layer optimization problem using a duality approach.We then use an Expectation-Maximization algorithm-inspired method to derive a convex relaxation,based on which we devise algorithms to learn the edge model.Furthermore,we illustrate that the meta-learning fast adaptation procedure is equivalent to our proposed Dirichlet process prior-based approach.Finally,extensive experiments are implemented to showcase the performance gain over standard approaches using edge data only.展开更多
In the Bayesian mixture modeling framework it is possible to infer the necessary number of components to model the data and therefore it is unnecessary to explicitly restrict the number of components. Nonparametric mi...In the Bayesian mixture modeling framework it is possible to infer the necessary number of components to model the data and therefore it is unnecessary to explicitly restrict the number of components. Nonparametric mixture models sidestep the problem of finding the "correct" number of mixture components by assuming infinitely many components. In this paper Dirichlet process mixture (DPM) models are cast as infinite mixture models and inference using Markov chain Monte Carlo is described. The specification of the priors on the model parameters is often guided by mathematical and practical convenience. The primary goal of this paper is to compare the choice of conjugate and non-conjugate base distributions on a particular class of DPM models which is widely used in applications, the Dirichlet process Gaussian mixture model (DPGMM). We compare computational efficiency and modeling performance of DPGMM defined using a conjugate and a conditionally conjugate base distribution. We show that better density models can result from using a wider class of priors with no or only a modest increase in computational effort.展开更多
The core of the nonparametric/semiparametric Bayesian analysis is to relax the particular parametric assumptions on the distributions of interest to be unknown and random,and assign them a prior.Selecting a suitable p...The core of the nonparametric/semiparametric Bayesian analysis is to relax the particular parametric assumptions on the distributions of interest to be unknown and random,and assign them a prior.Selecting a suitable prior therefore is especially critical in the nonparametric Bayesian fitting.As the distribution of distribution,Dirichlet process(DP)is the most appreciated nonparametric prior due to its nice theoretical proprieties,modeling flexibility and computational feasibility.In this paper,we review and summarize some developments of DP during the past decades.Our focus is mainly concentrated upon its theoretical properties,various extensions,statistical modeling and applications to the latent variable models.展开更多
Interest in automated data classification and identification systems has increased over the past years in conjunction with the high demand for artificial intelligence and security applications.In particular,recognizin...Interest in automated data classification and identification systems has increased over the past years in conjunction with the high demand for artificial intelligence and security applications.In particular,recognizing human activities with accurate results have become a topic of high interest.Although the current tools have reached remarkable successes,it is still a challenging problem due to various uncontrolled environments and conditions.In this paper two statistical frameworks based on nonparametric hierarchical Bayesian models and Gamma distribution are proposed to solve some realworld applications.In particular,two nonparametric hierarchical Bayesian models based on Dirichlet process and Pitman-Yor process are developed.These models are then applied to address the problem of modelling grouped data where observations are organized into groups and these groups are statistically linked by sharing mixture components.The choice of the Gamma mixtures is motivated by its flexibility for modelling heavy-tailed distributions.In addition,deploying the Dirichlet process prior is justified by its advantage of automatically finding the right number of components and providing nice properties.Moreover,a learning step via variational Bayesian setting is presented in a flexible way.The priors over the parameters are selected appropriately and the posteriors are approximated effectively in a closed form.Experimental results based on a real-life applications that concerns texture classification and human actions recognition show the capabilities and effectiveness of the proposed framework.展开更多
In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of ...In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of topics need to be given in advance.Moreover,theGTMreleases the assumption of‘bag of words’and considers the graph structure of the text.The combination of HDP and GTM takes advantage of both which is named as HDP–GTM.The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed.We apply the proposed model in text categorisation,comparing to three related topic models,latent Dirichlet allocation(LDA),GTM and HDP.展开更多
In the present paper the transformation of symmetric Markov processes by symmetric martingale multiplicative functionals is studied and the corresponding Dirichlet form is formulated.
Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and the cor...Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and the corresponding classifier.展开更多
Given a Markov process satisfying certain general type conditions,whose paths are notassumed to be continuous. Let D by an open subset of the state space E. Any bounded function defined on thecomplement of D extends t...Given a Markov process satisfying certain general type conditions,whose paths are notassumed to be continuous. Let D by an open subset of the state space E. Any bounded function defined on thecomplement of D extends to be a function on E (?)uch that it is harmonic in D and satisfies the Dirichletboundary condition at any regular boundary point of D. The relation between harmonic functions and theebaracteristic operator of the given process is discussed.展开更多
The Gamma-Dirichlet algebra corresponds to the decomposition of the gamma process into the independent product of a gamma random variable and a Dirichlet process. This structure allows us to study the properties of th...The Gamma-Dirichlet algebra corresponds to the decomposition of the gamma process into the independent product of a gamma random variable and a Dirichlet process. This structure allows us to study the properties of the Dirichlet process through the gamma process and vice versa. In this article, we begin with a brief survey of several existing results concerning this structure. New results are then obtained for the large deviations of the jump sizes of the gamma process and the quasi-invariance of the two-parameter Poisson-Dirichlet distribution. We finish the paper with the derivation of the transition function of the Fleming-Viot process with parent independent mutation from the transition function of the measure-valued branching diffusion with immigration by exploring the Gamma-Dirichlet algebra embedded in these processes. This last result is motivated by an open R. C. Gritfiths. problem proposed by S. N. Ethier and展开更多
This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous fe...This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous feature metric is first constructed to measure the similarity between different types of features.Then,a feature metric-restricted hierarchical sample generation process is established,in which sample with heterogeneous features is clustered by generating it from a similarity constraint hidden space.When estimating the model parameters and posterior probability,the corresponding variational inference algorithm is derived and implemented.To verify our model capability,we demonstrate our model on the synthetic dataset and show the superiority of the proposed method on some real datasets.Our source code is released on the website:Github.com/yexlwh/Heterogeneousclustering.展开更多
Travel time reliability(TTR) modeling has gain attention among researchers’ due to its ability to represent road user satisfaction as well as providing a predictability of a trip travel time.Despite this significant ...Travel time reliability(TTR) modeling has gain attention among researchers’ due to its ability to represent road user satisfaction as well as providing a predictability of a trip travel time.Despite this significant effort,its impact on the severity of a crash is not well explored.This study analyzes the effect of TTR and other variables on the probability of the crash severity occurring on arterial roads.To address the unobserved heterogeneity problem,two random-effect regressions were applied;the Dirichlet random-effect(DRE)and the traditional random-effect(TRE) logistic regression.The difference between the two models is that the random-effect in the DRE is non-parametrically specified while in the TRE model is parametrically specified.The Markov Chain Monte Carlo simulations were adopted to infer the parameters’ posterior distributions of the two developed models.Using four-year police-reported crash data and travel speeds from Northeast Florida,the analysis of goodness-of-fit found the DRE model to best fit the data.Hence,it was used in studying the influence of TTR and other variables on crash severity.The DRE model findings suggest that TTR is statistically significant,at 95 percent credible intervals,influencing the severity level of a crash.A unit increases in TTR reduces the likelihood of a severe crash occurrence by 25 percent.Moreover,among the significant variables,alcohol/drug impairment was found to have the highest impact in influencing the occurrence of severe crashes.Other significant factors included traffic volume,weekends,speed,work-zone,land use,visibility,seatbelt usage,segment length,undivided/divided highway,and age.展开更多
In this paper we study the existence, pathwise uniqueness and homeomorphism flow of strong solutions to a class of one dimensional SDEs driven by infinitely many Brownian motions, and with Yamada- Watanabe diffusion c...In this paper we study the existence, pathwise uniqueness and homeomorphism flow of strong solutions to a class of one dimensional SDEs driven by infinitely many Brownian motions, and with Yamada- Watanabe diffusion coefficients and distributional drift coefficients.展开更多
In this paper, we propose a Bayesian semiparametric mean-covariance regression model with known covariance structures. A mixture model is used to describe the potential non-normal distribution of the regression errors...In this paper, we propose a Bayesian semiparametric mean-covariance regression model with known covariance structures. A mixture model is used to describe the potential non-normal distribution of the regression errors. Moreover, an empirical likelihood adjusted mixture of Dirichlet process model is constructed to produce distributions with given mean and variance constraints. We illustrate through simulation studies that the proposed method provides better estimations in some non-normal cases. We also demonstrate the implementation of our method by analyzing the data set from a sleep deprivation study.展开更多
基金This work was supported in part by NSF under Grant CPS-1739344,ARO under grant W911NF-16-1-0448the DTRA under Grant HDTRA1-13-1-0029Part of this work will appear in the Proceedings of 40th IEEE International Conference on Distributed Computing Systems(ICDCS),Singapore,July 8-10,2020。
文摘In order to meet the real-time performance requirements,intelligent decisions in Internet of things applications must take place right here right now at the network edge.Pushing the artificial intelligence frontier to achieve edge intelligence is nontrivial due to the constrained computing resources and limited training data at the network edge.To tackle these challenges,we develop a distributionally robust optimization(DRO)-based edge learning algorithm,where the uncertainty model is constructed to foster the synergy of cloud knowledge and local training.Specifically,the cloud transferred knowledge is in the form of a Dirichlet process prior distribution for the edge model parameters,and the edge device further constructs an uncertainty set centered around the empirical distribution of its local samples.The edge learning DRO problem,subject to these two distributional uncertainty constraints,is recast as a single-layer optimization problem using a duality approach.We then use an Expectation-Maximization algorithm-inspired method to derive a convex relaxation,based on which we devise algorithms to learn the edge model.Furthermore,we illustrate that the meta-learning fast adaptation procedure is equivalent to our proposed Dirichlet process prior-based approach.Finally,extensive experiments are implemented to showcase the performance gain over standard approaches using edge data only.
基金supported by Gatsby Charitable Foundation and PASCAL2
文摘In the Bayesian mixture modeling framework it is possible to infer the necessary number of components to model the data and therefore it is unnecessary to explicitly restrict the number of components. Nonparametric mixture models sidestep the problem of finding the "correct" number of mixture components by assuming infinitely many components. In this paper Dirichlet process mixture (DPM) models are cast as infinite mixture models and inference using Markov chain Monte Carlo is described. The specification of the priors on the model parameters is often guided by mathematical and practical convenience. The primary goal of this paper is to compare the choice of conjugate and non-conjugate base distributions on a particular class of DPM models which is widely used in applications, the Dirichlet process Gaussian mixture model (DPGMM). We compare computational efficiency and modeling performance of DPGMM defined using a conjugate and a conditionally conjugate base distribution. We show that better density models can result from using a wider class of priors with no or only a modest increase in computational effort.
基金supported in part by the National Natural Science Foundation of China(Grant No.11471161)the Technological Innovation Item in Jiangsu Province(No.BK2008156).
文摘The core of the nonparametric/semiparametric Bayesian analysis is to relax the particular parametric assumptions on the distributions of interest to be unknown and random,and assign them a prior.Selecting a suitable prior therefore is especially critical in the nonparametric Bayesian fitting.As the distribution of distribution,Dirichlet process(DP)is the most appreciated nonparametric prior due to its nice theoretical proprieties,modeling flexibility and computational feasibility.In this paper,we review and summarize some developments of DP during the past decades.Our focus is mainly concentrated upon its theoretical properties,various extensions,statistical modeling and applications to the latent variable models.
基金The authors would like to thank Taif University Researchers Supporting Project number(TURSP-2020/26),Taif University,Taif,Saudi ArabiaThey would like also to thank Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R40),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘Interest in automated data classification and identification systems has increased over the past years in conjunction with the high demand for artificial intelligence and security applications.In particular,recognizing human activities with accurate results have become a topic of high interest.Although the current tools have reached remarkable successes,it is still a challenging problem due to various uncontrolled environments and conditions.In this paper two statistical frameworks based on nonparametric hierarchical Bayesian models and Gamma distribution are proposed to solve some realworld applications.In particular,two nonparametric hierarchical Bayesian models based on Dirichlet process and Pitman-Yor process are developed.These models are then applied to address the problem of modelling grouped data where observations are organized into groups and these groups are statistically linked by sharing mixture components.The choice of the Gamma mixtures is motivated by its flexibility for modelling heavy-tailed distributions.In addition,deploying the Dirichlet process prior is justified by its advantage of automatically finding the right number of components and providing nice properties.Moreover,a learning step via variational Bayesian setting is presented in a flexible way.The priors over the parameters are selected appropriately and the posteriors are approximated effectively in a closed form.Experimental results based on a real-life applications that concerns texture classification and human actions recognition show the capabilities and effectiveness of the proposed framework.
基金supported by NSFC under grant No.71371074the 111 Project under No.B14019.
文摘In this paper,a nonparametric Bayesian graph topic model(GTM)based on hierarchical Dirichlet process(HDP)is proposed.The HDP makes the number of topics selected flexibly,which breaks the limitation that the number of topics need to be given in advance.Moreover,theGTMreleases the assumption of‘bag of words’and considers the graph structure of the text.The combination of HDP and GTM takes advantage of both which is named as HDP–GTM.The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed.We apply the proposed model in text categorisation,comparing to three related topic models,latent Dirichlet allocation(LDA),GTM and HDP.
基金in partby the National Natural Science Founda-tion of China(1 950 1 0 36)
文摘In the present paper the transformation of symmetric Markov processes by symmetric martingale multiplicative functionals is studied and the corresponding Dirichlet form is formulated.
文摘Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and the corresponding classifier.
文摘Given a Markov process satisfying certain general type conditions,whose paths are notassumed to be continuous. Let D by an open subset of the state space E. Any bounded function defined on thecomplement of D extends to be a function on E (?)uch that it is harmonic in D and satisfies the Dirichletboundary condition at any regular boundary point of D. The relation between harmonic functions and theebaracteristic operator of the given process is discussed.
文摘The Gamma-Dirichlet algebra corresponds to the decomposition of the gamma process into the independent product of a gamma random variable and a Dirichlet process. This structure allows us to study the properties of the Dirichlet process through the gamma process and vice versa. In this article, we begin with a brief survey of several existing results concerning this structure. New results are then obtained for the large deviations of the jump sizes of the gamma process and the quasi-invariance of the two-parameter Poisson-Dirichlet distribution. We finish the paper with the derivation of the transition function of the Fleming-Viot process with parent independent mutation from the transition function of the measure-valued branching diffusion with immigration by exploring the Gamma-Dirichlet algebra embedded in these processes. This last result is motivated by an open R. C. Gritfiths. problem proposed by S. N. Ethier and
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.62006131,62071260)the National Natural Science Foundation of Zhejiang Province(LQ21F020009,LQ18F020001).
文摘This paper aims to study the deep clustering problem with heterogeneous features and unknown cluster number.To address this issue,a novel deep Bayesian clustering framework is proposed.In particular,a heterogeneous feature metric is first constructed to measure the similarity between different types of features.Then,a feature metric-restricted hierarchical sample generation process is established,in which sample with heterogeneous features is clustered by generating it from a similarity constraint hidden space.When estimating the model parameters and posterior probability,the corresponding variational inference algorithm is derived and implemented.To verify our model capability,we demonstrate our model on the synthetic dataset and show the superiority of the proposed method on some real datasets.Our source code is released on the website:Github.com/yexlwh/Heterogeneousclustering.
基金the Center for Accessibility and Safety for an Aging Population at Florida State UniversityFlorida A&M UniversityUniversity of North Florida for funding support in research
文摘Travel time reliability(TTR) modeling has gain attention among researchers’ due to its ability to represent road user satisfaction as well as providing a predictability of a trip travel time.Despite this significant effort,its impact on the severity of a crash is not well explored.This study analyzes the effect of TTR and other variables on the probability of the crash severity occurring on arterial roads.To address the unobserved heterogeneity problem,two random-effect regressions were applied;the Dirichlet random-effect(DRE)and the traditional random-effect(TRE) logistic regression.The difference between the two models is that the random-effect in the DRE is non-parametrically specified while in the TRE model is parametrically specified.The Markov Chain Monte Carlo simulations were adopted to infer the parameters’ posterior distributions of the two developed models.Using four-year police-reported crash data and travel speeds from Northeast Florida,the analysis of goodness-of-fit found the DRE model to best fit the data.Hence,it was used in studying the influence of TTR and other variables on crash severity.The DRE model findings suggest that TTR is statistically significant,at 95 percent credible intervals,influencing the severity level of a crash.A unit increases in TTR reduces the likelihood of a severe crash occurrence by 25 percent.Moreover,among the significant variables,alcohol/drug impairment was found to have the highest impact in influencing the occurrence of severe crashes.Other significant factors included traffic volume,weekends,speed,work-zone,land use,visibility,seatbelt usage,segment length,undivided/divided highway,and age.
文摘In this paper we study the existence, pathwise uniqueness and homeomorphism flow of strong solutions to a class of one dimensional SDEs driven by infinitely many Brownian motions, and with Yamada- Watanabe diffusion coefficients and distributional drift coefficients.
基金Supported by National Natural Science Foundation of China(Grant Nos.11171007/A011103,11171230 and11471024)
文摘In this paper, we propose a Bayesian semiparametric mean-covariance regression model with known covariance structures. A mixture model is used to describe the potential non-normal distribution of the regression errors. Moreover, an empirical likelihood adjusted mixture of Dirichlet process model is constructed to produce distributions with given mean and variance constraints. We illustrate through simulation studies that the proposed method provides better estimations in some non-normal cases. We also demonstrate the implementation of our method by analyzing the data set from a sleep deprivation study.