期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
1
作者 Kun Jiang Wenzhang Liu +2 位作者 Yuanda Wang Lu Dong Changyin Sun 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第7期1591-1604,共14页
Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that ... Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that agents cannot directly observe. However, most of the existing latent variable discovery methods lack a clear representation of latent variables and an effective evaluation of the influence of latent variables on the agent. In this paper, we propose a new MARL algorithm based on the soft actor-critic method for complex continuous control tasks with confounders. It is called the multi-agent soft actor-critic with latent variable(MASAC-LV) algorithm, which uses variational inference theory to infer the compact latent variables representation space from a large amount of offline experience.Besides, we derive the counterfactual policy whose input has no latent variables and quantify the difference between the actual policy and the counterfactual policy via a distance function. This quantified difference is considered an intrinsic motivation that gives additional rewards based on how much the latent variable affects each agent. The proposed algorithm is evaluated on two collaboration tasks with confounders, and the experimental results demonstrate the effectiveness of MASAC-LV compared to other baseline algorithms. 展开更多
关键词 latent variable model maximum entropy multi-agent reinforcement learning(MARL) multi-agent system
下载PDF
Temporally Preserving Latent Variable Models:Offline and Online Training for Reconstruction and Interpretation of Fault Data for Gearbox Condition Monitoring
2
作者 Ryan Balshaw P.Stephan Heyns +1 位作者 Daniel N.Wilke Stephan Schmidt 《Journal of Dynamics, Monitoring and Diagnostics》 2024年第2期156-177,共22页
Latent variable models can effectively determine the condition of essential rotating machinery without needing labeled data.These models analyze vibration data via an unsupervised learning strategy.Temporal preservati... Latent variable models can effectively determine the condition of essential rotating machinery without needing labeled data.These models analyze vibration data via an unsupervised learning strategy.Temporal preservation is necessary to obtain an informative latent manifold for the fault diagnosis task.In a temporalpreserving context,two approaches exist to develop a condition-monitoring methodology:offline and online.For latent variable models,the available training modes are not different.While many traditional methods use offline training,online training can dynamically adjust the latent manifold,possibly leading to better fault signature extraction from the vibration data.This study explores online training using temporal-preserving latent variable models.Within online training,there are two main methods:one focuses on reconstructing data and the other on interpreting the data components.Both are considered to evaluate how they diagnose faults over time.Using two experimental datasets,the study confirms that models from both training modes can detect changes in machinery health and identify faults even under varying conditions.Importantly,the complementarity of offline and online models is emphasized,reassuring their versatility in fault diagnostics.Understanding the implications of the training approach and the available model formulations is crucial for further research in latent variable modelbased fault diagnostics. 展开更多
关键词 Condition monitoring unsupervised learning latent variable models temporal preservation training approaches
下载PDF
A linear varying coefficient ARCH-M model with a latent variable 被引量:4
3
作者 SONG ZeFang ZHANG XingFa +1 位作者 LI Yuan XIONG Qiang 《Science China Mathematics》 SCIE CSCD 2016年第9期1795-1814,共20页
Motivated by the psychological factor of time-varying risk-return relationship, this paper studies a linear varying coefficient ARCH-M model with a latent variable. Due to the unobservable property of the latent varia... Motivated by the psychological factor of time-varying risk-return relationship, this paper studies a linear varying coefficient ARCH-M model with a latent variable. Due to the unobservable property of the latent variable, a corrected likelihood method is employed for parametric estimation. Estimators are proved to be consistent and asymptotically normal under certain regularity conditions. A simple test statistic is also proposed for testing latent variable effect. Simulation results confirm that the proposed estimators and test perform well.The model is further applied to examine whether the risk-return relationship depends on investor's sentiment in American Market and some explainable results are obtained. 展开更多
关键词 ARCH-M model latent variable corrected likelihood risk-return relationship
原文传递
Bayesian Empirical Likelihood Estimation of Quantile Structural Equation Models 被引量:6
4
作者 ZHANG Yanqing TANG Niansheng 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2017年第1期122-138,共17页
Structural equation model(SEM) is a multivariate analysis tool that has been widely applied to many fields such as biomedical and social sciences. In the traditional SEM, it is often assumed that random errors and exp... Structural equation model(SEM) is a multivariate analysis tool that has been widely applied to many fields such as biomedical and social sciences. In the traditional SEM, it is often assumed that random errors and explanatory latent variables follow the normal distribution, and the effect of explanatory latent variables on outcomes can be formulated by a mean regression-type structural equation. But this SEM may be inappropriate in some cases where random errors or latent variables are highly nonnormal. The authors develop a new SEM, called as quantile SEM(QSEM), by allowing for a quantile regression-type structural equation and without distribution assumption of random errors and latent variables. A Bayesian empirical likelihood(BEL) method is developed to simultaneously estimate parameters and latent variables based on the estimating equation method. A hybrid algorithm combining the Gibbs sampler and Metropolis-Hastings algorithm is presented to sample observations required for statistical inference. Latent variables are imputed by the estimated density function and the linear interpolation method. A simulation study and an example are presented to investigate the performance of the proposed methodologies. 展开更多
关键词 Bayesian empirical likelihood estimating equations latent variable models MCMC algo-rithm quantile regression structural equation models.
原文传递
Dirichlet process and its developments: a survey
5
作者 Yemao XIA Yingan LIU Jianwei GOU 《Frontiers of Mathematics in China》 SCIE CSCD 2022年第1期79-115,共37页
The core of the nonparametric/semiparametric Bayesian analysis is to relax the particular parametric assumptions on the distributions of interest to be unknown and random,and assign them a prior.Selecting a suitable p... The core of the nonparametric/semiparametric Bayesian analysis is to relax the particular parametric assumptions on the distributions of interest to be unknown and random,and assign them a prior.Selecting a suitable prior therefore is especially critical in the nonparametric Bayesian fitting.As the distribution of distribution,Dirichlet process(DP)is the most appreciated nonparametric prior due to its nice theoretical proprieties,modeling flexibility and computational feasibility.In this paper,we review and summarize some developments of DP during the past decades.Our focus is mainly concentrated upon its theoretical properties,various extensions,statistical modeling and applications to the latent variable models. 展开更多
关键词 Nonparametric Bayes Dirichlet process Polya urn prediction Sethuraman representation stick-breaking procedure Chinese restaurant rule mixture of Dirichlet process dependence Dirichlet process Markov Chains Monte Carlo blocked Gibbs sampler latent variable models
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部