Unsupervised multi-modal image translation is an emerging area of computer vision whose goal is to transform an image from a source domain into many diverse styles in a target domain. However, state-of-the-art approaches rely on a multi-generator mechanism to model the different domain mappings, which makes network training inefficient and causes mode collapse, limiting the diversity of the generated images. To address this issue, this paper introduces a multi-modal unsupervised image translation framework that uses a single generator to perform multi-modal image translation. First, a domain code is introduced to explicitly control the different generation tasks. Second, the squeeze-and-excitation (SE) mechanism and a feature attention (FA) module are incorporated. Finally, the model integrates multiple optimization objectives to ensure efficient multi-modal translation. Qualitative and quantitative experiments on multiple unpaired benchmark image translation datasets demonstrate the benefits of the proposed method over existing techniques. Overall, the experimental results show that the proposed method is versatile and scalable.
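As a rough illustration of two ingredients named in the abstract, the sketch below shows a squeeze-and-excitation channel gate and one common way to feed a domain code to a single generator. It is not the paper's implementation: the function names, the bias-free two-layer gate, the nested-list tensor layout `[C][H][W]`, and the one-hot "extra channels" conditioning are all assumptions made for illustration. The feature attention (FA) module is not sketched because the abstract does not describe it.

```python
import math


def squeeze_excite(feature_map, w1, w2):
    """Recalibrate the channels of a feature map stored as nested lists [C][H][W].

    w1 has shape [C // r][C] and w2 has shape [C][C // r]; biases are omitted
    (an assumption made to keep the sketch short).
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feature_map
    ]
    # Excitation, step 1: bottleneck fully connected layer followed by ReLU.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    # Excitation, step 2: expand back to C values and squash with a sigmoid.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every channel by its gate, which lies in (0, 1).
    scaled = [[[g * v for v in row] for row in ch] for ch, g in zip(feature_map, gates)]
    return scaled, gates


def append_domain_code(feature_map, domain_index, num_domains):
    """Concatenate a one-hot domain code as constant extra channels (hypothetical scheme)."""
    height, width = len(feature_map[0]), len(feature_map[0][0])
    code = [1.0 if d == domain_index else 0.0 for d in range(num_domains)]
    planes = [[[c] * width for _ in range(height)] for c in code]
    return feature_map + planes


# Toy input: 2 channels of size 2x2, bottleneck width 1.
x = [[[1.0, 1.0], [1.0, 1.0]], [[3.0, 3.0], [3.0, 3.0]]]
w1 = [[0.5, 0.5]]
w2 = [[0.0], [1.0]]
scaled, gates = squeeze_excite(x, w1, w2)
conditioned = append_domain_code(x, domain_index=1, num_domains=3)
```

With a conditioning input of this kind, one generator can be steered toward a different target domain by changing only `domain_index`, which is the general idea behind replacing several domain-specific generators with a single one.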
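Owing to adversarial characteristics of moving-target tracking systems such as maneuvering and stealth, together with environmental factors such as non-line-of-sight conditions, interference, and occlusion, system modeling, estimation, and identification can increasingly not avoid the influence of complex system characteristics such as nonlinearity, non-Gaussianity, and unknown parameters. For the nonlinear state estimation problem of a moving target with inaccurate prior information on the process noise and non-Gaussian measurement noise, a noise-adaptive variational Bayes (VB) filtering algorithm based on the natural gradient is proposed. First, exploiting the unified form shared by exponential-family distributions, a parameterized inverse-Wishart (IW) distribution is constructed as the conjugate prior of the one-step state prediction error covariance, and a Student's t distribution is chosen to reconstruct the likelihood function, which is non-Gaussian because of randomly missing measurements. Second, within the variational Bayesian optimization framework, mean-field theory is used to approximately factorize the joint posterior distribution of the state variables into independent variational distributions, and on this basis the coordinate ascent method is used to update the variational distribution parameters of each variable. Then, using the Fisher information matrix, the natural gradient of the evidence lower bound with respect to the state estimate and its estimation error covariance is derived for maximizing that bound, so that the approximate distribution of the nonlinear state posterior is driven along this gradient and "tightly" approximates the state posterior probability density function (PDF). Theoretical analysis and simulation experiments show that, compared with traditional nonlinear filtering methods, the proposed algorithm adapts better to noise uncertainty and achieves higher state estimation accuracy.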
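For orientation, the generic textbook forms of the quantities named in the abstract above (mean-field factorization, coordinate ascent update, evidence lower bound, and natural gradient) can be written as follows. This is only a sketch in assumed notation: the symbols $\theta$ (all latent variables, such as the state and the prediction error covariance), $z$ (measurements), $\lambda$ (variational parameters), and $\rho$ (step size) are not taken from the paper, and the paper's own derivation may differ.

```latex
% Mean-field approximation of the joint posterior
q(\theta) = \prod_{j=1}^{m} q_j(\theta_j) \approx p(\theta \mid z)

% Coordinate ascent update for one factor, holding the others fixed
\log q_j^{*}(\theta_j) = \mathbb{E}_{q_{-j}}\!\left[\log p(\theta, z)\right] + \text{const}

% Evidence lower bound and its gap to the log evidence
\mathcal{L}(q) = \mathbb{E}_{q}\!\left[\log p(\theta, z)\right] - \mathbb{E}_{q}\!\left[\log q(\theta)\right],
\qquad
\log p(z) - \mathcal{L}(q) = \mathrm{KL}\!\left(q(\theta)\,\|\,p(\theta \mid z)\right) \ge 0

% Natural gradient step on the variational parameters
F(\lambda) = \mathbb{E}_{q_\lambda}\!\left[\nabla_\lambda \log q_\lambda(\theta)\,\nabla_\lambda \log q_\lambda(\theta)^{\top}\right],
\qquad
\lambda \leftarrow \lambda + \rho\, F(\lambda)^{-1} \nabla_\lambda \mathcal{L}(\lambda)
```

Because the gap between $\log p(z)$ and $\mathcal{L}(q)$ is the Kullback-Leibler divergence from $q$ to the true posterior, increasing the bound is what makes the approximation "tight".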
Funding: the National Natural Science Foundation of China (No. 61976080); the Academic Degrees & Graduate Education Reform Project of Henan Province (No. 2021SJGLX195Y); the Teaching Reform Research and Practice Project of Henan Undergraduate Universities (No. 2022SYJXLX008); the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform (No. YJSJG2023XJ006).