In Unsupervised Domain Adaptation(UDA)for person re-identification(re-ID),the primary challenge is reducing the distribution discrepancy between the source and target domains.This can be achieved by implicitly or expl...In Unsupervised Domain Adaptation(UDA)for person re-identification(re-ID),the primary challenge is reducing the distribution discrepancy between the source and target domains.This can be achieved by implicitly or explicitly constructing an appropriate intermediate domain to enhance recognition capability on the target domain.Implicit construction is difficult due to the absence of intermediate state supervision,making smooth knowledge transfer from the source to the target domain a challenge.To explicitly construct the most suitable intermediate domain for the model to gradually adapt to the feature distribution changes from the source to the target domain,we propose the Minimal Transfer Cost Framework(MTCF).MTCF considers all scenarios of the intermediate domain during the transfer process,ensuring smoother and more efficient domain alignment.Our framework mainly includes threemodules:Intermediate Domain Generator(IDG),Cross-domain Feature Constraint Module(CFCM),and Residual Channel Space Module(RCSM).First,the IDG Module is introduced to generate all possible intermediate domains,ensuring a smooth transition of knowledge fromthe source to the target domain.To reduce the cross-domain feature distribution discrepancy,we propose the CFCM Module,which quantifies the difficulty of knowledge transfer and ensures the diversity of intermediate domain features and their semantic relevance,achieving alignment between the source and target domains by incorporating mutual information and maximum mean discrepancy.We also design the RCSM,which utilizes attention mechanism to enhance the model’s focus on personnel features in low-resolution images,improving the accuracy and efficiency of person re-ID.Our proposed method outperforms existing technologies in all common UDA re-ID tasks and improves the Mean Average Precision(mAP)by 2.3%in the Market to Duke task compared to the state-of-the-art(SOTA)methods.展开更多
Domain shift is when the data used in training does not match the ones it will be applied to later on under similar conditions.Domain shift will reduce accuracy in results.To prevent this,domain adaptation is done,whi...Domain shift is when the data used in training does not match the ones it will be applied to later on under similar conditions.Domain shift will reduce accuracy in results.To prevent this,domain adaptation is done,which adapts the pre-trained model to the target domain.In real scenarios,the availability of labels for target data is rare thus resulting in unsupervised domain adaptation.Herein,we propose an innovative approach where source-free domain adaptation models and Generative Adversarial Networks(GANs)are integrated to improve the performance of computer vision or robotic vision-based systems in our study.Cosine Generative Adversarial Network(CosGAN)is developed as a GAN that uses cosine embedding loss to handle issues associated with unsupervised source-relax domain adaptations.For less complex architecture,the CosGAN training process has two steps that produce results almost comparable to other state-of-the-art techniques.The efficiency of CosGAN was compared by conducting experiments using benchmarked datasets.The approach was evaluated on different datasets and experimental results show superiority over existing state-of-the-art methods in terms of accuracy as well as generalization ability.This technique has numerous applications including wheeled robots,autonomous vehicles,warehouse automation,and all image-processing-based automation tasks so it can reshape the field of robotic vision with its ability to make robots adapt to new tasks and environments efficiently without requiring additional labeled data.It lays the groundwork for future expansions in robotic vision and applications.Although GAN provides a variety of outstanding features,it also increases the risk of instability and over-fitting of the training data thus making the data difficult to converge.展开更多
Labeled data scarcity of an interested domain is often a serious problem in machine learning.Leveraging the labeled data from other semantic-related yet co-variate shifted source domain to facilitate the interested do...Labeled data scarcity of an interested domain is often a serious problem in machine learning.Leveraging the labeled data from other semantic-related yet co-variate shifted source domain to facilitate the interested domain is a consensus.In order to solve the domain shift between domains and reduce the learning ambiguity,unsupervised domain adaptation(UDA)greatly promotes the transferability of model parameters.However,the dilemma of over-fitting(negative transfer)and under-fitting(under-adaptation)is always an overlooked challenge and potential risk.In this paper,we rethink the shallow learning paradigm and this intractable over/under-fitting problem,and propose a safer UDA model,coined as Bilateral Co-Transfer(BCT),which is essentially beyond previous well-known unilateral transfer.With bilateral co-transfer between domains,the risk of over/under-fitting is therefore largely reduced.Technically,the proposed BCT is a symmetrical structure,with joint distribution discrepancy(JDD)modeled for domain alignment and category discrimination.Specifically,a symmetrical bilateral transfer(SBT)loss between source and target domains is proposed under the philosophy of mutual checks and balances.First,each target sample is represented by source samples with low-rankness constraint in a common subspace,such that the most informative and transferable source data can be used to alleviate negative transfer.Second,each source sample is symmetrically and sparsely represented by target samples,such that the most reliable target samples can be exploited to tackle underadaptation.Experiments on various benchmarks show that our BCT outperforms many previous outstanding work.展开更多
:Cross-project defect prediction(CPDP)aims to predict the defects on target project by using a prediction model built on source projects.The main problem in CPDP is the huge distribution gap between the source project...:Cross-project defect prediction(CPDP)aims to predict the defects on target project by using a prediction model built on source projects.The main problem in CPDP is the huge distribution gap between the source project and the target project,which prevents the prediction model from performing well.Most existing methods overlook the class discrimination of the learned features.Seeking an effective transferable model from the source project to the target project for CPDP is challenging.In this paper,we propose an unsupervised domain adaptation based on the discriminative subspace learning(DSL)approach for CPDP.DSL treats the data from two projects as being from two domains and maps the data into a common feature space.It employs crossdomain alignment with discriminative information from different projects to reduce the distribution difference of the data between different projects and incorporates the class discriminative information.Specifically,DSL first utilizes subspace learning based domain adaptation to reduce the distribution gap of data between different projects.Then,it makes full use of the class label information of the source project and transfers the discrimination ability of the source project to the target project in the common space.Comprehensive experiments on five projects verify that DSL can build an effective prediction model and improve the performance over the related competing methods by at least 7.10%and 11.08%in terms of G-measure and AUC.展开更多
为了缓解基于伪标签的无监督域自适应行人重识别(UDA person ReID)方法中噪声标签带来的负面影响,提出了一种基于可靠性集成的无监督域自适应行人重识别(UDA-RI)方法。该方法包含渐进式伪标签提炼策略和基于可靠性集成策略两个部分。渐...为了缓解基于伪标签的无监督域自适应行人重识别(UDA person ReID)方法中噪声标签带来的负面影响,提出了一种基于可靠性集成的无监督域自适应行人重识别(UDA-RI)方法。该方法包含渐进式伪标签提炼策略和基于可靠性集成策略两个部分。渐进式伪标签提炼策略通过建立一个不确定性的定量标准来衡量伪标签的可靠性,并采用渐进式采样使得模型得到更加稳定的训练。基于可靠性集成策略考虑了来自不同适应时刻的知识,将来自不同迭代的模型按照可靠性高低分配的权重进行了集成,并将自集成后的两种不同架构的模型再进行集成作为最终推理模型。实验表明,与目前先进的无监督域自适应行人重识别方法相比,UDA-RI方法在Market1501、DukeMTMC-ReID和MSMT17数据集上都取得了优越的性能。展开更多
不同成像模式设备采集的医学图像存在不同程度的分布差异,无监督域自适应方法为了将源域训练的模型泛化到无标注的目标域,通常是将差异分布最小化,使用源域和目标域的共有特征进行结果预测,但会忽略目标域的私有特征.为了解决该问题,文...不同成像模式设备采集的医学图像存在不同程度的分布差异,无监督域自适应方法为了将源域训练的模型泛化到无标注的目标域,通常是将差异分布最小化,使用源域和目标域的共有特征进行结果预测,但会忽略目标域的私有特征.为了解决该问题,文中提出基于目标域增强表示的医学图像无监督跨域分割方法(Enhanced Target Domain Representation Based Unsupervised Cross-Domain Medical Image Segmentation,TreUCMIS).首先,通过共有特征学习获取源域和目标域的共有特征,通过图像重构训练目标域特征编码器,提取目标域完整特征.然后,通过目标域的无监督自学习方式,加强深层特征和浅层特征的共有性.最后,对齐使用共有特征和完整特征得到的预测结果,利用目标域的完整特征分割目标,提高模型在目标域的泛化性.在两个具有CT和MRI双向域自适应任务的医学图像分割数据集(腹部、心脏)上的实验表明TreUCMIS的有效性与优越性.展开更多
文摘In Unsupervised Domain Adaptation(UDA)for person re-identification(re-ID),the primary challenge is reducing the distribution discrepancy between the source and target domains.This can be achieved by implicitly or explicitly constructing an appropriate intermediate domain to enhance recognition capability on the target domain.Implicit construction is difficult due to the absence of intermediate state supervision,making smooth knowledge transfer from the source to the target domain a challenge.To explicitly construct the most suitable intermediate domain for the model to gradually adapt to the feature distribution changes from the source to the target domain,we propose the Minimal Transfer Cost Framework(MTCF).MTCF considers all scenarios of the intermediate domain during the transfer process,ensuring smoother and more efficient domain alignment.Our framework mainly includes threemodules:Intermediate Domain Generator(IDG),Cross-domain Feature Constraint Module(CFCM),and Residual Channel Space Module(RCSM).First,the IDG Module is introduced to generate all possible intermediate domains,ensuring a smooth transition of knowledge fromthe source to the target domain.To reduce the cross-domain feature distribution discrepancy,we propose the CFCM Module,which quantifies the difficulty of knowledge transfer and ensures the diversity of intermediate domain features and their semantic relevance,achieving alignment between the source and target domains by incorporating mutual information and maximum mean discrepancy.We also design the RCSM,which utilizes attention mechanism to enhance the model’s focus on personnel features in low-resolution images,improving the accuracy and efficiency of person re-ID.Our proposed method outperforms existing technologies in all common UDA re-ID tasks and improves the Mean Average Precision(mAP)by 2.3%in the Market to Duke task compared to the state-of-the-art(SOTA)methods.
文摘Domain shift is when the data used in training does not match the ones it will be applied to later on under similar conditions.Domain shift will reduce accuracy in results.To prevent this,domain adaptation is done,which adapts the pre-trained model to the target domain.In real scenarios,the availability of labels for target data is rare thus resulting in unsupervised domain adaptation.Herein,we propose an innovative approach where source-free domain adaptation models and Generative Adversarial Networks(GANs)are integrated to improve the performance of computer vision or robotic vision-based systems in our study.Cosine Generative Adversarial Network(CosGAN)is developed as a GAN that uses cosine embedding loss to handle issues associated with unsupervised source-relax domain adaptations.For less complex architecture,the CosGAN training process has two steps that produce results almost comparable to other state-of-the-art techniques.The efficiency of CosGAN was compared by conducting experiments using benchmarked datasets.The approach was evaluated on different datasets and experimental results show superiority over existing state-of-the-art methods in terms of accuracy as well as generalization ability.This technique has numerous applications including wheeled robots,autonomous vehicles,warehouse automation,and all image-processing-based automation tasks so it can reshape the field of robotic vision with its ability to make robots adapt to new tasks and environments efficiently without requiring additional labeled data.It lays the groundwork for future expansions in robotic vision and applications.Although GAN provides a variety of outstanding features,it also increases the risk of instability and over-fitting of the training data thus making the data difficult to converge.
基金supported by National Key R&D Program of China(2021YFB3100800)National Natural Science Foundation of China(62271090)+1 种基金Chongqing Natural Science Fund(cstc2021jcyjjqX0023)supported by Huawei computational power of Chongqing Artificial Intelligence Innovation Center.
文摘Labeled data scarcity of an interested domain is often a serious problem in machine learning.Leveraging the labeled data from other semantic-related yet co-variate shifted source domain to facilitate the interested domain is a consensus.In order to solve the domain shift between domains and reduce the learning ambiguity,unsupervised domain adaptation(UDA)greatly promotes the transferability of model parameters.However,the dilemma of over-fitting(negative transfer)and under-fitting(under-adaptation)is always an overlooked challenge and potential risk.In this paper,we rethink the shallow learning paradigm and this intractable over/under-fitting problem,and propose a safer UDA model,coined as Bilateral Co-Transfer(BCT),which is essentially beyond previous well-known unilateral transfer.With bilateral co-transfer between domains,the risk of over/under-fitting is therefore largely reduced.Technically,the proposed BCT is a symmetrical structure,with joint distribution discrepancy(JDD)modeled for domain alignment and category discrimination.Specifically,a symmetrical bilateral transfer(SBT)loss between source and target domains is proposed under the philosophy of mutual checks and balances.First,each target sample is represented by source samples with low-rankness constraint in a common subspace,such that the most informative and transferable source data can be used to alleviate negative transfer.Second,each source sample is symmetrically and sparsely represented by target samples,such that the most reliable target samples can be exploited to tackle underadaptation.Experiments on various benchmarks show that our BCT outperforms many previous outstanding work.
基金This paper was supported by the National Natural Science Foundation of China(61772286,61802208,and 61876089)China Postdoctoral Science Foundation Grant 2019M651923Natural Science Foundation of Jiangsu Province of China(BK0191381).
文摘:Cross-project defect prediction(CPDP)aims to predict the defects on target project by using a prediction model built on source projects.The main problem in CPDP is the huge distribution gap between the source project and the target project,which prevents the prediction model from performing well.Most existing methods overlook the class discrimination of the learned features.Seeking an effective transferable model from the source project to the target project for CPDP is challenging.In this paper,we propose an unsupervised domain adaptation based on the discriminative subspace learning(DSL)approach for CPDP.DSL treats the data from two projects as being from two domains and maps the data into a common feature space.It employs crossdomain alignment with discriminative information from different projects to reduce the distribution difference of the data between different projects and incorporates the class discriminative information.Specifically,DSL first utilizes subspace learning based domain adaptation to reduce the distribution gap of data between different projects.Then,it makes full use of the class label information of the source project and transfers the discrimination ability of the source project to the target project in the common space.Comprehensive experiments on five projects verify that DSL can build an effective prediction model and improve the performance over the related competing methods by at least 7.10%and 11.08%in terms of G-measure and AUC.
文摘具有混合记忆的自步对比学习(Self-paced Contrastive Learning,SpCL)通过集群聚类生成不同级别的伪标签来训练网络,取得了较好的识别效果,然而该方法从源域和目标域中捕获的行人数据之间存在典型的分布差异,使得训练出的网络不能准确区别目标域和源域数据域特征。针对此问题,提出了双分支动态辅助对比学习(Dynamic Auxiliary Contrastive Learning,DACL)框架。该方法首先通过动态减小源域和目标域之间的局部最大平均差异(Local Maximum Mean Discrepancy,LMMD),以有效地学习目标域的域不变特征;其次,引入广义均值(Generalized Mean,GeM)池化策略,在特征提取后再进行特征聚合,使提出的网络能够自适应地聚合图像的重要特征;最后,在3个经典行人重识别数据集上进行了仿真实验,提出的DACL与性能次之的无监督域自适应行人重识别方法相比,mAP和rank-1在Market1501数据集上分别增加了6.0个百分点和2.2个百分点,在MSMT17数据集上分别增加了2.8个百分点和3.6个百分点,在Duke数据集上分别增加了1.7个百分点和2.1个百分点。
文摘为了缓解基于伪标签的无监督域自适应行人重识别(UDA person ReID)方法中噪声标签带来的负面影响,提出了一种基于可靠性集成的无监督域自适应行人重识别(UDA-RI)方法。该方法包含渐进式伪标签提炼策略和基于可靠性集成策略两个部分。渐进式伪标签提炼策略通过建立一个不确定性的定量标准来衡量伪标签的可靠性,并采用渐进式采样使得模型得到更加稳定的训练。基于可靠性集成策略考虑了来自不同适应时刻的知识,将来自不同迭代的模型按照可靠性高低分配的权重进行了集成,并将自集成后的两种不同架构的模型再进行集成作为最终推理模型。实验表明,与目前先进的无监督域自适应行人重识别方法相比,UDA-RI方法在Market1501、DukeMTMC-ReID和MSMT17数据集上都取得了优越的性能。
文摘不同成像模式设备采集的医学图像存在不同程度的分布差异,无监督域自适应方法为了将源域训练的模型泛化到无标注的目标域,通常是将差异分布最小化,使用源域和目标域的共有特征进行结果预测,但会忽略目标域的私有特征.为了解决该问题,文中提出基于目标域增强表示的医学图像无监督跨域分割方法(Enhanced Target Domain Representation Based Unsupervised Cross-Domain Medical Image Segmentation,TreUCMIS).首先,通过共有特征学习获取源域和目标域的共有特征,通过图像重构训练目标域特征编码器,提取目标域完整特征.然后,通过目标域的无监督自学习方式,加强深层特征和浅层特征的共有性.最后,对齐使用共有特征和完整特征得到的预测结果,利用目标域的完整特征分割目标,提高模型在目标域的泛化性.在两个具有CT和MRI双向域自适应任务的医学图像分割数据集(腹部、心脏)上的实验表明TreUCMIS的有效性与优越性.