Multi-source Domain Adaptation of Weighted Disentangled Semantic Representation (加权解耦语义表达的多源领域自适应方法)

Abstract: In recent years, deep learning has attracted growing attention from researchers and has been applied successfully in many fields. Despite this success, deep models depend on large amounts of labeled data, and the high cost of data collection and annotation severely limits their wider adoption. Transfer learning relaxes the assumption that training and test data are independent and identically distributed (i.i.d.), and it trains a well-generalized model from labeled source-domain data together with unlabeled target-domain data, which makes it an important direction for extending the application scenarios of deep learning. Among transfer learning settings, multi-source domain adaptation makes full use of the information in multiple source domains and is therefore better suited to real-world applications. Starting from the causal generation mechanism of the data, this study assumes that observed samples are generated jointly by two independent groups of latent variables: semantic latent variables and domain latent variables. Based on this assumption, it proposes a multi-source domain adaptation method built on a multi-measure (multiple distance metric) framework and a weighted disentangled semantic representation. The method uses a dual adversarial training scheme to extract and disentangle the semantic and domain latent variables, employs three different aggregation strategies to obtain a weighted domain-invariant semantic representation, and finally uses that representation for image classification. Comparison and robustness-analysis experiments show that the proposed method yields state-of-the-art performance on many multi-source domain adaptation benchmark datasets and confirm its robustness.
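The abstract describes the aggregation step only at a high level and does not specify the three strategies. As a rough illustration of what a weighted aggregation over source domains could look like, the sketch below weights each source domain by how close its mean semantic feature lies to the target's, then averages the per-source class probabilities. The weighting rule (softmax over negative Euclidean distances), the `temperature` parameter, and all function names are assumptions made for illustration; they are not taken from the paper.

```python
import math

def mean_vector(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def euclidean(a, b):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def source_weights(source_feats, target_feats, temperature=1.0):
    """ASSUMED rule: softmax over negative source-to-target mean distances."""
    t_mean = mean_vector(target_feats)
    scores = [-euclidean(mean_vector(f), t_mean) / temperature
              for f in source_feats]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate(per_source_probs, weights):
    """Weighted average of per-source class-probability vectors."""
    n_classes = len(per_source_probs[0])
    return [sum(w * p[c] for w, p in zip(weights, per_source_probs))
            for c in range(n_classes)]

# Toy data (hypothetical): source A lies near the target, source B far away.
source_feats = [
    [[0.0, 0.0], [0.2, 0.0]],  # semantic features from source A
    [[3.0, 4.0], [3.2, 4.0]],  # semantic features from source B
]
target_feats = [[0.1, 0.0], [0.1, 0.0]]

weights = source_weights(source_feats, target_feats)
probs = aggregate([[0.9, 0.1], [0.2, 0.8]], weights)
print(weights, probs)
```

In this toy setup the nearer source receives almost all of the weight, so the aggregated prediction follows its classifier. A learned weighting or a different distance measure would slot into `source_weights` without changing `aggregate`.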

Authors: CAI Rui-Chu (蔡瑞初); ZHENG Li-Juan (郑丽娟); LI Zi-Jian (李梓健) (School of Computer, Guangdong University of Technology, Guangzhou 510006, China)

Affiliation: School of Computer, Guangdong University of Technology (广东工业大学计算机学院)

Source: Journal of Software (《软件学报》; indexed in EI, CSCD, and the Peking University Core Journals list), 2022, No. 12, pp. 4517-4533 (17 pages)

Funding: National Natural Science Foundation of China (61876043, 61976052); Guangzhou Science and Technology Program (201902010058).

Keywords: transfer learning; multi-source domain adaptation; disentangled representation; variational inference

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
