Learning transferrable attention for joint balanced domain adaptation (注意力迁移的联合平衡领域自适应)

Cited by: 1


Abstract

Objective: Many image recognition methods perform well when the training and test data are drawn from the same distribution. In practical scenarios this assumption often fails, and recognition accuracy drops. Domain adaptation is an effective way to address this problem: it targets settings in which the data come from two related domains with different distributions. Because labeling data requires substantial manual effort, unsupervised learning has become a clear trend in image recognition. Transfer learning extracts knowledge from labeled data in the source domain and transfers it to the unlabeled target domain.

Method: Based on an analysis of the data distributions, we propose a joint balanced adaptation method built on an attention transfer mechanism, which transfers feature representations extracted from labeled source-domain data to the unlabeled target domain. First, we transfer the spatial category information of the labeled source domain to the unlabeled target domain through attention transfer. Neural networks reflect basic characteristics of the human brain, and attention is an important part of human visual experience that is closely related to perception. Artificial attention mechanisms have developed as artificial neural networks have become popular in fields such as computer vision and pattern recognition, and letting a system learn which objects to attend to has become a tool for understanding the mechanisms behind neural networks. By defining the attention of convolutional neural networks (CNNs), attention information can be used to improve image recognition accuracy. In this study, attention is treated as a set of spatial maps that encode the regions of the network input on which the network focuses most when determining its output. Second, we introduce a prior distribution over the network parameters based on the target dataset, and give each domain alignment layer the ability to learn automatically the degree of feature alignment that should be pursued at its level of the network. We aim to exploit rich source-domain attributes through cross-domain learning and to capture complex cross-domain knowledge by embedding cross-dataset information, so as to minimize the original loss function of the learning tasks in both domains as far as possible. Traditional machine learning recognizes refined features obtained by preprocessing raw data according to human prior knowledge; because recognition results depend on feature quality, machine learning experts have spent much of their time designing features. Recent breakthroughs in object recognition have mainly come from deep CNNs, whose feature extraction and image representation capabilities are more powerful than those of manually defined features such as HOG and SIFT. The higher a network layer is, the more specific its features are to the target classification task. Meanwhile, features on successive layers interact in a complex and fragile way, and neurons in neighboring layers co-adapt during training. As a result, the transferability of features and classifiers decreases as the cross-domain discrepancy increases. Finally, we describe the input distribution of each domain-specific alignment layer by introducing cross-domain biases, which quantitatively indicates the degree of domain adaptation learned by each layer. We also adaptively change the weight of each category in the dataset. Because a deep CNN is a unified training and prediction framework that combines multi-level feature extractors and recognizers, end-to-end processing is particularly important, and our model design makes full use of the end-to-end capability of CNNs.

Result: The method achieves average recognition accuracies of 77.6% on Office-31 and 90.7% on Office-Caltech. It significantly outperforms traditional methods based on handcrafted features and is comparable with state-of-the-art methods. Although not every individual transfer task achieves the best result, the average recognition accuracy over the six transfer tasks improves on current mainstream methods.

Conclusion: Transferring image features extracted from labeled source-domain data to the unlabeled target domain effectively addresses the problem of data from two related but differently distributed domains. The method makes full use of the spatial location information of the labeled source-domain data through attention transfer and uses a deep CNN to learn the degree of feature alignment between domains automatically. Learning ability depends largely on the degree of inter-domain correlation, which is a major limitation of transfer learning; knowledge transfer is ineffective if the domains share no similarity. We therefore take the feature correlation between the source and target domains into account and adaptively change the weight of each category in the dataset. The method not only obtains high recognition accuracy but also learns the degree of feature alignment between domains automatically, and it verifies that inter-domain feature transfer can improve network optimization.
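The abstract describes attention as a set of spatial maps derived from a CNN but does not give a formula. As a minimal sketch only, the snippet below assumes the common activation-based formulation, in which the absolute activations of a layer are raised to a power `p`, summed over channels, and normalized. The function names `attention_map` and `attention_transfer_loss`, the exponent `p`, and the random tensors standing in for source and target activations are all hypothetical and are not taken from the paper.

```python
import numpy as np


def attention_map(features: np.ndarray, p: int = 2) -> np.ndarray:
    """Collapse a (C, H, W) activation tensor into a unit-norm vector of length H*W.

    Assumption: attention is the channel-wise sum of |activation|**p.
    """
    if features.ndim != 3:
        raise ValueError("expected a (C, H, W) tensor")
    spatial = np.sum(np.abs(features) ** p, axis=0)  # shape (H, W)
    flat = spatial.reshape(-1)
    norm = np.linalg.norm(flat)
    # Leave an all-zero map unchanged to avoid dividing by zero.
    return flat / norm if norm > 0 else flat


def attention_transfer_loss(source_features: np.ndarray,
                            target_features: np.ndarray,
                            p: int = 2) -> float:
    """L2 distance between the normalized attention maps of two layers."""
    diff = attention_map(source_features, p) - attention_map(target_features, p)
    return float(np.linalg.norm(diff))


# Hypothetical activations from the same layer for a source and a target image.
rng = np.random.default_rng(0)
source = rng.normal(size=(8, 4, 4))
target = rng.normal(size=(8, 4, 4))

loss_same = attention_transfer_loss(source, source)
loss_diff = attention_transfer_loss(source, target)
```

Because each map is normalized before comparison, the loss ignores the overall activation scale and penalizes only differences in where the two networks focus. In a training setup this term would typically be added to the classification loss with a weighting coefficient; that coefficient is likewise not specified in the abstract.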

Authors: Wang Ronggui (汪荣贵); Yao Xuchen (姚旭晨); Yang Juan (杨娟); Xue Lixia (薛丽霞) (School of Computer and Information, Hefei University of Technology, Hefei 230601, China)

Affiliation: School of Computer and Information, Hefei University of Technology

Source: Journal of Image and Graphics (中国图象图形学报), CSCD, Peking University Core Journal, 2019, No. 7, pp. 1116-1125 (10 pages)

Keywords: transfer learning; domain adaptation; attention mechanism; unsupervised learning; image recognition; convolutional neural networks

Classification number: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

