融合NetVLAD和全连接层的三元神经网络交叉视角场景图像定位被引量：9

Cross-view scene image localization with Triplet Network integrating NetVLAD and Fully Connected Layers

导出

摘要研究场景图像的地理定位问题在室外定位、目标搜寻、军事侦察等领域具有重要意义。针对街景影像与鸟瞰影像之间的交叉视角场景图像匹配与定位问题,本文提出了一种融合可训练局部聚集描述子向量Net VLAD(Net Vector of locally aggregated descriptors)和全连接层的三元神经网络(Triplet Network)定位方法(Tri-Net VLAD)。三元神经网络由三组卷积神经网络CNN(Convolutional Neural Networks)构成,能同时处理3张影像,通过增大不匹配像对间的距离,减小匹配像对间的距离,实现图像检索与匹配;Net VLAD和全连接层的融合可以加强特征间的关联性。本文将CNN提取的局部卷积特征分别通过Net VLAD层和全连接层得到全局描述符与特征向量,并将二者融合,有效地提升了局部特征间的关联性,并保留了不同局部特征之间的差异性,提升了模型的定位精度;改进了DBL loss(Distance-based layer loss),通过加入参数λ增强函数判别困难样本的能力,在提升模型的收敛速度和稳定性的同时也提升了模型的定位精度。在美国Vo and Hays公开数据集上的实验结果表明,Tri-Net VLAD取得了优于MCVPlaces、Triplet e DBL-Net和CVM-Net等现有方法的定位精度,在测试集上的精度高于63%。 Cross-view scene image matching and positioning have a wide range of applications in target search,combating crime,and positioning.With the development of deep learning,neural networks have played an important role in this issue.Given the problem of crossview scene image matching and positioning between street view and bird’s eye images,the neural network model’s convergence is slow,and the feature correlation is weak.This paper proposes a triplet network model(Tri-NetVLAD)that combines NetVLAD and a fully connected layer and improves DBL Loss(ADBL loss).The proposed method can not only improve the convergence speed and stability of the network but also the overall positioning accuracy of the model.The proposed Tri-NetVLAD model extracts the local features of the three input images through a triplet network and inputs the local features to the fully connected and NetVLAD layers to obtain the feature vector and the global feature descriptor.The global feature descriptor can obtain the relative distribution between features,and on this basis,incorporate feature vectors,which can preserve the differences between features to improve the positioning accuracy of the model.ADBL loss improves the model’s ability to discriminate difficult samples by introducing parameters and the positioning accuracy of the model.The proposed Tri-NetVLAD is compared with several existing methods,namely,MCVPlaces,Triplet eDBL-Net,and CVM-Net,and loss functions,namely,contrastive loss,triplet loss,and DBL loss.In the US vo and hays dataset,the highest positioning accuracy of 63.5%is achieved,proving that the triplet network that combines the NetVLAD and fully connected layers can effectively improve the positioning accuracy with the ADBL Loss.Compared with existing methods,the proposed Tri-NetVLAD has the following advantages.(1)The Triplet network can increase the Euclidean distance between unmatched images while reducing the Euclidean distance between matched images.(2)The introduction of NetVLAD can aggregate the local features extracted by CNN to obtain global feature descriptors and the distribution relationship between features.(3)The fusing of the Fully Connected Layer adds the feature vector obtained through the fully connected layer to the global feature descriptor,so that the final feature vector not only represents the distribution relationship between features,but also retains the differences between features.(4)The improved loss function ADBL Loss can accelerate the gradient convergence speed and improve the overall positioning accuracy.

作者薛朝辉周逸飏强永刚刘弋锋林晖 XUE Zhaohui;ZHOU Yiyang;QIANG Yonggang;LIU Yifeng;LIN Hui(School of Earth Sciences and Engineering,Hohai University,Nanjing 211100,China;School of Computer Science and Technology,University of Science and Technology of China,Hefei 230026,China;National Engineering Laboratory for Social Security Risk Perception and Prevention and Control of Big Data Application,China Academy of Electronics,Beijing 100041,China)

机构地区河海大学地球科学与工程学院中国科学技术大学计算机科学与技术学院中国电子科学研究院社会安全风险感知与防控大数据应用国家工程实验室

出处《遥感学报》 EI CSCD 北大核心 2021年第5期1095-1107,共13页 NATIONAL REMOTE SENSING BULLETIN

基金国家自然科学基金(编号:41971279)。

关键词交叉视角场景图像匹配与定位三元神经网络 Net VLAD CNN(Convolutional Neural Networks) cross-view scene image matching and geolocation Triplet Network Net VLAD CNN

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1张洪群,刘雪莹,杨森,李宇.深度学习的半监督遥感图像检索[J].遥感学报,2017,21(3):406-414. 被引量：30
2张路,廖明生.一种顾及上下文的遥感影像模糊聚类[J].遥感学报,2006,10(1):58-65. 被引量：17
3赵理君,唐娉.典型遥感数据分类方法的适用性分析——以遥感图像场景分类为例[J].遥感学报,2016,20(2):157-171. 被引量：18

二级参考文献43

1鲁珂,赵继东,叶娅兰,曾家智.一种用于图像检索的新型半监督学习算法[J].电子科技大学学报,2005,34(5):669-671. 被引量：9
2李德仁,宁晓刚.一种新的基于内容遥感图像检索的图像分块策略[J].武汉大学学报（信息科学版）,2006,31(8):659-662. 被引量：16
3Jensen J R. Introductory Digital Image Processing: A Remote Sensing Perspective[ M]. New Jersey: Prentice Hall, 1996.
4Zadeh L A. Fuzzy Sets[ J]. Information and Control, 1965, 8(3) : 338-353.
5Baraldi A, Blonda P. A Survey of Fuzzy Clustering Algorithms for Pattern Recognition: Part Ⅰ [ J ]. IEEE Trans. on Systems,Man, and Cybernetics: Part B: Cybernetics, 1999, 29(6):778-785.
6Baraldi A, Blonda P. A Survey of Fuzzy Clustering Algorithms for Pattern Recognition : Part Ⅱ[ J ] . IEEE Trans. on Systems,Man, and Cybernetics: Part B: Cybernetics, 1999, 29(6):786-801.
7Zhang J X, Foedy G M. A Fuzzy Classifcation of Sub-urban Land Cover from Remotely Sensed Imagery [ J ]. Int. J. Remote Sensing, 1998, 19(14) : 2721-2738.
8Tso B, Mather P M. Classification Methods for Remotely Sensed Data[M]. Basingstoke: Taylor & Francis, 2001.
9Richards A J, Jia X P. Remote Sensing Digital Image Analysis:An Introduction, 3rd Edition[M]. New York: Springer, 1999.
10Wu F Y. The Potts Model [ J ]. Reviews of Modern Physics,1982, 54(1) : 235-268.

共引文献62

1龚循强,鲁铁定,刘星雷,周秀芳,崔统博.高分辨率遥感图像场景线性回归分类[J].东华理工大学学报（自然科学版）,2019,0(4):425-432. 被引量：6
2杨庆仙,王丽珍,周汝良.基于网格技术的聚类算法在遥感数据中的应用[J].云南大学学报（自然科学版）,2009,31(S1):75-79.
3郑明国,蔡强国.一种新的土地覆盖类别面积估计方法及其在最大似然分类法中的应用[J].资源科学,2007,29(3):214-220. 被引量：7
4叶庆华,陈锋,姚檀栋,王景华,刘强,张雪芹,康世昌.近30年来喜马拉雅山脉西段纳木那尼峰地区冰川变化的遥感监测研究[J].遥感学报,2007,11(4):511-520. 被引量：63
5刘晓云,陈武凡,王振松.基于MRF随机场的多光谱遥感影像最优化分级聚类[J].测绘学报,2007,36(4):400-405. 被引量：1
6龚雪晶,慈林林,姚康泽.基于邻域信息的遥感图像模糊聚类及并行算法设计[J].计算机应用,2007,27(10):2512-2514. 被引量：3
7刘小利,朱国宾,李清泉,贾治革.基于并行Tabu搜索和空间信息约束的遥感影像模糊聚类[J].武汉大学学报（信息科学版）,2009,34(5):527-530. 被引量：1
8聂勇,张镱锂,刘林山,张继平.近30年珠穆朗玛峰国家自然保护区冰川变化的遥感监测[J].地理学报,2010,65(1):13-28. 被引量：83
9NIE Yong,ZHANG Yili,LIU Linshan,ZHANG Jiping.Glacial change in the vicinity of Mt. Qomolangma （Everest）, central high Himalayas since 1976[J].Journal of Geographical Sciences,2010,20(5):667-686. 被引量：25
10来婷婷,王乃昂,黄银洲,张建明,赵力强,许洺山.2002年腾格里沙漠湖泊季节变化研究[J].湖泊科学,2012,24(6):957-964. 被引量：13

同被引文献77

1李文举,张耀星,陈慧玲,李培刚,沙利业.基于TSCD模型的轨道板裂缝检测方法[J].应用科学学报,2022,40(1):155-166. 被引量：2
2孟琭,高恒上,张含光,刘阳.基于全连接神经网络的三维人体姿态估计[J].仪器仪表学报,2020(10):165-177. 被引量：9
3韦娜,耿国华,周明全.基于内容的图像检索系统性能评价[J].中国图象图形学报（A辑）,2004,9(11):1271-1276. 被引量：22
4董卫军,周明全,耿国华,黎晓.基于内容的图像检索技术研究[J].计算机工程,2005,31(10):162-163. 被引量：23
5向友君,谢胜利.图像检索技术综述[J].重庆邮电学院学报（自然科学版）,2006,18(3):348-354. 被引量：39
6王国胤,张清华,胡军.粒计算研究综述[J].智能系统学报,2007,2(6):8-26. 被引量：111
7胡清华,于达仁,谢宗霞.基于邻域粒化和粗糙逼近的数值属性约简[J].软件学报,2008,19(3):640-649. 被引量：290
8纪华,吴元昊,孙宏海,王延杰.结合全局信息的SIFT特征匹配算法[J].光学精密工程,2009,17(2):439-444. 被引量：70
9秦志新,裴东兴.基于内容的图像检索技术概述[J].数字技术与应用,2012,30(1):159-159. 被引量：3
10张建华,孔繁涛,吴建寨,翟治芬,韩书庆,曹姗姗.基于改进VGG卷积神经网络的棉花病害识别模型[J].中国农业大学学报,2018,23(11):161-171. 被引量：107

引证文献9

1施群山,蓝朝桢,徐青,周杨,胡校飞.面向卫星遥感影像检索定位的深度学习全局表征模型评估与分析[J].地球信息科学学报,2022,24(11):2245-2263. 被引量：2
2左浩.基于Netvlad神经网络的变磁力吸附爬壁机器人控制系统设计[J].计算机测量与控制,2023,31(1):106-112.
3李子彧,周维勋,耿万轩.联合类别筛选与重排序的交叉视角图像地理定位[J].测绘通报,2023(2):40-45. 被引量：1
4傅兴宇,陈颖悦,陈玉明,江海亮,黄涛.一种全连接粒神经网络分类方法[J].山西大学学报（自然科学版）,2023,46(1):91-100. 被引量：2
5朱昊,田翮.季节变化下的基于学习的图像检索和位置识别[J].信息与电脑,2023,35(11):182-185.
6刘琳,赵化启.基于分段组合特征降维的交叉视角目标定位研究[J].现代计算机,2023,29(13):39-44.
7何柳,刘姝妍,李润岐,陶剑,安然.基于自注意力和类监督的遥感图像跨模态检索[J].火力与指挥控制,2023,48(10):84-92. 被引量：1
8张新雨.高速铁路钢轨焊接接头表面裂纹智能检测方法[J].焊接技术,2023,52(11):114-119. 被引量：1
9何清,李丽琳,林子安.基于FEEMD-GRU-FC模型的滑坡位移预测[J].人民长江,2024,55(7):108-114.

二级引证文献7

1侯贤宇,陈玉明,吴克寿.多采样近似粒集成学习[J].南京大学学报（自然科学版）,2024,60(1):118-129.
2何柳,刘姝妍,李润岐,陶剑,安然.基于自注意力和类监督的遥感图像跨模态检索[J].火力与指挥控制,2023,48(10):84-92. 被引量：1
3赵伟.钢轨焊接接头圆弧区域超声爬波探伤技术要点探析[J].中国机械,2024(4):40-43.
4李晓春,黄丹阳,罗易智,吴宏伊.基于人工智能视频分析技术的收费站拥堵智能感知平台[J].交通建设与管理,2024(2):142-145.
5郑晨颖,陈颖悦,侯贤宇,江连吉,廖亮.一种邻域粒的模糊C均值聚类算法[J].山东大学学报（理学版）,2024,59(5):35-44.
6吴志斌.基于改进密集连接网络的土地卫片场景分类方法[J].北京测绘,2024,38(9):1341-1345.
7盛怡宁,赵理君,张正,崔绍龙,饶梦彬,唐娉.跨视角图像地理定位方法综述[J].中国图象图形学报,2024,29(9):2716-2736.

1朱明明,尚丹婷,曹育森,雷涛,夏娟娟,李钊,景裕,张林媛,王健琪,路国华.基于高光谱的低对比度伤员目标搜寻技术研究[J].中国医疗设备,2021,36(6):5-8.
2唐虎生.激光供能无人机系统及应用前景[J].中国科技信息,2021(12):24-27. 被引量：1
3李艳艳,潘晋孝,刘宾.基于相似度匹配的场景深度估计方法[J].国外电子测量技术,2021,40(3):37-40. 被引量：4
4Yujun Xie,Zhen Li.Development of aggregated state chemistry accelerated by aggregation-induced emission[J].National Science Review,2021,8(6):14-16. 被引量：5
5薛雨.基于NLP和深度学习方法的英文情感分析方法研究[J].电子设计工程,2021,29(13):95-99. 被引量：5
6王帅,孙喜民,高亚斌,孙博.基于神经协同过滤的个性化商品推荐方法[J].信息技术,2021,45(6):143-147. 被引量：3
7余志锋,熊邦书,熊天旸,欧巧凤,李新民.基于VMD-CWT和改进CNN的直升机轴承故障诊断[J].航空动力学报,2021,36(5):948-958. 被引量：18
8莫海军,陈杰,王顺栋.结合点云纹理信息的快速点特征直方图描述子算法[J].华南理工大学学报（自然科学版）,2021,49(6):56-65. 被引量：2
9Wenchang Xiao,Danna Yeerken,Jia Li,Zhangfu Li,Lanfang Jiang,Dan Li,Ming Fu,Liying Ma,Yongmei Song,Weimin Zhang,Qimin Zhan.Nip promotes autophagy through facilitating the interaction of Rab7 and FYCO1[J].Signal Transduction and Targeted Therapy,2021,6(5):1559-1572. 被引量：1
10Lan Chen,Juntao Ye,Xiaopeng Zhang.Multi-Feature Super-Resolution Network for Cloth Wrinkle Synthesis[J].Journal of Computer Science & Technology,2021,36(3):478-493. 被引量：1

遥感学报

2021年第5期

浏览历史

内容加载中请稍等...

融合NetVLAD和全连接层的三元神经网络交叉视角场景图像定位被引量：9

参考文献3

二级参考文献43

共引文献62

同被引文献77

引证文献9

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

融合NetVLAD和全连接层的三元神经网络交叉视角场景图像定位 被引量：9

参考文献3

二级参考文献43

共引文献62

同被引文献77

引证文献9

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

融合NetVLAD和全连接层的三元神经网络交叉视角场景图像定位被引量：9