Rapid road extraction from quick view imagery of high-resolution satellites with transfer learning (cited by: 4)
Abstract (translated from the Chinese): Objective: Traditional road extraction methods are not highly automated and cannot meet the demand for rapid acquisition of road information. Deep-learning road extraction methods mostly pursue higher accuracy and carry considerable network redundancy. Transfer learning, which transfers knowledge from a source domain to a target domain, can complete the target learning task quickly. This paper therefore exploits the rapid availability of quick view data from high-resolution satellites and constructs a transfer-learning deep neural network for fast road extraction. Method: Using a pretrained-network transfer learning approach, the whole road extraction process is divided into two stages. First, the source network is trained on the large open-source database ImageNet, and the best model of this stage is saved. In the second stage, the saved pretrained model is transferred to the target network, whose continued training is guided by the pretrained weight parameters; the quick view data now serve as input, and only task-oriented fine-tuning is performed, which accelerates network training. In short, pretraining extracts general feature parameters, and target training specializes the network for road extraction. Result: Transferring the pretrained model improves validation accuracy by 6.0% and reduces the test time for a single 256×256-pixel image by 49.4% compared with training without transfer. Average accuracy on the quick view test set reaches 88.3%. A 7304×6980-pixel quick view scene of Tianjin Binhai New Area, cropped from one orbit, is processed within 54 s. Compared with other transfer models, the proposed method predicts roads rapidly while maintaining high accuracy. Conclusion: The experiments show that, for high-resolution satellite quick view data, initializing the network with a pretrained model makes effective use of the weight parameters and keeps the model lightweight, improving accuracy while accelerating extraction and thus enabling fast and precise acquisition of road information.

Objective: Quick view data generated by high-resolution satellites provide real-time reception and full resolution for quick view imaging. Such imaging offers a timely source of data for practical applications, such as fire detection, moving window display, disaster observation, and military information acquisition. Road extraction from remote sensing images has been a popular research topic in the field of remote sensing image analysis. Traditional object-oriented methods are not highly automated, and road features require prior knowledge for manual selection and design. These conditions lead to problems in real-time road information acquisition. Popular deep-learning road extraction methods mainly focus on improving precision and lack research on the timeliness of road information extraction. Transfer learning can rapidly complete the task in the target area through weight sharing among different fields and make the model algorithm highly personalized. A transfer-learning deep network for rapidly extracting roads is constructed to utilize quick view data from high-resolution satellites. Method: First, we propose a least-squares fitting method of devignetting to solve the most serious radiation problem in raw quick view data, the vignetting phenomenon of TDICCD (time delay and integration charge-coupled devices). The preprocessed quick view data serve as our training dataset. Then, we choose LinkNet as the target network after comparing the performance of several real-time semantic segmentation networks: ENet, U-Net, LinkNet, and D-LinkNet. LinkNet is efficient in computation and memory, can learn from a relatively small training set, and its residual units ease the training of deep networks. Rich bypass connections link each encoder with its corresponding decoder, so the network can be designed with few parameters. The encoder starts with a 7×7 kernel; subsequent encoder blocks use 3×3 full convolutions in a contracting path that captures context. We use batch normalization in each convolutional layer, followed by ReLU nonlinearity. Reflection padding extrapolates the missing context in the training data for predicting pixels in the border region of the input image. The input of each encoder layer of LinkNet is bypassed to the output of its corresponding decoder, so the spatial information lost in max pooling can be recovered by the decoder and its upsampling operations. Finally, we modify LinkNet to keep it consistent with the ResNet34 network layer features, the so-called fine-tuning, to accelerate the LinkNet training process. Fine-tuning is a useful and efficient method of transfer learning. Initializing LinkNet34 with ResNet34 weights pretrained on ImageNet accelerates network convergence and leads to improved performance at almost no additional cost. Result: In devignetting the quick view data, the least-squares linear fitting method proposed in this study efficiently removes the vignetting strips of the original image and meets the needs of practical applications. In our road extraction experiment, LinkNet34 using the pretrained ResNet34 as encoder shows a 6% improvement in Dice accuracy on the validation dataset compared with a ResNet34 encoder that is not pretrained. The time consumption of a single test feature map is reduced by 39 ms, and the test Dice accuracy reaches 88.3%. Pretrained networks substantially reduce training time, which also helps prevent overfitting. Consequently, we achieve over 88% test accuracy and a 40 ms test time on the quick view dataset. With an input feature map size of 3×256×256 pixels, the Tianjin Binhai scene of 7304×6980 pixels takes 54 s. The original LinkNet, using ResNet18 as its encoder, reaches a Dice coefficient of only 85.7%. We also evaluate ResNet50 and ResNet101 as pretrained encoders: the former does not improve Dice accuracy, whereas the latter takes too much test time. We compare the performance of LinkNet34 with three other popular deep transfer models: two U-Net modifications, TernausNet and AlbuNet, which use VGG11 (Visual Geometry Group) and ResNet34 as encoders respectively, and D-LinkNet. The two U-Net modifications are likely to misclassify roads as background or nonroad objects, such as trees, as road. D-LinkNet has a higher Dice score than LinkNet34 on the validation set, but its test time is 59 ms longer. LinkNet34 avoids the weaknesses of TernausNet and AlbuNet and makes better predictions; it also preserves the small nonroad gap between two adjacent roads, which many methods merge into one. The proposed method generally achieves good connectivity, accurate edges, and clear outlines, extracting entire roads completely and locating them precisely. It is especially suitable for rural linear roads and for extracting area roads in towns; however, extraction of complex urban road networks remains incomplete. Conclusion: In this study, we build a deep transfer-learning neural network, LinkNet34, which uses a pretrained network, ResNet34, as its encoder. ResNet34 lets LinkNet34 learn without any significant increase in the number of parameters, compensates for the limited richness of bottom-layer features in randomly initialized networks, and accelerates network convergence. Our approach demonstrates the improvement that the pretrained encoder brings to LinkNet34 and its better performance relative to other real-time segmentation architectures. The experimental results show that LinkNet34 can handle road properties such as narrowness, connectivity, complexity, and long span to some extent. The architecture proves useful for binary classification with limited data and realizes fast, accurate acquisition of road information. Future research should consider enlarging the quick view database; LinkNet34 could then be pretrained on the expanded database before transfer, reducing the "semantic gap" between the source and target networks and making the data distributions more similar, which is conducive to model initialization.
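The Method section describes removing TDICCD vignetting with a least-squares linear fit but gives no formulas. As a rough illustration of the idea only (the per-column-mean model and all function names here are our assumptions, not the paper's), a closed-form linear fit and a divide-out correction might look like:

```python
def linear_fit(xs, ys):
    """Closed-form least-squares fit of y = a*x + b via the normal equations."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    b = (sy - a * sx) / n
    return a, b

def devignette_column_means(col_means):
    """Fit the linear brightness trend across columns and divide it out,
    renormalizing to the mean brightness level (illustrative model only)."""
    xs = list(range(len(col_means)))
    a, b = linear_fit(xs, col_means)
    level = sum(col_means) / len(col_means)
    return [m * level / (a * x + b) for x, m in zip(xs, col_means)]
```

On a perfectly linear brightness ramp such as `[2, 4, 6, 8]`, the correction flattens every column to the mean level.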
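The two-stage transfer described above amounts to initializing the target network's encoder with the saved pretrained weights and training only the task-specific layers from scratch before fine-tuning. A framework-free toy sketch of that initialization step, with plain dicts standing in for the ResNet34/LinkNet34 state (all names and values are illustrative, not the paper's code):

```python
import random

def init_target_from_pretrained(pretrained_encoder, decoder_size):
    """Stage-2 initialization: copy pretrained encoder weights into the
    target network and randomly initialize the task-specific decoder."""
    target = {name: list(w) for name, w in pretrained_encoder.items()}  # transferred
    target["decoder"] = [random.gauss(0.0, 0.01) for _ in range(decoder_size)]
    return target

# Stage-1 stand-in: weights "pretrained on ImageNet" (illustrative values)
pretrained = {"conv1": [0.5, -0.2], "conv2": [0.1, 0.3]}
model = init_target_from_pretrained(pretrained, decoder_size=4)
```

Fine-tuning then continues training all of `model` on the quick view data, with the transferred weights steering convergence.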
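Accuracy above is reported as the Dice coefficient, which for binary masks A and B is 2|A∩B| / (|A| + |B|). A minimal reference implementation on flattened 0/1 masks (our own sketch, not the paper's code):

```python
def dice_coefficient(pred, target):
    """Dice overlap between two binary masks given as flattened 0/1 sequences."""
    inter = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 2.0 * inter / total if total else 1.0  # empty masks agree perfectly

# e.g. one of two predicted road pixels matches the single true road pixel
score = dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0])  # 2*1 / (2+1) = 2/3
```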
Authors: Zhang Junjun; Wan Guangtong; Zhang Hongqun; Li Shanshan; Feng Xuxiang (Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing 100094, China; University of Chinese Academy of Sciences, Beijing 100049, China)
Source: Journal of Image and Graphics (《中国图象图形学报》; CSCD; Peking University Core), 2020, No. 7, pp. 1501-1512 (12 pages)
Funding: Strategic Priority Research Program of the Chinese Academy of Sciences (Class A), CASEarth Big Earth Data Science Engineering subproject: CASEarth small-satellite product service research (XDA19010401); integration project of the Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences (Y6JD260057).
Keywords: high-resolution satellite; quick view data; fast road extraction; transfer learning; fine-tuning
