期刊文献+

并行注意力机制在图像语义分割中的应用 被引量:7

Application of Parallel Attention Mechanism in Image Semantic Segmentation
下载PDF
导出
摘要 在卷积神经网络中融入注意力机制越来越成为语义分割强化特征学习的重要方法。提出了一种融合了局部注意力和全局注意力的卷积神经网络。输入图像经主干网络的特征提取,并行输入给局部注意力和全局注意力模块。局部注意力模块以编码-解码结构实现多尺寸的局部特征融合,全局注意力模块根据每个像素与其所在特征图上所有像素的相关性捕获全局信息。融合两个注意力模块不仅减少了局部信息的丢失,而且捕获了具有长距离依赖的全局信息,有效提升了特征提取的能力。采用一种数据相关的上采样方法代替双线性插值法恢复特征图至输入尺寸,同时改善了分割效果。采用Dice Loss损失函数并针对样本不平衡问题在类别损失前加入权重系数进一步改善了分割效果。该方法在药丸污点数据集、药丸缺损数据集以及走廊数据集上分别得到了96.39%、93.44%、96.28%的平均交并比结果。 The integration of attention mechanism in convolutional neural networks has increasingly become an important method for semantic segmentation to strengthen feature learning.This paper proposes a convolutional neural network that combines local attention and global attention.The input image is extracted by the backbone network and input to the local attention and global attention modules in parallel.The local attention module uses an encoding-decoding structure to achieve multi-scale local feature fusion.The global attention module captured global information based on the correlation between each pixel and all pixels on the feature map.Fusion of two attention modules not only reduce the loss of local information but also capture global information with long distance dependencies.This paper uses a data-dependent upsampling method to replace the bilinear interpolation method to upsample the feature map to the input size and improves the segmentation results.This paper uses Dice Loss loss function and adds weight coefficients before the category loss for the imbalanced of sample to further improve the segmentation results.The method obtains Mean IoU scores of 96.39%,93.44%,96.28% on the pill contamination dataset,pill crack dataset,and corridor dataset,respectively.
作者 张汉 张德祥 陈鹏 章军 王兵 ZHANG Han;ZHANG Dexiang;CHEN Peng;ZHANG Jun;WANG Bing(School of Electrical Engineering and Automation,Anhui University,Hefei 230601,China;National Engineering Research Center for Agro-Ecological Big Data Analysis&Application,Internet Academy,Anhui University,Hefei 230601,China;School of Electrical and Information Engineering,Anhui University of Technology,Ma’anshan,Anhui 201804,China)
出处 《计算机工程与应用》 CSCD 北大核心 2022年第9期151-160,共10页 Computer Engineering and Applications
基金 国家自然科学基金(62072002,61672035)。
关键词 局部注意力 全局注意力 数据相关上采样 样本不平衡 local attention global attention data-dependent upsampling imbalanced of sample
  • 相关文献

参考文献8

二级参考文献51

  • 1孙晓鹏,李华.三维网格模型的分割及应用技术综述[J].计算机辅助设计与图形学学报,2005,17(8):1647-1655. 被引量:49
  • 2严国莉,黄山,王新增,凌彤辉.基于局部动态阈值的矾花图像分割[J].计算机应用与软件,2006,23(10):105-107. 被引量:6
  • 3Dizenzo S, CinqueL, LeviaidiS. Image Thresholding Using Fuzzy Entropies. IEEE Trans on System, Man, and Cybernetics, B. 1998, 28(1): 15-23.
  • 4章毓晋.图像分割[M].北京:科学出版社,2001.34.
  • 5章毓晋.图像分割[M].北京:科学出版社,2001..
  • 6Pratt W K. Digital Image Processing[M]. New York: Wiley,1991.
  • 7CastlemanKennethR.数字图像处理[M].北京:电子工业出版社,1998..
  • 8Mangan A,Whitaker R.Surface segmentation using morphological watersheds[C] //Proceedings of IEEE Visualization'98,Chapel Hill,North Carolina,1998:29-32
  • 9Mangan A,Whitaker R.Partitioning 3D surface meshes using watershed segmentation[J].IEEE Transactions on Visualization and Computer Graphics,1999,5(4):308-321
  • 10Meyer M,Desbrun M,Schroder P,et al.Discrete differentialgeometry operators for triangulated 2 manifolds[C]//Proceedings of Visualization and Mathematics,Hege C,Polthier Keds.2003:35-57

共引文献485

同被引文献52

引证文献7

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部