摘要
近年来,基于卷积神经网络图像拼接篡改检测算法取得了相当的进展.然而,由于篡改对象的大小和类型不同,现有的大多数模型仍然不能取得令人满意的效果.针对这些问题,提出一种集成多尺度注意力的网络进行图像篡改定位算法.首先在编码器中添加多尺度的双注意力模块——位置注意力和通道注意力,其中,位置注意力模块通过捕捉任意2个特征图的位置关系获取特征图在空间维度上的语义信息依赖关系,使每个像素点均能感知其余位置像素点的信息;通道注意力模块采用与位置注意力相似的自注意力操作捕捉任意2个通道映射之间的关系,使像素点感知到其余通道像素点的信息.考虑到篡改目标大小不同,多尺度注意力模块将特征图划分为多个子区域,在捕获长程语义信息依赖关系的同时也能适应各种形状大小的篡改区域,可以更好地处理不同尺度的拼接篡改图,降低高分辨率特征图的计算开销.在公开数据集CASIA上进行实验的结果表明,所提算法得到的F1和IoU值分别达到62.3%和61.2%,比其他现有算法有明显提升.
Recently,image splicing forgery detection methods based on convolutional neural networks(CNNs)have been widely studied with continuous advancements.However,the performance of most exist-ing models may not be satisfied caused by objects with various types and sizes.In this paper,we propose a new integrated multi-scale attention network to accommodate these problems.Specifically,we append two types of self-attention modules,namely,position attention model and channel attention model,between two convolution layers in feature extraction procedure.For position attention model,we emphasize the semantic interdependencies in spatial dimension by capturing the relationships between any two feature positions so that each pixel can perceive the information of the rest of the pixels.For channel attention model,we apply similar self-attention operations to capture the relationships between any two-channel maps in order that each pixel can perceive the information of other channel pixels.Meanwhile,by dividing the feature maps into multiple subregions,our attention modules can better preserve and highlight the details while capturing long-range semantic information dependencies,which not only concern the spliced forgeries of various sizes but also reduce the computational cost for feature maps with high resolutions.Experimental results show that the F1 and IoU of the integrated multi-scale attention network algorithm on the CASIA test set are 62.3%and 61.2%,respectively,which are significantly improved compared to other existing algorithms.
作者
魏华建
严彩萍
李红
Wei Huajian;Yan Caiping;Li Hong(School of Information Science and Technology,Hangzhou Normal University,Hangzhou 311121;Hangzhou Insvision Technology Co.,Ltd.,Hangzhou 311121)
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2024年第8期1237-1245,共9页
Journal of Computer-Aided Design & Computer Graphics
基金
国家自然科学基金(61902102)
浙江省自然科学基金(LQ19F020004).
关键词
图像拼接定位
多尺度
空间通道关系
自注意力
image splicing localization
multi-scale
spatial-channel relationships
self-attention