基于集成多尺度注意力的图像篡改定位

Image Tampering Localization Based on Integrated Multiscale Attention

下载PDF

导出

摘要近年来,基于卷积神经网络图像拼接篡改检测算法取得了相当的进展.然而,由于篡改对象的大小和类型不同,现有的大多数模型仍然不能取得令人满意的效果.针对这些问题,提出一种集成多尺度注意力的网络进行图像篡改定位算法.首先在编码器中添加多尺度的双注意力模块——位置注意力和通道注意力,其中,位置注意力模块通过捕捉任意2个特征图的位置关系获取特征图在空间维度上的语义信息依赖关系,使每个像素点均能感知其余位置像素点的信息;通道注意力模块采用与位置注意力相似的自注意力操作捕捉任意2个通道映射之间的关系,使像素点感知到其余通道像素点的信息.考虑到篡改目标大小不同,多尺度注意力模块将特征图划分为多个子区域,在捕获长程语义信息依赖关系的同时也能适应各种形状大小的篡改区域,可以更好地处理不同尺度的拼接篡改图,降低高分辨率特征图的计算开销.在公开数据集CASIA上进行实验的结果表明,所提算法得到的F1和IoU值分别达到62.3%和61.2%,比其他现有算法有明显提升. Recently,image splicing forgery detection methods based on convolutional neural networks(CNNs)have been widely studied with continuous advancements.However,the performance of most exist-ing models may not be satisfied caused by objects with various types and sizes.In this paper,we propose a new integrated multi-scale attention network to accommodate these problems.Specifically,we append two types of self-attention modules,namely,position attention model and channel attention model,between two convolution layers in feature extraction procedure.For position attention model,we emphasize the semantic interdependencies in spatial dimension by capturing the relationships between any two feature positions so that each pixel can perceive the information of the rest of the pixels.For channel attention model,we apply similar self-attention operations to capture the relationships between any two-channel maps in order that each pixel can perceive the information of other channel pixels.Meanwhile,by dividing the feature maps into multiple subregions,our attention modules can better preserve and highlight the details while capturing long-range semantic information dependencies,which not only concern the spliced forgeries of various sizes but also reduce the computational cost for feature maps with high resolutions.Experimental results show that the F1 and IoU of the integrated multi-scale attention network algorithm on the CASIA test set are 62.3%and 61.2%,respectively,which are significantly improved compared to other existing algorithms.

作者魏华建严彩萍李红 Wei Huajian;Yan Caiping;Li Hong(School of Information Science and Technology,Hangzhou Normal University,Hangzhou 311121;Hangzhou Insvision Technology Co.,Ltd.,Hangzhou 311121)

机构地区杭州师范大学信息科学与技术学院杭州启源视觉科技有限公司

出处《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2024年第8期1237-1245,共9页 Journal of Computer-Aided Design & Computer Graphics

基金国家自然科学基金(61902102) 浙江省自然科学基金(LQ19F020004).

关键词图像拼接定位多尺度空间通道关系自注意力 image splicing localization multi-scale spatial-channel relationships self-attention

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1黄镇文,吴少杰.卷积神经网络在新能源光伏电站发电功率预测中的应用[J].今日制造与升级,2023(9):55-57. 被引量：2
2李莉,徐张昕子,王存睿,战国栋.一种汉字字体家族生成算法[J].大连民族大学学报,2024,26(5):439-443.
3赵洁,常皓婵,武斌.融入混合注意力的低缩放因子SeamCarving篡改检测算法[J].智能科学与技术学报,2024,6(2):244-252.
4高翔,白静,薛珮芸,董浙南,强彦.解耦知识蒸馏优化的域自适应跨库情感识别[J].现代电子技术,2024,47(17):173-180.
5李岩超,史卫亚,冯灿.面向无人机航拍小目标检测的轻量级YOLOv8检测算法[J].计算机工程与应用,2024,60(17):167-178.
6杨锦光,熊菲,顾峻瑜,席炜亭.基于图划分的分布式推荐系统[J].数据与计算发展前沿（中英文）,2024,6(5):102-110.
7赵继发,王呈,荣英佼.融合双目信息的队列姿态检测[J].计算机应用研究,2024,41(9):2860-2866.
8周玉国,张金超,孙伊萍,于春风,周立俭.基于TET与DSRNet-AttBiLSTM的滚动轴承剩余使用寿命预测[J].振动与冲击,2024,43(19):163-173.
9王姣,吴萌,相建凯.空洞卷积优化U^(2)-Net的X光快速分散检测模型[J].激光与光电子学进展,2024,61(15):180-189.
10谷凤伟,陆军,刘子玄,蔡成涛.基于深度卷积判别网络的人脸比对方法[J].哈尔滨工程大学学报,2024,45(9):1770-1782.

计算机辅助设计与图形学学报

2024年第8期

浏览历史

内容加载中请稍等...

基于集成多尺度注意力的图像篡改定位

相关作者

相关机构

相关主题

浏览历史