基于特征聚合与边缘检测的伪装目标检测

Camouflage Object Detection Based on Feature Fusion and Edge Detection

下载PDF

导出

摘要针对伪装目标边缘模糊、相关检测模型上下文特征利用率低、边缘特征融合繁琐的问题,提出一种基于特征融合与边缘检测的伪装目标检测模型F2-EDNet。首先构造特征增强模块,细化主干网络的多尺度上下文特征,有效增强伪装目标特征信息;同时,引入跨层特征引导的边缘预测支路以集成来自主干网络底层和顶层的跨层特征,在辅助检测伪装目标边缘的同时,提取边缘特征;最后,提出多尺度特征聚合模块,通过结合注意力机制,充分融合边缘特征与上下文特征,有效提高预测精度。实验结果表明,F2-EDNet在公开数据集CAMO、COD10K和NC4K上的结构相似性、平均精度与召回率、相关性、平均误差指标均值分别提高了1.41%、1.74%、0.14%、0.77%;和同类模型相比,该模型具有更丰富的边缘,定位伪装区域更准确;在实际应用中,模型检测速率可达46帧/s,证明模型具有较好的实时检测能力。 Camouflaged Object Detection(COD)holds significant research and application value in various fields.The ability of deep learning is pushing the performance of target detection algorithms to new heights.Designing a network that effectively integrates features of different layer sizes and eliminates background noise while preserving detailed information presents the main challenges in this field.We propose Feature Fusion and Edge Detection Net(F2-EDNet),a camouflaged object segmentation model based on feature fusion and edge detection.ConvNeXt is used as the backbone to extract multi-scale contextual features.The extensiveness and diversity of features are then enhanced through two approaches.The first approach involves using the Feature Enhancement Module(FEM)to refine and downsize the multi-scale contextual features.The second approach introduces an auxiliary task to fuse cross-layer features through the Cross-layer Guided Edge prediction Branch(CGEB).The process extracts edge features and predicts edge information to increase feature diversity.Additionally,the Multiscale Feature Aggregation Module(MFAM)improves feature fusion by capturing and fusing information about interlayer differences between edge features and contextual features through multiscale attention and feature cascading.The model's prediction results are subjected to deep supervision to obtain the final target detection results.To validate the performance of the proposed model,it is compared qualitatively and quantitatively with eight camouflage object models from the past three years on three publicly available datasets.This comparison aims to observe its detection accuracy.Additionally,a model efficiency analysis is conducted by comparing it with five open-source models.Finally,the module's effectiveness is verified through ablation experiments to determine the optimal structure.The results of a quantitative experiment indicate that on the CAMO dataset,the S-measure,Fmeasure,E-measure correlation and mean absolute error metrics for F2-EDNet are optimal.On the COD10K dataset,the structural similarity metric indicates that the proposed algorithm is optimal,while the mean precision and recall,E-measure and MAEmetrics reach sub-optimal levels.O n N C4K,all four metrics for the proposed algorithm reach optimization.From the visualized detection results,i t c an b e observed that in the camouflage object detection task,the prediction results of the proposed model are more accurate and refined than those of other methods.Compared with other models,although the number of parameters in the proposed model is higher,the simple structure of the model framework enables it to outperform models specifically designed for lightweight purposes,faster than most other models.In comparison of the number of operations,the arithmetic complexity of the proposed model shows a significant decrease compared to a model that also utilizes multi-task learning.The model presented maintains high accuracy in target detection performance while ensuring a reasonable balance between computing speed and the number of operations.The results of ablation experiments demonstrate that each of the current modules plays the expected role,and the model's performance has been optimized.Experimental results show that the proposed algorithm achieves optimal detection accuracy.Compared to suboptimal models,our model demonstrates an average improvement of 1.41%,1.74%,0.14%,and 0.77%on the S-measure,F-measure,MAE,and E-measure indices across three datasets.Additionally,the model's design achieves a reasonable balance between operation volume and operation rate.During performance testing,the model's test speed was 46 fps,striking a balance between detection accuracy and execution efficiency,demonstrating practical application value.In future work,the algorithms will be lightened to further reduce the amount of computation to improve the speed of model inference;in applications,the model can be helpful in directions such as medical segmentation,defect detection with transparent object segmentation through migration learning.

作者丁铖白雪琼吕勇刘洋牛春晖刘鑫 DING Cheng;BAI Xueqiong;LV Yong;LIU Yang;NIU Chunhui;LIU Xin(School of Instrumentation Science and Opto-electronics Engineering,Beijing Information Science and Technology University,Beijing 100192,China)

机构地区北京信息科技大学仪器科学与光电工程学院

出处《光子学报》 EI CAS CSCD 北大核心 2024年第8期260-271,共12页 Acta Photonica Sinica

基金北京市自然科学基金(Nos.4244105,4224094)。

关键词伪装目标检测特征融合边缘检测伪装图像深度学习 Camouflaged object detection Feature fusion Edge detection Camouflaged image Deep learning

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1刘文犀,张家榜,李悦洲,赖宇,牛玉贞.基于边界特征融合和前景引导的伪装目标检测[J].电子学报,2024,52(7):2279-2290.
2苑紫烨,邱宝林,叶妤,温文媖,化定丽,张玉书.面向编码伪装的鲁棒无载体图像隐写方法[J].应用科学学报,2024,42(3):469-485.
3陈元妹,王凤随,王路遥.细化特征引导对抗性解纠缠学习的无监督行人重识别[J].电子测量与仪器学报,2024,38(5):130-138.
4刘良帅,赵建利,刘成龙,赵劭康,董娜.面向烟雾风险的轻量级二阶段检测器[J].制造业自动化,2024,46(8):129-135.
5徐万春,张焱,张景华,凌峰,李顺.基于梯度引导偏振度估算的图像去雾[J].电子学报,2024,52(6):2011-2024.

光子学报

2024年第8期

浏览历史

内容加载中请稍等...

基于特征聚合与边缘检测的伪装目标检测

相关作者

相关机构

相关主题

浏览历史