摘要
Floating wastes in rivers have specific characteristics such as small scale,low pixel density and complex backgrounds.These characteristics make it prone to false and missed detection during image analysis,thus resulting in a degradation of detection performance.In order to tackle these challenges,a floating waste detection algorithm based on YOLOv7 is proposed,which combines the improved GFPN(Generalized Feature Pyramid Network)and a long-range attention mechanism.Firstly,we import the improved GFPN to replace the Neck of YOLOv7,thus providing more effective information transmission that can scale into deeper networks.Secondly,the convolution-based and hardware-friendly long-range attention mechanism is introduced,allowing the algorithm to rapidly generate an attention map with a global receptive field.Finally,the algorithm adopts the WiseIoU optimization loss function to achieve adaptive gradient gain allocation and alleviate the negative impact of low-quality samples on the gradient.The simulation results reveal that the proposed algorithm has achieved a favorable average accuracy of 86.3%in real-time scene detection tasks.This marks a significant enhancement of approximately 6.3%compared with the baseline,indicating the algorithm's good performance in floating waste detection.
河面漂浮垃圾具有尺度小、像素少、信息量低和背景复杂的特点,容易产生误检、漏检的问题,从而导致检测效果不佳。针对这些问题,本文提出了一种基于YOLOv7的河面漂浮垃圾检测算法,该算法融合了改进的广义特征金字塔网络(GFPN)和长程注意力机制。首先,将YOLOv7中的Neck替换为改进的GFPN网络,从而提供更有效的信息传输,以方便扩展到更深的网络。其次,引入了基于卷积且硬件友好的长程注意力机制,使算法能够快速生成具有全局感受野的注意力图。最后,算法采用WiseIoU优化损失函数,实现自适应梯度增益分配,缓解低质量样本对梯度的负面影响。仿真结果表明,所提出的算法在实时场景检测任务中取得了86.3%的平均准确率,这比基准提高了6.3%,表明该算法在漂浮垃圾检测方面表现优异。
基金
Supported by the Science Foundation of the Shaanxi Provincial Department of Science and Technology,General Program-Youth Program(2022JQ-695)
the Scientific Research Program Funded by Education Department of Shaanxi Provincial Government(22JK0378)
the Talent Program of Weinan Normal University(2021RC20)
the Educational Reform Research Project(JG202342)。