摘要
针对烟雾形状不规则、呈半透明状且边界模糊导致烟雾分割困难的问题,提出一种融合全局和局部信息的双分支实时烟雾分割算法。该算法设计了轻量级的Transformer分支和卷积神经网络分支分别提取烟雾的全局特征和局部特征,Transformer分支和卷积神经网络分支共同作用,可以在充分学习烟雾的长距离像素依赖关系的同时保留烟雾细节信息,从而准确区分烟雾和背景像素,改善烟雾分割效果。同时该结构可以满足实际烟雾检测任务的实时性要求;基于多层感知机的解码器充分利用不同尺度的烟雾特征图,并进一步建模烟雾全局上下文信息,增强模型对多尺度烟雾的感知能力,从而提升烟雾分割精度;而且解码器结构简单,可以降低解码器部分的计算量。该算法在自建烟雾分割数据集上的平均交并比为92.88%,模型参数量为2.96 M,推理速度为56.94帧/s。该算法在公开数据集上的综合性能优于其他烟雾检测算法。实验结果表明,该算法分割烟雾的准确率高,推理速度快,可以满足实际烟雾检测任务的准确性和实时性需求。
The smoke segmentation is challenging because the smoke is irregular and translucent and the boundary is fuzzy.A dual-branch real-time smoke segmentation algorithm based on global and local information is proposed to solve this problem.In this algorithm,a lightweight Transformer branch and a convolutional neural networks branch are designed to extract the global and local features of smoke respectively,which can fully learn the long-distance pixel dependence of smoke and retain the details of smoke.It can distinguish smoke and background accurately and improve the accuracy of smoke segmentation.It can satisfy the real-time requirement of the actual smoke detection tasks.The multilayer perceptron decoder makes full use of multi-scale smoke features and further models the global context information of smoke.It can enhance the perception of multi-scale smoke,and thus improve the accuracy of smoke segmentation.The simple structure can reduce the computation of the decoder.The algorithm reaches 92.88%mean intersection over union on the self-built smoke segmentation dataset with 2.96M parameters and a speed of 56.94 frames per second.The comprehensive performance of the proposed algorithm is better than that of other smoke detection algorithms on public dataset.Experimental results show that the algorithm has a high accuracy and fast inference speed.The algorithm can meet the accuracy and real-time requirements of actual smoke detection tasks.
作者
张欣雨
梁煜
张为
ZHANG Xinyu;LIANG Yu;ZHANG Wei(School of Microelectronics,Tianjin University,Tianjin 300072,China)
出处
《西安电子科技大学学报》
EI
CAS
CSCD
北大核心
2024年第1期147-156,共10页
Journal of Xidian University
基金
天津市新一代人工智能科技重大专项(19ZXZNGX00030)。