Abstract
In intelligent driving, accurate recognition of traffic signs is important for driving safety. Traffic sign training sets often follow a long-tail distribution, which makes recognition considerably harder. To address the poor tail-class performance of models trained on long-tail datasets, a long-tail traffic sign recognition model based on YOLOX-Tiny is proposed. A long-tail traffic sign dataset is built from the TT100K_2021 (Tsinghua-Tencent 100K 2021) dataset. YOLOX-Tiny is chosen as the base model in view of the number of images in the constructed dataset, its sample distribution, and model size. Equalization loss v2 (EQL v2) is used as the classification loss to balance the gap between head and tail classes in the classifier, and focal loss (FL) is used as the target confidence loss to strengthen the model's prediction of target confidence. In the bidirectional feature pyramid of the neck, the upsampling operator CARAFE, coordinate attention (CA), and the CAR-ASFF module (CARAFE + adaptively spatial feature fusion) are introduced to resolve backpropagation conflicts between feature maps at different levels of the traditional feature pyramid, improve feature reorganization, and highlight target features. The results show that the improved YOLOX-Tiny model reaches an mAP_50 of 43.67% and an mAP_50:95 of 29.98% on the constructed long-tail traffic sign dataset, a higher detection accuracy than the other object detection models compared.
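As an illustration of the focal loss (FL) that the abstract names for the target confidence loss, the following is a minimal Python sketch of the standard binary focal loss. The function name and the default values gamma = 2.0 and alpha = 0.25 are assumptions for illustration; the abstract does not state the hyperparameters or the exact variant used in the paper.

```python
import math


def binary_focal_loss(p, y, gamma=2.0, alpha=0.25, eps=1e-7):
    """Standard binary focal loss for a single prediction (illustrative sketch).

    p     : predicted probability of the positive class, in [0, 1]
    y     : ground-truth label, 0 or 1
    gamma : focusing parameter (assumed value); larger values down-weight
            well-classified examples more strongly
    alpha : weight of the positive class (assumed value)
    """
    # Clamp the probability so that log() never receives 0.
    p = min(max(p, eps), 1.0 - eps)
    # p_t is the probability the model assigns to the true class.
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # Cross-entropy scaled by (1 - p_t)^gamma and the class weight.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


# A confidently correct prediction contributes far less loss than a wrong one.
easy = binary_focal_loss(0.9, 1)
hard = binary_focal_loss(0.1, 1)
```

With gamma = 0 the modulating factor disappears and the expression reduces to alpha-weighted cross-entropy, which is why focal loss is usually described as a re-weighted cross-entropy that concentrates training on hard examples.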
Authors
Wu Yunpeng (伍云鹏)
Fu Yingxiong (付应雄)
Shen Lijun (沈丽君)
Cui Feng (崔峰)
Wu Yunpeng; Fu Yingxiong; Shen Lijun; Cui Feng (Hubei University, Wuhan 430000, China; Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; Beijing Smarter Eye Technology Company, Beijing 100190, China)
Source
Journal of System Simulation (《系统仿真学报》)
CAS
CSCD
Peking University Core Journals (北大核心)
2024, No. 11, pp. 2503-2516 (14 pages)
Funding
National Key R&D Program of China (2018AAA0103103)
National Natural Science Foundation of China (32171461)
Keywords
long-tail distribution
YOLOX
traffic sign recognition
attention mechanism
feature reorganization
multiscale feature fusion