Rapid and precise vehicle recognition and classification are essential for intelligent transportation systems,and road target detection is one of the most difficult tasks in the field of computer vision.The challenge ...Rapid and precise vehicle recognition and classification are essential for intelligent transportation systems,and road target detection is one of the most difficult tasks in the field of computer vision.The challenge in real-time road target detection is the ability to properly pinpoint relatively small vehicles in complicated environments.However,because road targets are prone to complicated backgrounds and sparse features,it is challenging to detect and identify vehicle kinds fast and reliably.We suggest a new vehicle detection model called MEB-YOLO,which combines Mosaic and MixUp data augmentation,Efficient Channel Attention(ECA)attention mechanism,Bidirectional Feature Pyramid Network(BiFPN)with You Only Look Once(YOLO)model,to overcome this problem.Four sections make up this model:Input,Backbone,Neck,and Prediction.First,to improve the detection dataset and strengthen the network,MixUp and Mosaic data improvement are used during the picture processing step.Second,an attention mechanism is introduced to the backbone network,which is Cross Stage Par-tial Darknet(CSPDarknet),to reduce the influence of irrelevant features in images.Third,to achieve more sophisticated feature fusion without increasing computing cost,the BiFPN structure is utilized to build the Neck network of the model.The final prediction results are then obtained using Decoupled Head.Experiments demonstrate that the proposed model outperforms several already available detection methods and delivers good detection results on the University at Albany DEtection and TRACking(UA-DETRAC)public dataset.It also enables effective vehicle detection on real traffic monitoring data.As a result,this technique is efficient for detecting road targets.展开更多
基金funded by the National Natural Science Foundation of China(NSFC)(No.61170110)Zhejiang Provincial Natural Science Foundation of China(LY13F020043).
文摘Rapid and precise vehicle recognition and classification are essential for intelligent transportation systems,and road target detection is one of the most difficult tasks in the field of computer vision.The challenge in real-time road target detection is the ability to properly pinpoint relatively small vehicles in complicated environments.However,because road targets are prone to complicated backgrounds and sparse features,it is challenging to detect and identify vehicle kinds fast and reliably.We suggest a new vehicle detection model called MEB-YOLO,which combines Mosaic and MixUp data augmentation,Efficient Channel Attention(ECA)attention mechanism,Bidirectional Feature Pyramid Network(BiFPN)with You Only Look Once(YOLO)model,to overcome this problem.Four sections make up this model:Input,Backbone,Neck,and Prediction.First,to improve the detection dataset and strengthen the network,MixUp and Mosaic data improvement are used during the picture processing step.Second,an attention mechanism is introduced to the backbone network,which is Cross Stage Par-tial Darknet(CSPDarknet),to reduce the influence of irrelevant features in images.Third,to achieve more sophisticated feature fusion without increasing computing cost,the BiFPN structure is utilized to build the Neck network of the model.The final prediction results are then obtained using Decoupled Head.Experiments demonstrate that the proposed model outperforms several already available detection methods and delivers good detection results on the University at Albany DEtection and TRACking(UA-DETRAC)public dataset.It also enables effective vehicle detection on real traffic monitoring data.As a result,this technique is efficient for detecting road targets.