The Fusion of Temporal Sequence with Scene Priori Information in Deep Learning Object Recognition

Abstract: In important object recognition applications such as intelligent robots and unmanned driving, images are collected consecutively and are correlated with one another, and the scenes have stable prior features. Existing technologies do not take full advantage of this information. To improve object recognition beyond existing algorithms in these applications, an object recognition method that fuses temporal sequence information with scene priori information is proposed. The method first employs YOLOv3 as the base algorithm to recognize objects in single-frame images, then uses the DeepSort algorithm to associate potential objects recognized in images captured at different moments, and finally applies the confidence fusion method and temporal boundary processing method designed in the paper to fuse, at the decision level, temporal sequence information with scene priori information. Experiments on public datasets and self-built industrial scene datasets show that, because the information sources are expanded, the quality of single-frame images has less impact on the recognition results, and object recognition performance is greatly improved. The method is presented as a widely applicable framework for information fusion across multiple classes: any object recognition algorithm that simultaneously outputs object class, location information, and recognition confidence can be integrated into this framework to improve performance.
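The decision-level fusion step summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's confidence fusion formula or its temporal boundary processing, so the code below is not the authors' method: it assumes a naive Bayes-style product of per-frame confidences weighted by a scene prior. The function name `fuse_track_confidences`, the `scene_prior` dictionary, and the input format are all hypothetical, and detection (for example with YOLOv3) and track association (for example with DeepSort) are assumed to have been done upstream.

```python
import math


def fuse_track_confidences(track_detections, scene_prior, eps=1e-6):
    """Fuse per-frame (class, confidence) pairs for ONE tracked object.

    Assumption (not from the paper): each frame is treated as independent
    evidence, and the scene prior is the initial class probability.
    Returns (best_class, normalized_score).
    """
    # Candidate classes: everything in the prior plus everything detected.
    classes = set(scene_prior) | {cls for cls, _ in track_detections}

    # Start from the log of the scene prior for every candidate class.
    log_scores = {c: math.log(max(scene_prior.get(c, eps), eps)) for c in classes}

    n_other = max(len(classes) - 1, 1)
    for det_cls, conf in track_detections:
        conf = min(max(conf, eps), 1.0 - eps)
        # Probability mass left for the classes the detector did not choose.
        rest = (1.0 - conf) / n_other
        for c in classes:
            log_scores[c] += math.log(conf if c == det_cls else rest)

    # Normalize with a numerically stable softmax over the log scores.
    top = max(log_scores.values())
    exp_scores = {c: math.exp(s - top) for c, s in log_scores.items()}
    total = sum(exp_scores.values())
    best = max(exp_scores, key=exp_scores.get)
    return best, exp_scores[best] / total


# Hypothetical track: two confident "person" frames and one noisy "dog" frame.
track = [("person", 0.9), ("person", 0.8), ("dog", 0.55)]
prior = {"person": 0.7, "dog": 0.3}
label, score = fuse_track_confidences(track, prior)
```

In this sketch a single low-quality frame that yields the wrong class is outweighed by the other frames in the same track and by the scene prior, which is the qualitative effect the abstract reports.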

Authors: Yongkang Cao, Fengjun Liu, Xian Wang, Wenyun Wang, Zhaoxin Peng (School of Mechanical Engineering, Hunan University of Science and Technology, Xiangtan, China)

Affiliation: School of Mechanical Engineering

Source: Open Journal of Applied Sciences, 2024, Issue 9, pp. 2610-2627 (18 pages)

Keywords: Computer Vision; Object Recognition; Deep Learning; Consecutive Scene; Information Fusion

Classification code: TP3 [Automation and Computer Technology: Computer Science and Technology]
