跨模态噪声过滤的事件相机目标检测算法

Event-based Camera Object Detection Algorithm for Cross-modal Noisy Annotations Filtering

下载PDF

导出

摘要事件相机具有高时间分辨率、高动态范围和低功耗等特性,通常被用于传统相机应用受限场景(高速度、强光、弱光等)下的目标检测任务中。然而由于事件相机的像素异步性,其输出的事件序列难以进行人工标注,为此现有方法通过RGB图像标记迁移得到事件序列标记。然而,迁移标记中存在大量噪声标记和事件序列中部分目标纹理模糊,导致难以取得理想的模型性能。为了解决此问题,提出了一种跨模态噪声过滤的事件相机目标检测算法。算法利用预训练后的事件相机检测器对开源RGB目标检测数据集进行筛选,得到对训练事件相机检测器最具价值的RGB图像和事件图像一起构成跨模态混合图像,帮助检测器更准确地识别、定位事件图像目标;为了缓解噪声标记对检测器性能的影响,设计了一种多阶段目标检测联合优化策略,单个阶段训练完成时,在全局标记中识别噪声标记,并对噪声标记进行修正后在下一阶段使用。实验结果表明,在1Mpx Detection Dataset上,与基准模型相比,跨模态噪声过滤的事件相机目标检测算法提供了8.35%的模型增益,远优于Co-teaching,O2U-net等噪声标签学习方法,具体地,跨模态混合图像训练、联合优化框架分别提供了6.44%,4.77%的模型增益。 Event-based camera is commonly seen in object detection in limited scenarios for traditional camera applications(high speed,strong light,low light,etc.)due to their high time resolution,high dynamic range and low power consumption.However,the event sequence output of event camera is difficult to be manually labeled due to its pixel asynchronism,so the existing me-thods obtain event sequence annotations through the migration of RGB image annotations.However,since the migrated annotations have numerous inaccurate bounding boxes and some object textures in event sequence are fuzzy,leading to poor model performance.To address this problem,event-based camera object detection algorithm for cross-modal noisy annotations filtering is proposed.The method uses a pre-trained event-based camera detector to filter open-source RGB object detection datasets and selects RGB images that are most valuable for training the event-based camera detector.These selected RGB images are combined with event images to construct cross-domain mixed images,helping the detector to identify and locate the event image object more accurately.To mitigate the impact of noisy annotations on detector performance,a multi-stage object detection joint optimization strategy is designed.After each stage of training is completed,noisy annotations are identified in the global annotations and are corrected use in the next stage.Experimental results show that,on the 1Mpx Detection Dataset,the robust event-based camera cross-modal object detection method based on noisy annotations provides 8.35% model gain compared to the baseline model,significantly outperforming noise-label learning methods such as Co-teaching and O2U-net.Specifically,cross-modal hybrid images training and joint optimization frameworks offer model gains of 6.44% and 4.77%,respectively.

作者胡刚梁栋黄圣君 HU Gang;LIANG Dong;HUANG Shengjun(School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China)

机构地区南京航空航天大学计算机科学与技术学院

出处《计算机科学》 CSCD 北大核心 2024年第S02期242-247,共6页 Computer Science

关键词事件相机目标检测噪声标记跨模态联合优化 Event-based camera Object detection Noisy annotations Cross-modal Joint optimization

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1王琳,刘哲,史殿习,周晨磊,杨绍武,张拥军.融合跟踪器:融合图像特征和事件特征的单目标跟踪框架[J].计算机科学,2023,50(10):96-103. 被引量：1
2徐齐,邓洁,申江荣,唐华锦,潘纲.基于事件相机的图像重构综述[J].电子与信息学报,2023,45(8):2699-2709. 被引量：2
3刘康,钱旭,王自强.主动学习算法综述[J].计算机工程与应用,2012,48(34):1-4. 被引量：26

二级参考文献24

1Hastie T,Tibshirani R, Friedman J.The elements of sta- tistical learning: data mining, inference, and prediction, ser.statistics[M].2nd ed.New York:Springer,2009.
2Boser B E, Guyon I M, Vapnik V N.A training algo- rithm for optimal margin classifiers[C]//Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, 1992 : 144-152.
3Haykin S.Neural networks and learning machines[M]. 3rd ed.Cambridge, MA: Prentice-Hall, 2008.
4Settles B.Active learning literature survey[R].Univ of Wisconsin-Madison, 2011.
5Tuia D, Ratle F, Pacifici F, et al.Active learning meth- ods for remote sensing image classification[J].IEEE Trans on Geosci Remote Sens,2009,47(7):2218-2232.
6Copa L,'Tuia D, Volpi M, et al.Unbiased query-by-bagging active learning for VHR image classification[C]//Proc SPIE Remote Sens Conf,2010.
7Di W, Crawford M.Multi-view adaptive disagreement based active learning for hyperspectral image classification[C]// IEEE International Geoscience and Remote Sensing Symposium, 2010 : 1374-1377.
8Muslea I.Active learning with multiple views[J].Journal of Artificial Intelligence Research,2006,27:203-233.
9Campbell C, Cristianini N, Smola A J.Query leaming with large margin classifiers[C]//Proc Int Conf Mach Leam(ICML), 2000: 111-118.
10Schohn G, Cohn D.Less is more: Active learning with support vector machines[C]//Proc 17th ICML, 2000: 839-846.

共引文献26

1刘振宇,李钦富,杨硕,邓应强,刘芬,赖新明,白雪珂.一种基于主动学习和多种监督学习的情感分析模型[J].中国电子科学研究院学报,2020,15(2):171-176. 被引量：2
2邵忻.基于跨领域主动学习的图像分类方法[J].计算机应用,2014,34(4):1169-1171. 被引量：6
3张静,聂章龙.基于主动学习的动态模糊聚类算法[J].计算机与现代化,2014(5):24-27.
4张雁,吕丹桔,王红崧.基于主动学习的环境音分类研究[J].计算机技术与发展,2014,24(6):110-113.
5梁喜涛,顾磊.基于分层选择策略的主动学习分词方法[J].计算机应用研究,2015,32(5):1353-1356.
6李艳玲,颜永红.中文口语理解弱监督训练方法[J].计算机应用,2015,35(7):1965-1968. 被引量：2
7梁喜涛,顾磊.基于最近邻的主动学习分词方法[J].计算机科学,2015,42(6):228-232. 被引量：1
8高学伟,郑世珏,高丽,李松丽.基于SVM主动学习的微信监测研究[J].计算机与数字工程,2016,44(4):715-719.
9朱丽,陆建峰.基于主动学习的微博聚类分析[J].数据采集与处理,2016,31(3):599-605. 被引量：1
10任红格,李冬梅,李福进.动态神经网络分类器主动学习算法及其智能控制应用[J].计算机应用与软件,2016,33(7):247-251. 被引量：2

1进三.#数字时代,画质真的重要吗?#[J].摄影之友,2024(11):12-12.
2顾卫清.关键能力视域下高中数学混合式教学模式探究——以圆锥曲线定点定值问题为例[J].数学之友,2024,38(17):37-39.
3徐珏,刘双,符宏高,邵美瑛,陈美玲,黄镇.Wnt1-Cre和Pax2-Cre标记的小鼠第一鳃弓颅颌面部神经嵴细胞异质性研究[J].华西口腔医学杂志,2024,42(4):435-443.
4超表面元件加神经网络创建多维“视野”相[J].电子产品可靠性与环境试验,2024,42(5):82-82.
5夏强强,李菲菲.基于Co-Teaching的噪声标签深度学习[J].电子科技,2024,37(11):1-6.
6党思航,李晓哲,夏召强,蒋晓悦,桂术亮,冯晓毅.采用自适应预筛选的遥感图像目标开集检测研究[J].电子与信息学报,2024,46(10):3908-3917.
7黄婷,于恩洋.从“老适技术”到“技术适老”:适老化视域下代际数字鸿沟的纾解之道[J].图书馆,2024(10):43-51.
8张志强,暴亚东.融合RF和CNN的异常流量检测算法[J].信息网络安全,2024(11):1655-1664.
9范学晶,薛笑荣,杜意超.基于双重注意力机制的遥感图像时空融合方法[J].计算机科学,2024,51(S02):495-500.
10路梅,杨雨萱.自适应融合相似图的多视图谱聚类算法[J].金陵科技学院学报,2024,40(3):1-12.

计算机科学

2024年第S02期

浏览历史

内容加载中请稍等...

跨模态噪声过滤的事件相机目标检测算法

参考文献3

二级参考文献24

共引文献26

相关作者

相关机构

相关主题

浏览历史