融合视觉关系检测的电力场景自动危险预警被引量：7

Visual relationship detection-based emergency early-warning description generation in electric power industry

导出

摘要目的借助深度学习强大的识别与检测能力,辅助人工进行电力场景下的危险描述与作业预警是一种较为经济和高效的电力安全监管手段。然而,目前主流的以目标检测技术为基础的预警系统只能给出部分危险目标的信息,忽视了电力设备的单目危险关系和成对对象间潜在的二元危险关系。不同于以往的方法,为了拓展危险预警模块的识别能力与功能范畴,本文提出了一种在电力场景下基于视觉关系检测的自动危险预警描述生成方法。方法对给定的待检测图像,通过目标检测模块得到图中对象的类别名称和限界框位置;分别对图像进行语义特征、视觉特征和空间位置特征的抽取,将融合后的总特征送入关系检测模块,输出单个对象的一元关系和成对对象间的关系三元组;根据检测出的对象类别和关系信息,进行危险预测并给出警示描述。结果本文自主搜集了多场景下的电力生产作业图像并进行标注,同时进行大量消融实验。实验显示,结合了语义特征、空间特征和视觉特征的关系检测器在前5召回率Recall@5和前10召回率Recall@10上的精度分别达到86.80%和93.93%,比仅使用视觉特征的关系检测器的性能提高约15%。结论本文提出的融合多模态特征输入的视觉关系检测网络能够较好地给出谓词关系的最佳匹配,并减少不合理的关系预测,且具有一定零样本学习(zero-shot learning)能力。相关可视化结果表明,整体系统能够较好地完成电力场景下的危险预警描述任务。 Objective The past decade has seen a steady increase in deep learning areas,where extensive research has been published to improve the learning capabilities of deep neural networks.Thus,a growing number of regulators in the electric power industry utilize such deep learning techniques with powerful recognition and detection capabilities to build their surveillance systems,which greatly reduce the risk of major accidents in daily work.However,most of the current early-warning systems are based on object detection technologies,which can only provide annotations of dangerous targets within the image,ignoring the significant information about unary relationships of electrical equipment and binary relationships between paired objects.This condition limits the capabilities of emergency recognition and forewarning.With the presence of powerful object detectors such as Faster region convolutional neural network(R-CNN)and huge visual datasets such as visual genome,visual relationship detection has attracted much attention in recent years.By utilizing the basic building blocks for single-object detection and understanding,visual relationship detection aims to not only accurately localize a pair of objects but also precisely determine the predicate between them.As a mid-level learning task,visual relationship detection can capture the detailed semantics of visual scenes by explicitly modeling objects along with their relationships with other objects.This approach bridges the gap between low-level visual tasks and high-level vision-language tasks,as well as helps machines to solve more challenging visual tasks such as image captioning,visual question answering,and image generation.However,the difficulty is in developing robust algorithms to recognize relationships between paired objects with challenging factors,such as highly diverse visual features in the same predicate category,incomplete annotation and longtailed distribution in the dataset,and optimum predicate matching problem.Although numerous methods have been proposed to build efficient relationship detectors,few of them concentrate on applying detection technologies to actual use.Method Different from existing methods,our method introduces the visual relationship detection technology into current early-warning systems.Specifically,our method not only identifies dangerous objects but also recognizes the potential unary or binary relationships that may cause an accident.To sum up,we propose a two-stage emergency recognition and forewarning system for the electric power industry.The system consists of a pre-trained object-detection module and a relationship detection module.The pipeline of our system mainly includes three stages.First,we train an object-detection module based on Faster R-CNN in advance.When given an image,the pre-trained object detector localizes all the object bounding boxes and annotates their categories.Then,the relationship-detection module integrates multiple cues(visual appearance,spatial location,and semantic embedding)to compute the predicate confidence of all the object pairs,and output the top instances as the relationship predictions.Finally,based on the targets and relationship information provided by the detectors,our system performs emergency prediction and generates a warning description that may help regulators in the electric power industry to make suitable decisions.Result We conduct several experiments to prove the efficiency and superiority of our method.First,we collect and build a dataset consisting of large amounts of images from multiple scenarios in the electric power industry.Using instructions from experts,we define and label the relationship categories that may pose risks to the images in the dataset.Then,according to the number of objects forming a relationship,we divide the dataset into two parts.Thus,our experiments involve two relevant tasks to evaluate the proposed method:unary relationship detection and binary relationship detection.For the unary relationship detection,we use precision and recall as thee valuation metrics.For the binary relationship detection,the evaluation metrics are Recall@5 and Recall@10.As our proposed relationshipdetection module contains multiple cues to learn the holistic representation of a relationship instance,we conduct ablation experiments to explore their influence on the final performance.Experiment results show that the detector that uses visual,spatial,and semantic features as input achieve the best performance of 86.80%in Recall@5 and 93.93%in Recall@10.Conclusion Extensive experiments show that our proposed method is efficient and effective in detecting defective electrical equipment and dangerous relationships between paired objects.Moreover,we formulate a pre-defined rule to generate the early-warning description according to the results of the object and relationship detectors.All of the proposed methods can help regulators take proper and timely actions to avoid harmful accidents in the electric power industry.

作者高明左红群柏帆田清阳葛志峰董兴宁甘甜 Gao Ming;Zuo Hongqun;Bai Fan;Tian Qingyang;Ge Zhifeng;Dong Xingning;Gan Tian(State Grid Ninghai Power Supply Company,Ningbo 315600,China;Ninghai Yancang Mountain Electric Power Construction Company,Ningbo 315600,China;School of Computer Science and Technology,Shandong University,Qingdao 266237,China)

机构地区国网浙江宁海县供电有限公司宁海县雁苍山电力建设有限公司山东大学计算机科学与技术学院

出处《中国图象图形学报》 CSCD 北大核心 2021年第7期1583-1593,共11页 Journal of Image and Graphics

基金宁波永耀电力投资集团有限公司科技项目(YYKJ202013)。

关键词危险预警目标检测视觉关系检测多模态特征融合多标签余量损失 emergency early-warning object detection visual relationship detection multimodal feature fusion multilabel margin loss

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1胡正文.输电线路工程的安全危险辨识及管理方法探讨[J].中国高新技术企业,2016(32):131-132. 被引量：6
2刘玮,黄曙,马凯,陈皓.视频监控技术在电力系统中的应用[J].广东电力,2014,27(4):57-60. 被引量：7
3马莉,明月,郭婷,廖爽,邹雨馨,熊一.基于视频监控的电力设备防破坏预警系统的方法[J].信息技术,2020,44(4):115-120. 被引量：7
4曾宪武,冉祯伟.电力杆塔应力综合监测危险预警系统[J].贵州电力技术,2015,18(8):70-72. 被引量：2

二级参考文献19

1王斌,楼颖稚,张肖宁.视频监控的发展及在电力系统中的应用[J].电力系统通信,2004,25(11):57-60. 被引量：13
2经翔飞.输电线路施工安全管理[J].中国电力企业管理,2007(3):70-71. 被引量：18
3黄立新.输配电线路实训教程[M].北京:中国电力出版社,2009.9.
4MOESLUND T B. HILTON A, KRUGER V. A Survey of Advances in Vision-based Human Motion Capture and Analysis [J].Computer Vision and Image Understanding, 2006 ( 104 ):90-126.
5LAO W, HAN J, WITH P H N. Automatic Surveillance Analyzer Using Trajectory and Body-based Modeling [C ]// Proc. IEEE International Conference on Consumer Electronics, Las Vegas, US: 2009.
6LAO W, HAN J, WITH P H N. Automatic Video-based Hu- man Motion Analyzer for Consumer Surveillance System[J]. IEEE Trans. Consumer Electronics, 2009, 55(2) : 591-598.
7ZIVKOVIC Z, VAN der HEIJDEN F. Efficient Adaptive Density Estimation per Image Pixel for the Task of Background Subtraction [J].Pattern Recognition Letters, 2006(27): 773-780.
8D. COMANICIU, V. RAMESH and P. MEER, "Kernel-based Object Tracking", IEEE Trans. Pattern Analysis and Machine Intelligence, 2003(25): 564-577.
9HANJ, FARIND. DE PHN et al. An Automatic Analyzer for Sports Video Databases Using Visual Cues and Real-world Modeling[C]//Consumer Electronics, 2006. ICCE06.2006 Digest of Technical Papers. International Conference on. IEEE. 2006, 477-478.
10LI H, WU S, BA S, et al. Automatic Detection and Recognition of Athlete Actions in Diving Video[C]//Advances in Multimedia Modeling, Springer Berlin Heidelberg: 2006.

共引文献18

1杨世贞.电力运行设备日常保养与维护策略[J].冶金管理,2020(21):61-62.
2宋宜雷.图像监控技术在电力系统中的应用[J].通讯世界（下半月）,2015(1):97-98.
3李锡忠,孙超,郑薇,谢晖,王峥.无线传感器网络中基于DSSS抗电力系统强电磁干扰技术研究[J].重庆邮电大学学报（自然科学版）,2016,28(6):815-821. 被引量：5
4邱永安.输电线路工程安全危险辨识与管理方法研究[J].居业,2017(7):139-139. 被引量：2
5胡辉,蔡映雪,胡松,黄思博,陈伽,蔡昭权.基于视频分析的异常事件检测技术研究[J].电脑知识与技术（过刊）,2017,23(12X):235-237. 被引量：1
6陈伽,胡辉,黄思博,蔡映雪,胡松,蔡昭权.异常事件检测的智能视频监控系统分析[J].电脑知识与技术,2017,13(12X):174-176. 被引量：1
7胡忠伟.输电线路建设工程管理和若干问题研究[J].环球市场,2017,0(20):169-169.
8保绍昆.输电线路工程安全危险辨识与管理[J].科技创新导报,2018,15(10):212-212. 被引量：2
9姜新亮,徐仕超,王兆辉,张石峰.物联网在电力杆塔状态监测中的应用[J].电力系统装备,2019,0(22):167-168.
10金玥佟,杨耀权,杜永昂.电力监控场景下基于光流特征点的目标跟踪算法[J].电力科学与工程,2020,36(5):40-47. 被引量：2

同被引文献63

1王燕,童博,李佳东,刘溟江,苏新民,周国栋.5G技术在海上风电场智慧运维中的应用现状及展望[J].船舶工程,2021,43(S01):130-133. 被引量：10
2蒲天骄,乔骥,韩笑,张国宾,王新迎.人工智能技术在电力设备运维检修中的研究及应用[J].高电压技术,2020,46(2):369-383. 被引量：216
3吴双,胡伟,张林,刘欣宇.基于AI技术的电网关键稳定特征智能选择方法[J].中国电机工程学报,2019,39(1):14-21. 被引量：29
4张明媛,曹志颖,赵雪峰,杨震.基于深度学习的建筑工人安全帽佩戴识别研究[J].安全与环境学报,2019,19(2):535-541. 被引量：65
5方路平,何杭江,周国民.目标检测算法研究综述[J].计算机工程与应用,2018,54(13):11-18. 被引量：114
6张晨宇,王慧芳,叶晓君.基于XGBoost算法的电力系统暂态稳定评估[J].电力自动化设备,2019,39(3):77-83. 被引量：44
7刘齐,王茂军,高强,李晓明,石林.基于红外成像技术的电气设备故障检测[J].电测与仪表,2019,56(10):122-126. 被引量：62
8吴建军,陈灵,李磊,杨金刚,梁晓莉.机载LiDAR点云中电力线的提取和重建研究[J].激光技术,2019,43(4):500-505. 被引量：29
9朱利鹏,陆超,黄河,刘映尚.基于时序轨迹特征学习的暂态电压稳定评估[J].电网技术,2019,43(6):1922-1930. 被引量：32
10邢法财,徐政,王世佳.非同步机电源接入电网后的谐振问题分析及抑制[J].电力系统自动化,2019,43(15):71-79. 被引量：21

引证文献7

1曹捷,郭志彬,潘立志,丁兴号.高空作业场景下的安全带穿戴检测[J].湖南科技大学学报（自然科学版）,2022,37(1):92-99. 被引量：12
2杨润霞,邵洁,罗岩,白万荣.基于编解码器的电力施工场景可控图像字幕生成[J].电网技术,2022,46(7):2572-2580. 被引量：2
3余国忠,邹健辉,宋华,樊中奎.智能无人监管配电网实训室的建设与研究[J].现代信息科技,2022,6(12):115-118.
4李杨,董元龙,林明晖,高明,岳衡,丁靖.基于AI视觉技术的电力设备检测方法[J].微型电脑应用,2023,39(9):90-93.
5扈永鹏.基于深度残差网络的输油气站场双电源快速切换风险预警模型[J].微型电脑应用,2024,40(6):184-188.
6陈巳阳,罗文盛,施振.背景分区增强下的超高压典型AR运维场景重建方法[J].微型电脑应用,2024,40(7):76-79.
7彭章龙,余华平.基于YOLOv5s的注意力改进研究[J].计算机科学与应用,2022,12(2):366-375.

二级引证文献14

1周兆银,田书函,谢艳,蔡硕累.计算机视觉技术在施工人员不安全行为管理中的应用[J].重庆大学学报,2022,45(S01):74-78. 被引量：6
2刘东甲,何川,胡欣钧,云磊.一种基于特高压直流高空3D防护柔性外骨骼的健康工效[J].临床医学工程,2023,30(5):611-612.
3何敏,秦亮,赵峰,余金沄,刘浩锋,王秋琳,徐兴华,刘开培.面向电力系统现场作业的安全风险管控智能检测算法[J].高电压技术,2023,49(6):2442-2457. 被引量：9
4谢国波,唐晶晶,林志毅,郑晓锋,方明.复杂场景下的改进YOLOv4安全帽检测算法[J].激光与光电子学进展,2023,60(12):129-137. 被引量：3
5刘石桥,胡鑫涛,胡玉莹,王礼坤.基于YOLOv5的高空作业下安全带佩戴检测[J].计算机应用文摘,2023,39(20):98-102.
6王若晨,赵江平.基于NanoDet的轻量级空中作业安全绳检测研究[J].工业安全与环保,2024,50(1):54-59.
7王彬燕.基于编码-解码技术的图像标题生成方法分析[J].计算机应用文摘,2024,40(5):110-112.
8黄培新,黄晓彬.提升高空作业安全性的智能锁扣和监督装置设计[J].电气技术与经济,2024(3):193-195.
9高翔,王志远,徐亮.基于计算机视觉技术的石化码头不安全行为智能识别[J].石油化工自动化,2024,60(2):106-108. 被引量：1
10李航,王少帅.高空作业安全带智能报警与远程监控设备的研究与设计[J].模具制造,2024,24(4):201-203. 被引量：1

1武志红.我们不缺德,缺的是规则[J].法制博览（名家讲坛、经典杂文）,2020,0(6):16-17.
2张相君,魏寒冰.海洋微塑料污染的国际法和国内法协同规制路径[J].中国海商法研究,2021,32(2):92-101. 被引量：11
3海如拉·热合曼.煤矿采空区地表塌陷的预测分析[J].华北自然资源,2021(4):112-113. 被引量：4
4樊熙奇.生命的自主极性运动:论福柯“治理术”概念中的康吉莱姆生命哲学意蕴[J].福建论坛（人文社会科学版）,2021(4):118-127. 被引量：1
5邓涛.岩巷掘进过断层突水危险预测及综合防治水技术分析[J].科学技术创新,2021(24):132-133. 被引量：5
6李庆园,蔡文静.基于树莓派的实时视频监控机器人设计[J].现代信息科技,2021,5(7):130-132. 被引量：1
7陈玲,曾真,陶晨.常见实用性辅料在服装设计中的装饰性应用[J].服饰导刊,2021,10(4):105-111. 被引量：2

中国图象图形学报

2021年第7期

浏览历史

内容加载中请稍等...

融合视觉关系检测的电力场景自动危险预警被引量：7

参考文献4

二级参考文献19

共引文献18

同被引文献63

引证文献7

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

融合视觉关系检测的电力场景自动危险预警 被引量：7

参考文献4

二级参考文献19

共引文献18

同被引文献63

引证文献7

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

融合视觉关系检测的电力场景自动危险预警被引量：7