摘要
目的输电线路金具种类繁多、用处多样,与导线和杆塔安全密切相关。评估金具运行状态并实现故障诊断,需对输电线路金具目标进行精确定位和识别,然而随着无人机巡检采集的数据逐渐增多,将全部数据进行人工标注愈发困难。针对无标注数据无法有效利用的问题,提出一种基于自监督E-Swin Transformer(efficient shifted win⁃dows Transformer)的输电线路金具检测模型,充分利用无标注数据提高检测精度。方法首先,为了减少自注意力的计算量、提高模型计算效率,对Swin Transformer自注意力计算进行优化,提出一种高效的主干网络E-Swin。然后,为了利用无标注金具数据加强特征提取效果,针对E-Swin设计轻量化的自监督方法,并进行预训练。最后,为了提高检测定位精度,采用一种添加额外分支的检测头,并结合预训练之后的主干网络构建检测模型,利用少量有标注的数据进行微调训练,得到最终检测结果。结果实验结果表明,在输电线路金具数据集上,本文模型的各目标平均检测精确度(AP50)为88.6%,相比传统检测模型提高了10%左右。结论本文改进主干网络的自注意力计算,并采用自监督学习,使模型高效提取特征,实现无标注数据的有效利用,构建的金具检测模型为解决输电线路金具检测的数据利用问题提供了新思路。
Objective Transmission line is a key of infrastructure of power system.To keep the stability of the power sys⁃tem,it is required to preserve key components-based operation in the transmission line like fittings.Fittings are recognized as aluminum or iron-made metal accessories for multiple applications in relevant to such domains of protective fittings,con⁃necting fittings,tension clamps and suspension clamps.Fittings can be mainly used to support,fix and connect bare con⁃ductors and insulators.Such components are erosional for such complicated natural environment year by year.They are likely to have displacement,deflection and damage,which will affect the stability of the transmission system structure.If the defects of fittings are not sorted out quickly,they will cause severe circuit-damaged accidents.To assess status of the fittings and realize fault diagnosis,it is required to locate and identify the target of the transmission line fittings accurately.The emerging deep learning and unmanned aerial vehicle inspection techniques have been developing to optimize conven⁃tional single manual inspection technology further.A maintenance mode is melted into gradually,which can use unmanned aerial vehicle to acquire images,and the deep learning method is then incorporated to process aerial photos automatically.Most of these methods are focused on supervised learning only,that is,model training-before artificial data annotation is required for.As more and more data on transmission line components are collected by unmanned aerial vehicle patrols,manual labeling requires a large amount of human resources,and such missing and incorrect labeling problems will be occurred after that.To resolve this problem,we develop a fitting detection model based on self-supervised Transformer.Self-supervised learning is focused on unlabeled data-related pretext task design to mine the feature representation of the data itself and improve the feature extraction ability of the model.Less supervised data is then used for fine-tuning training through detection or segmentation-related downstream tasks.To resolve the problem of large amount of the model calcula⁃tion,Swin Transformer is improved and an efficient one-stage fitting detection model is built up based on self-supervised learning.Method Transformer model has shown its great potentials for computer vision in recent years.Due to its global self-attention calculation,Transformer can be used to extract more effective image feature information than convolutional neural network(CNN)to some extent.In addition,self-supervised learning feature of Transformer in natural language pro⁃cessing(NLP)domain has been gradually developing in computer vision(CV)domain.The fitting detection method pro⁃posed is segmented into three main categories.First,Swin Transformer is used as the backbone network.The calculation of self-attention is improved to solve the problem of large amount of calculation,and a smaller and more efficient backbone E-Swin is generated further.Second,the self-supervised pretext task of image reconstruction is designed.The improved backbone network is pre-trained in terms of self-supervised learning,and feature extraction ability of the model is trained in related to a large number of unlabeled data.After the self-supervised training,the network will be used as the backbone of the detection model.Finally,to improve the detection accuracy and get the final model,an optimized detector head is used to establish a high-precision one-stage detection model,and a small amount of labeled data is used for fine-tuning training.Result The transmission line fittings dataset is used to train and evaluate the model.The samples of image data are cut out derived from the inspection of the transmission line unmanned aerial vehicle(UAV).The first dataset is a large number of unlabeled data for self-supervised learning.The aerial photos-related dataset is clipped directly to remove background redundancy and preserve valid target information.The second dataset is a kind of labeled dataset for fine-tuning with a total of 1600 images.It is split into train samples and test samples according to the ratio of 4∶1.These samples consist of 12 types of fittings with a total labeled target of 10178.The experimental results show that the average precision(AP50)of the model on the transmission line fittings dataset is 88.6%,which is 10%higher than the traditional detection models.Con⁃clusion Self-attention calculation of the backbone network is improved.Self-supervised learning can be used to extract effi⁃cient features and effective application of unlabeled data can be realized.A one-stage fitting detection model is facilitated for resolving the problem of data application in transmission line fittings detection further.
作者
张珂
周睿恒
石超君
韩槊
杜明坤
赵振兵
Zhang Ke;Zhou Ruiheng;Shi Chaojun;Han Shuo;Du Mingkun;Zhao Zhenbing(Department of Electronic and Communication Engineering,North China Electric Power University,Baoding 071003,China;Hebei Key Laboratory of Power Internet of Things Technology,North China Electric Power University,Baoding 071003,China)
出处
《中国图象图形学报》
CSCD
北大核心
2023年第10期3064-3076,共13页
Journal of Image and Graphics
基金
国家自然科学基金项目(62076093,62206095,61871182)
中央高校基本科研业务费专项资金资助(2023JG002,2022MS078,2023JC006)。