Enhanced Self-Supervised Transmission Inspection with Improved Region Prior and Scale Variation

被引:0
作者
Xie, Wei [1 ]
Wu, Fei [2 ]
Ouyang, Chao [3 ]
Yang, Yan [1 ]
Qian, Jian [1 ]
Lin, Shuang [1 ]
Zhou, Chenxi [1 ]
Zhang, Jun [3 ]
机构
[1] State Grid Fujian Elect Power Res Inst, Fuzhou 350000, Peoples R China
[2] State Grid Fujian Elect Power CO LTD, Fuzhou 350000, Peoples R China
[3] Wuhan Univ, Sch Elect Engn & Automat, Wuhan 430000, Peoples R China
关键词
computer vision; self-supervised learning; object detection;
D O I
10.3390/pr12122913
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
As an important means to ensure the safety of power transmission, the inspection of overhead transmission lines requires high accuracy for detecting small objects on the transmission lines and relies heavily on the construction of large-scale datasets by using deep learning instead of manual inspection. However, transmission inspection data often involve some sensitive information and need to be labeled by professionals, so it is difficult to construct a large transmission inspection dataset. In order to solve the problem of how to effectively train only on a small amount of transmission line data and achieve high object detection accuracy considering the large-scale variation in transmission objects, we propose an enhanced self-supervised pre-training model for DETR-like models, which are innovative object detectors eliminating hand-crafted non-maximum suppression and manual anchor design compared to previous CNN-based detectors. This paper mainly covers the following two points: (i) We compare UP-DETR and DETReg, noting that UP-DETR's random cropping method performs poorly on small datasets and affects DETR's localization ability. To address this, we adopt DETReg's approach, replacing Selective Search with Edge Boxes for better results. (ii) To tackle large-scale variations in transmission inspection datasets, we propose a multi-scale feature reconstruction task, aligning feature embeddings with multi-scale encoder embeddings, and enhancing multi-scale object detection. Our method surpasses UP-DETR DETReg with DETR variants when fine-tuning PASCAL VOC and PTL-AI Furnas for object detection.
引用
收藏
页数:13
相关论文
共 16 条
[1]   DETReg: Unsupervised Pretraining with Region Priors for Object Detection [J].
Bar, Amir ;
Wang, Xin ;
Kantorov, Vadim ;
Reed, Colorado J. ;
Herzig, Roei ;
Chechik, Gal ;
Rohrbach, Anna ;
Darrell, Trevor ;
Globerson, Amir .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :14585-14595
[2]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[3]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[4]   UP-DETR: Unsupervised Pre-training for Object Detection with Transformers [J].
Dai, Zhigang ;
Cai, Bolun ;
Lin, Yugeng ;
Chen, Junying .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1601-1610
[5]   PTL-AI Furnas Dataset: A Public Dataset for Fault Detection in Power Transmission Lines Using Aerial Images [J].
de Oliveira, Frederico Santos ;
de Carvalho, Marcelo ;
Tancredo Campos, Pedro Henrique ;
da Silva Soares, Anderson ;
Candido Junior, Arnaldo ;
Rodrigues da Silva Quirino, Ana Claudia .
2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, :7-12
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[9]  
Liu S., 2022, arXiv, DOI DOI 10.48550/ARXIV.2201.12329
[10]   Conditional DETR for Fast Training Convergence [J].
Meng, Depu ;
Chen, Xiaokang ;
Fan, Zejia ;
Zeng, Gang ;
Li, Houqiang ;
Yuan, Yuhui ;
Sun, Lei ;
Wang, Jingdong .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3631-3640