Object detection of transmission line visual images based on deep convolutional neural network

被引:6
作者
Zhou Zhu-bo [1 ]
Gao Jiao [2 ]
Zhang Wei [3 ]
Wang Xiao-jing [1 ]
Zhang Jiang [1 ]
机构
[1] Tianjin ZhongWei Aerospace Data Syst Technol Co L, Tianjin Key Lab Intelligent Informat Proc Remote, Tianjin 300301, Peoples R China
[2] Jinan Tony Robot Co Ltd, Jinan 250101, Shandong, Peoples R China
[3] China Southern Power Grid Co Ltd, Elect Power Res Inst, Guangzhou 510080, Guangdong, Peoples R China
关键词
transmission line image; insulator; object detection; deep learning; convolutional neural networks;
D O I
10.3788/YJYXS20183304.0317
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
A deep convolutional neural network based method is adopted to detect objects such as tower, glass insulator and composite insulator in visible images of transmission lines. About 600 visible images of 19 different transmission lines are captured by manned helicopter with high-definition camera. All of the images are then annotated manually and segmented into blocks with 4 different labels: background, tower, glass insulator and composite insulator. These blocks are then augmented to around 150 000 training samples which comprise the transmission line image dataset. A five-layer deep convolutional neural network is designed and pre-trained by using Cifar-100 dataset, the trained network is then fine-tuned by using transmission line image dataset. The experimental results show that when detection true positive rate is 90 %, the false alarm rate is less than 10 %, which is obviously superior to the traditional methods. It can be used for the detection of tower, glass insulator and composite insulator in visible images of transmission lines. The detection result can be used as reference for diagnosis or state analysis of transmission lines. This method can be used to detect tower and insulator in visible images of transmission lines, and can be extended to detect other typical objects.
引用
收藏
页码:317 / 325
页数:9
相关论文
共 19 条
[1]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[2]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[3]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[4]  
Han Jun, 2013, Infrared and Laser Engineering, V42, P3458
[5]  
[黄文博 Huang Wenbo], 2017, [光学精密工程, Optics and Precision Engineering], V25, P1378
[6]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[7]  
Krizhevsky A., 2009, LEARNING MULTIPLE LA
[8]  
Krizhevsky A., 2017, ADV NEURAL INFORM PR, V60, DOI [10.1145/3065386, DOI 10.1145/3065386]
[9]  
LeCun Y, 2004, PROC CVPR IEEE, P97
[10]   Backpropagation Applied to Handwritten Zip Code Recognition [J].
LeCun, Y. ;
Boser, B. ;
Denker, J. S. ;
Henderson, D. ;
Howard, R. E. ;
Hubbard, W. ;
Jackel, L. D. .
NEURAL COMPUTATION, 1989, 1 (04) :541-551