TTPLA: An Aerial-Image Dataset for Detection and Segmentation of Transmission Towers and Power Lines

被引:15
作者
Abdelfattah, Rabab [1 ]
Wang, Xiaofeng [1 ]
Wang, Song [2 ]
机构
[1] Univ South Carolina, Dept Elect Engn, Columbia, SC 29208 USA
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC USA
来源
COMPUTER VISION - ACCV 2020, PT VI | 2021年 / 12627卷
基金
美国国家科学基金会;
关键词
NMS;
D O I
10.1007/978-3-030-69544-6_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate detection and segmentation of transmission towers (TTs) and power lines (PLs) from aerial images plays a key role in protecting power-grid security and low-altitude UAV safety. Meanwhile, aerial images of TTs and PLs pose a number of new challenges to the computer vision researchers who work on object detection and segmentation - PLs are long and thin, and may show similar color as the background; TTs can be of various shapes and most likely made up of line structures of various sparsity; The background scene, lighting, and object sizes can vary significantly from one image to another. In this paper we collect and release a new TT/PL Aerial-image (TTPLA) dataset, consisting of 1,100 images with the resolution of 3,840x2,160 pixels, as well as manually labeled 8,987 instances of TTs and PLs. We develop novel policies for collecting, annotating, and labeling the images in TTPLA. Different from other relevant datasets, TTPLA supports evaluation of instance segmentation, besides detection and semantic segmentation. To build a baseline for detection and segmentation tasks on TTPLA, we report the performance of several state-of-the-art deep learning models on our dataset. TTPLA dataset is publicly available at https://github.com/r3ab/ttpla_dataset.
引用
收藏
页码:601 / 618
页数:18
相关论文
共 42 条
[11]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[12]   Learning non-maximum suppression [J].
Hosang, Jan ;
Benenson, Rodrigo ;
Schiele, Bernt .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6469-6477
[13]   Relation Networks for Object Detection [J].
Hu, Han ;
Gu, Jiayuan ;
Zhang, Zheng ;
Dai, Jifeng ;
Wei, Yichen .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3588-3597
[14]   Mask Scoring R-CNN [J].
Huang, Zhaojin ;
Huang, Lichao ;
Gong, Yongchao ;
Huang, Chang ;
Wang, Xinggang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6402-6411
[15]   Vision-based autonomous navigation approach for unmanned aerial vehicle transmission-line inspection [J].
Hui, Xiaolong ;
Bian, Jiang ;
Zhao, Xiaoguang ;
Tan, Min .
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2018, 15 (01)
[16]  
Li DD, 2019, 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), P131, DOI [10.1109/ICCCBDA.2019.8725702, 10.1109/icccbda.2019.8725702]
[17]   Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade [J].
Li, Xiaoxiao ;
Liu, Ziwei ;
Luo, Ping ;
Loy, Chen Change ;
Tang, Xiaoou .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6459-6468
[18]   Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation [J].
Lin, Guosheng ;
Shen, Chunhua ;
van den Hengel, Anton ;
Reid, Ian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3194-3203
[19]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[20]   Adaptive NMS: Refining Pedestrian Detection in a Crowd [J].
Liu, Songtao ;
Huang, Di ;
Wang, Yunhong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6452-6461