A full data augmentation pipeline for small object detection based on generative adversarial networks

被引:108
作者
Bosquet, Brais [1 ]
Cores, Daniel [1 ]
Seidenari, Lorenzo [2 ]
Brea, Victor M. [1 ]
Mucientes, Manuel [1 ]
Del Bimbo, Alberto [2 ]
机构
[1] Univ Santiago de Compostela, Ctr Singular Investiganon Tecnoloxias Intelixente, Santiago De Compostela, Spain
[2] Univ Florence, Media Integrat & Commun Ctr MICC, Florence, Italy
关键词
Small object detection; Data augmentation; Generative adversarial network; SUPERRESOLUTION;
D O I
10.1016/j.patcog.2022.108998
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection accuracy on small objects, i.e., objects under 32 x 32 pixels, lags behind that of large ones. To address this issue, innovative architectures have been designed and new datasets have been re-leased. Still, the number of small objects in many datasets does not suffice for training. The advent of the generative adversarial networks (GANs) opens up a new data augmentation possibility for training architectures without the costly task of annotating huge datasets for small objects. In this paper, we pro-pose a full pipeline for data augmentation for small object detection which combines a GAN-based object generator with techniques of object segmentation, image inpainting, and image blending to achieve high -quality synthetic data. The main component of our pipeline is DS-GAN, a novel GAN-based architecture that generates realistic small objects from larger ones. Experimental results show that our overall data augmentation method improves the performance of state-of-the-art models up to 11.9% AP@.5 s on UAVDT and by 4.7% AP@.5 s on iSAID, both for the small objects subset and for a scenario where the number of training instances is limited.(c) 2022 The Author(s). Published by Elsevier Ltd.This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页数:12
相关论文
共 45 条
[1]  
[Anonymous], 2015, INT C LEARN REPR ICL
[2]   Finding Tiny Faces in the Wild with Generative Adversarial Network [J].
Bai, Yancheng ;
Zhang, Yongqiang ;
Ding, Mingli ;
Ghanem, Bernard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :21-30
[3]   SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network [J].
Bai, Yancheng ;
Zhang, Yongqiang ;
Ding, Mingli ;
Ghanem, Bernard .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :210-226
[4]   STDnet-ST: Spatio-temporal ConvNet for small object detection [J].
Bosquet, Brais ;
Mucientes, Manuel ;
Brea, Victor M. .
PATTERN RECOGNITION, 2021, 116 (116)
[5]   STDnet: Exploiting high resolution feature maps for small object detection [J].
Bosquet, Brais ;
Mucientes, Manuel ;
Brea, Victor M. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91
[6]   To Learn Image Super-Resolution, Use a GAN to Learn How to Do Image Degradation First [J].
Bulat, Adrian ;
Yang, Jing ;
Tzimiropoulos, Georgios .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :187-202
[7]   A MULTIRESOLUTION SPLINE WITH APPLICATION TO IMAGE MOSAICS [J].
BURT, PJ ;
ADELSON, EH .
ACM TRANSACTIONS ON GRAPHICS, 1983, 2 (04) :217-236
[8]   RRNet: A Hybrid Detector for Object Detection in Drone-captured Images [J].
Chen, Changrui ;
Zhang, Yu ;
Lv, Qingxuan ;
Wei, Shuo ;
Wang, Xiaorui ;
Sun, Xin ;
Dong, Junyu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :100-108
[9]   A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection [J].
Chen, Zhe ;
Ouyang, Wanli ;
Liu, Tongliang ;
Tao, Dacheng .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (04) :1121-1138
[10]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307