A full data augmentation pipeline for small object detection based on generative adversarial networks

被引:79
作者
Bosquet, Brais [1 ]
Cores, Daniel [1 ]
Seidenari, Lorenzo [2 ]
Brea, Victor M. [1 ]
Mucientes, Manuel [1 ]
Del Bimbo, Alberto [2 ]
机构
[1] Univ Santiago de Compostela, Ctr Singular Investiganon Tecnoloxias Intelixente, Santiago De Compostela, Spain
[2] Univ Florence, Media Integrat & Commun Ctr MICC, Florence, Italy
关键词
Small object detection; Data augmentation; Generative adversarial network; SUPERRESOLUTION;
D O I
10.1016/j.patcog.2022.108998
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection accuracy on small objects, i.e., objects under 32 x 32 pixels, lags behind that of large ones. To address this issue, innovative architectures have been designed and new datasets have been re-leased. Still, the number of small objects in many datasets does not suffice for training. The advent of the generative adversarial networks (GANs) opens up a new data augmentation possibility for training architectures without the costly task of annotating huge datasets for small objects. In this paper, we pro-pose a full pipeline for data augmentation for small object detection which combines a GAN-based object generator with techniques of object segmentation, image inpainting, and image blending to achieve high -quality synthetic data. The main component of our pipeline is DS-GAN, a novel GAN-based architecture that generates realistic small objects from larger ones. Experimental results show that our overall data augmentation method improves the performance of state-of-the-art models up to 11.9% AP@.5 s on UAVDT and by 4.7% AP@.5 s on iSAID, both for the small objects subset and for a scenario where the number of training instances is limited.(c) 2022 The Author(s). Published by Elsevier Ltd.This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页数:12
相关论文
共 45 条
  • [1] [Anonymous], 2014, ARXIV14091556
  • [2] Finding Tiny Faces in the Wild with Generative Adversarial Network
    Bai, Yancheng
    Zhang, Yongqiang
    Ding, Mingli
    Ghanem, Bernard
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 21 - 30
  • [3] SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network
    Bai, Yancheng
    Zhang, Yongqiang
    Ding, Mingli
    Ghanem, Bernard
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 210 - 226
  • [4] STDnet-ST: Spatio-temporal ConvNet for small object detection
    Bosquet, Brais
    Mucientes, Manuel
    Brea, Victor M.
    [J]. PATTERN RECOGNITION, 2021, 116 (116)
  • [5] STDnet: Exploiting high resolution feature maps for small object detection
    Bosquet, Brais
    Mucientes, Manuel
    Brea, Victor M.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91
  • [6] To Learn Image Super-Resolution, Use a GAN to Learn How to Do Image Degradation First
    Bulat, Adrian
    Yang, Jing
    Tzimiropoulos, Georgios
    [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 187 - 202
  • [7] A MULTIRESOLUTION SPLINE WITH APPLICATION TO IMAGE MOSAICS
    BURT, PJ
    ADELSON, EH
    [J]. ACM TRANSACTIONS ON GRAPHICS, 1983, 2 (04): : 217 - 236
  • [8] RRNet: A Hybrid Detector for Object Detection in Drone-captured Images
    Chen, Changrui
    Zhang, Yu
    Lv, Qingxuan
    Wei, Shuo
    Wang, Xiaorui
    Sun, Xin
    Dong, Junyu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 100 - 108
  • [9] A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection
    Chen, Zhe
    Ouyang, Wanli
    Liu, Tongliang
    Tao, Dacheng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (04) : 1121 - 1138
  • [10] Image Super-Resolution Using Deep Convolutional Networks
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) : 295 - 307