A full data augmentation pipeline for small object detection based on generative adversarial networks

被引：117

作者：

Bosquet, Brais ^{[1
]}

Cores, Daniel ^{[1
]}

Seidenari, Lorenzo ^{[2
]}

Brea, Victor M. ^{[1
]}

Mucientes, Manuel ^{[1
]}

Del Bimbo, Alberto ^{[2
]}

机构：

[1] Univ Santiago de Compostela, Ctr Singular Investiganon Tecnoloxias Intelixente, Santiago De Compostela, Spain

[2] Univ Florence, Media Integrat & Commun Ctr MICC, Florence, Italy

来源：

PATTERN RECOGNITION | 2023年 / 133卷

关键词：

Small object detection; Data augmentation; Generative adversarial network; SUPERRESOLUTION;

D O I：

10.1016/j.patcog.2022.108998

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object detection accuracy on small objects, i.e., objects under 32 x 32 pixels, lags behind that of large ones. To address this issue, innovative architectures have been designed and new datasets have been re-leased. Still, the number of small objects in many datasets does not suffice for training. The advent of the generative adversarial networks (GANs) opens up a new data augmentation possibility for training architectures without the costly task of annotating huge datasets for small objects. In this paper, we pro-pose a full pipeline for data augmentation for small object detection which combines a GAN-based object generator with techniques of object segmentation, image inpainting, and image blending to achieve high -quality synthetic data. The main component of our pipeline is DS-GAN, a novel GAN-based architecture that generates realistic small objects from larger ones. Experimental results show that our overall data augmentation method improves the performance of state-of-the-art models up to 11.9% AP@.5 s on UAVDT and by 4.7% AP@.5 s on iSAID, both for the small objects subset and for a scenario where the number of training instances is limited.(c) 2022 The Author(s). Published by Elsevier Ltd.This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

引用

页数：12

共 45 条

[1]

[Anonymous], 2018, 6 INT C LEARNING REP

[2] Finding Tiny Faces in the Wild with Generative Adversarial Network [J].