Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving

被引：7

作者：

Li, Naifan ^{[1
]}

Song, Fan ^{[1
]}

Zhang, Ying ^{[1
]}

Liang, Pengpeng ^{[2
]}

Cheng, Erkang ^{[1
]}

机构：

[1] NullMax, Shanghai 201210, Peoples R China

[2] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年

关键词：

D O I：

10.1109/ICRA46639.2022.9811724

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Detection of rare objects (e.g., traffic cones, traffic barrels and traffic warning triangles) is an important perception task to improve the safety of autonomous driving. Training of such models typically requires a large number of annotated data which is expensive and time consuming to obtain. To address the above problem, an emerging approach is to apply data augmentation to automatically generate cost-free training samples. In this work, we propose a systematic study on simple Copy-Paste data augmentation for rare object detection in autonomous driving. Specifically, local adaptive instance-level image transformation is introduced to generate realistic rare object masks from source domain to the target domain. Moreover, traffic scene context is utilized to guide the placement of masks of rare objects. To this end, our data augmentation generates training data with high quality and realistic characteristics by leveraging both local and global consistency. In addition, we build a new dataset named NM10k consisting 10k training images, 4k validation images and the corresponding labels with a diverse range of scenarios in autonomous driving. Experiments on NM10k show that our method achieves promising results on rare object detection. We also present a thorough study to illustrate the effectiveness of our local-adaptive and global constraints based Copy-Paste data augmentation for rare object detection. The data, development kit and more information of NM10k dataset are available online at: https://nullmax-vision.github.io.

引用

页码：4548 / 4554

页数：7

共 27 条

[1]

[Anonymous], **DATA OBJECT**, DOI DOI 10.5281/ZENODO.4154370

[2]

Bochkovskiy A, 2020, ARXIV, DOI 10.48550/ARXIV.2004.10934

[3] Class-Balanced Loss Based on Effective Number of Samples [J].

Cui, Yin ;

Jia, Menglin ;

Lin, Tsung-Yi ;

Song, Yang ;

Belongie, Serge .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269

[4]

DeVries T., 2017, arXiv

[5] Modeling Visual Context Is Key to Augmenting Object Detection Datasets [J].

Dvornik, Nikita ;

Mairal, Julien ;

Schmid, Cordelia .

COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :375-391

[6] On the Importance of Visual Context for Data Augmentation in Scene Understanding [J].

Dvornik, Nikita ;

Mairal, Julien ;

Schmid, Cordelia .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (06) :2014-2028

[7] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection [J].

Dwibedi, Debidatta ;

Misra, Ishan ;

Hebert, Martial .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1310-1319

[8] InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting [J].

Fang, Hao-Shu ;

Sun, Jianhua ;

Wang, Runzhong ;

Gou, Minghao ;

Li, Yong-Lu ;

Lu, Cewu .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :682-691

[9] Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation [J].

Ghiasi, Golnaz ;

Cui, Yin ;

Srinivas, Aravind ;

Qian, Rui ;

Lin, Tsung-Yi ;

Cubuk, Ekin D. ;

Le, Quoc, V ;

Zoph, Barret .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2917-2927

[10]

Joseph RK, 2016, CRIT POL ECON S ASIA, P1

← 1 2 3 →