UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in Aerial Images

被引：9

作者：

He, Boyong ^{[1
]}

Li, Xianjiang ^{[1
]}

Huang, Bo ^{[1
]}

Gu, Enhui ^{[2
]}

Guo, Weijie ^{[3
]}

Wu, Liaoni ^{[1
]}

机构：

[1] Xiamen Univ, Sch Aerosp Engn, Xiamen 361102, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Aeronaut & Astronaut, Shanghai 200240, Peoples R China

[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China

来源：

REMOTE SENSING | 2021年 / 13卷 / 24期

基金：

中国国家自然科学基金;

关键词：

deep learning; synthetic data; ship recognition; aerial imagery; VEHICLE DETECTION; TARGET DETECTION;

D O I：

10.3390/rs13244999

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

As a data-driven approach, deep learning requires a large amount of annotated data for training to obtain a sufficiently accurate and generalized model, especially in the field of computer vision. However, when compared with generic object recognition datasets, aerial image datasets are more challenging to acquire and more expensive to label. Obtaining a large amount of high-quality aerial image data for object recognition and image understanding is an urgent problem. Existing studies show that synthetic data can effectively reduce the amount of training data required. Therefore, in this paper, we propose the first synthetic aerial image dataset for ship recognition, called UnityShip. This dataset contains over 100,000 synthetic images and 194,054 ship instances, including 79 different ship models in ten categories and six different large virtual scenes with different time periods, weather environments, and altitudes. The annotations include environmental information, instance-level horizontal bounding boxes, oriented bounding boxes, and the type and ID of each ship. This provides the basis for object detection, oriented object detection, fine-grained recognition, and scene recognition. To investigate the applications of UnityShip, the synthetic data were validated for model pre-training and data augmentation using three different object detection algorithms and six existing real-world ship detection datasets. Our experimental results show that for small-sized and medium-sized real-world datasets, the synthetic data achieve an improvement in model pre-training and data augmentation, showing the value and potential of synthetic data in aerial image recognition and understanding tasks.

引用

页数：21

共 50 条

[41] A large-scale container dataset and a baseline method for container hole localization
Yunfeng Diao
Xin Tang
He Wang
Emma Christophine Florence Taylor
Shirui Xiao
Mengtian Xie
Wenming Cheng
Journal of Real-Time Image Processing, 2022, 19 : 577 - 589
[42] PESTD: a large-scale Persian-English scene text dataset
Rashtehroudi, Atefeh Ranjkesh
Akoushideh, Alireza
Shahbahrami, Asadollah
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34793 - 34808
[43] Large-scale multiview 3D hand pose dataset
Gomez-Donoso, Francisco
Orts-Escolano, Sergio
Cazorla, Miguel
IMAGE AND VISION COMPUTING, 2019, 81 : 25 - 33
[44] MSIF: Multisize Inference Fusion-Based False Alarm Elimination for Ship Detection in Large-Scale SAR Images
Zhang, Chao
Yang, Chule
Cheng, Kaihui
Guan, Naiyang
Dong, Hongbin
Deng, Baosong
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[45] Building Instance Change Detection from Large-Scale Aerial Images using Convolutional Neural Networks and Simulated Samples
Ji, Shunping
Shen, Yanyun
Lu, Meng
Zhang, Yongjun
REMOTE SENSING, 2019, 11 (11)
[46] Classifying for a Mixture of Object Images and Character Patterns by Using CNN Pre-trained for Large-scale Object Image Dataset
Shima, Yoshihiro
Nakashima, Yumi
Yasuda, Michio
PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 2360 - 2365
[47] ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network
Min, Weiqing
Liu, Linhu
Wang, Zhiling
Luo, Zhengdong
Wei, Xiaoming
Wei, Xiaolin
Jiang, Shuqiang
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 393 - 401
[48] RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection
Ying, Qichao
Liu, Jiaxin
Li, Sheng
Xu, Haisheng
Qian, Zhenxing
Zhang, Xinpeng
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 737 - 746
[49] MCMOD: The Multi-Category Large-Scale Dataset for Maritime Object Detection
Sun, Zihao
Hu, Xiao
Qi, Yining
Huang, Yongfeng
Li, Songbin
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 1657 - 1669
[50] OLKAVS: AN OPEN LARGE-SCALE KOREAN AUDIO-VISUAL SPEECH DATASET
Park, Jeongkyun
Hwang, Jung-Wook
Choi, Kwanghee
Lee, Seung-Hyeon
Ahn, Jun Hwan
Park, Rae-Hong
Park, Hyung-Min
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6385 - 6389

← 1 2 3 4 5 →