UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in Aerial Images

被引：9

作者：

He, Boyong ^{[1
]}

Li, Xianjiang ^{[1
]}

Huang, Bo ^{[1
]}

Gu, Enhui ^{[2
]}

Guo, Weijie ^{[3
]}

Wu, Liaoni ^{[1
]}

机构：

[1] Xiamen Univ, Sch Aerosp Engn, Xiamen 361102, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Aeronaut & Astronaut, Shanghai 200240, Peoples R China

[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China

来源：

REMOTE SENSING | 2021年 / 13卷 / 24期

基金：

中国国家自然科学基金;

关键词：

deep learning; synthetic data; ship recognition; aerial imagery; VEHICLE DETECTION; TARGET DETECTION;

D O I：

10.3390/rs13244999

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

As a data-driven approach, deep learning requires a large amount of annotated data for training to obtain a sufficiently accurate and generalized model, especially in the field of computer vision. However, when compared with generic object recognition datasets, aerial image datasets are more challenging to acquire and more expensive to label. Obtaining a large amount of high-quality aerial image data for object recognition and image understanding is an urgent problem. Existing studies show that synthetic data can effectively reduce the amount of training data required. Therefore, in this paper, we propose the first synthetic aerial image dataset for ship recognition, called UnityShip. This dataset contains over 100,000 synthetic images and 194,054 ship instances, including 79 different ship models in ten categories and six different large virtual scenes with different time periods, weather environments, and altitudes. The annotations include environmental information, instance-level horizontal bounding boxes, oriented bounding boxes, and the type and ID of each ship. This provides the basis for object detection, oriented object detection, fine-grained recognition, and scene recognition. To investigate the applications of UnityShip, the synthetic data were validated for model pre-training and data augmentation using three different object detection algorithms and six existing real-world ship detection datasets. Our experimental results show that for small-sized and medium-sized real-world datasets, the synthetic data achieve an improvement in model pre-training and data augmentation, showing the value and potential of synthetic data in aerial image recognition and understanding tasks.

引用

页数：21

共 50 条

[1] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
Hua, Yuansheng
Mou, Lichao
Jin, Pu
Zhu, Xiao Xiang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2] Large-Scale Synthetic Urban Dataset for Aerial Scene Understanding
Gao, Qian
Shen, Xukun
Niu, Wensheng
IEEE ACCESS, 2020, 8 (08): : 42131 - 42140
[3] DOTA: A Large-scale Dataset for Object Detection in Aerial Images
Xia, Gui-Song
Bai, Xiang
Ding, Jian
Zhu, Zhen
Belongie, Serge
Luo, Jiebo
Datcu, Mihai
Pelillo, Marcello
Zhang, Liangpei
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3974 - 3983
[4] OpenSARWake: A Large-Scale SAR Dataset for Ship Wake Recognition With a Feature Refinement Oriented Detector
Xu, Chengji
Wang, Xiaoqing
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
[5] SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection
Shao, Zhenfeng
Wu, Wenjing
Wang, Zhongyuan
Du, Wan
Li, Chengyuan
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (10) : 2593 - 2604
[6] A large-scale fMRI dataset for human action recognition
Zhou, Ming
Gong, Zhengxin
Dai, Yuxuan
Wen, Yushan
Liu, Youyi
Zhen, Zonglei
SCIENTIFIC DATA, 2023, 10 (01)
[7] A large-scale fMRI dataset for human action recognition
Ming Zhou
Zhengxin Gong
Yuxuan Dai
Yushan Wen
Youyi Liu
Zonglei Zhen
Scientific Data, 10
[8] ARTIFACT: A LARGE-SCALE DATASET WITH ARTIFICIAL AND FACTUAL IMAGES FOR GENERALIZABLE AND ROBUST SYNTHETIC IMAGE DETECTION
Rahman, Md Awsafur
Paul, Bishmoy
Sarker, Najibul Haque
Hakim, Zaber Ibn Abdul
Fattah, Shaikh Anowarul
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2200 - 2204
[9] SegTex: A Large Scale Synthetic Face Dataset for Face Recognition
Ambardi, Laudwika
Hong, Sungeun
Park, In Kyu
IEEE ACCESS, 2023, 11 : 131939 - 131949
[10] AIFood: A Large Scale Food Images Dataset for Ingredient Recognition
Lee, Gwo Giun
Huang, Chin-Wei
Chen, Jia-Hong
Chen, Shih-Yu
Chen, Hsiu-Ling
PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 802 - 805

← 1 2 3 4 5 →