UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in Aerial Images

被引:9
|
作者
He, Boyong [1 ]
Li, Xianjiang [1 ]
Huang, Bo [1 ]
Gu, Enhui [2 ]
Guo, Weijie [3 ]
Wu, Liaoni [1 ]
机构
[1] Xiamen Univ, Sch Aerosp Engn, Xiamen 361102, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Aeronaut & Astronaut, Shanghai 200240, Peoples R China
[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; synthetic data; ship recognition; aerial imagery; VEHICLE DETECTION; TARGET DETECTION;
D O I
10.3390/rs13244999
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
As a data-driven approach, deep learning requires a large amount of annotated data for training to obtain a sufficiently accurate and generalized model, especially in the field of computer vision. However, when compared with generic object recognition datasets, aerial image datasets are more challenging to acquire and more expensive to label. Obtaining a large amount of high-quality aerial image data for object recognition and image understanding is an urgent problem. Existing studies show that synthetic data can effectively reduce the amount of training data required. Therefore, in this paper, we propose the first synthetic aerial image dataset for ship recognition, called UnityShip. This dataset contains over 100,000 synthetic images and 194,054 ship instances, including 79 different ship models in ten categories and six different large virtual scenes with different time periods, weather environments, and altitudes. The annotations include environmental information, instance-level horizontal bounding boxes, oriented bounding boxes, and the type and ID of each ship. This provides the basis for object detection, oriented object detection, fine-grained recognition, and scene recognition. To investigate the applications of UnityShip, the synthetic data were validated for model pre-training and data augmentation using three different object detection algorithms and six existing real-world ship detection datasets. Our experimental results show that for small-sized and medium-sized real-world datasets, the synthetic data achieve an improvement in model pre-training and data augmentation, showing the value and potential of synthetic data in aerial image recognition and understanding tasks.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Large-Scale Synthetic Urban Dataset for Aerial Scene Understanding
    Gao, Qian
    Shen, Xukun
    Niu, Wensheng
    IEEE ACCESS, 2020, 8 (08): : 42131 - 42140
  • [3] DOTA: A Large-scale Dataset for Object Detection in Aerial Images
    Xia, Gui-Song
    Bai, Xiang
    Ding, Jian
    Zhu, Zhen
    Belongie, Serge
    Luo, Jiebo
    Datcu, Mihai
    Pelillo, Marcello
    Zhang, Liangpei
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3974 - 3983
  • [4] OpenSARWake: A Large-Scale SAR Dataset for Ship Wake Recognition With a Feature Refinement Oriented Detector
    Xu, Chengji
    Wang, Xiaoqing
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [5] SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection
    Shao, Zhenfeng
    Wu, Wenjing
    Wang, Zhongyuan
    Du, Wan
    Li, Chengyuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (10) : 2593 - 2604
  • [6] A large-scale fMRI dataset for human action recognition
    Zhou, Ming
    Gong, Zhengxin
    Dai, Yuxuan
    Wen, Yushan
    Liu, Youyi
    Zhen, Zonglei
    SCIENTIFIC DATA, 2023, 10 (01)
  • [7] A large-scale fMRI dataset for human action recognition
    Ming Zhou
    Zhengxin Gong
    Yuxuan Dai
    Yushan Wen
    Youyi Liu
    Zonglei Zhen
    Scientific Data, 10
  • [8] ARTIFACT: A LARGE-SCALE DATASET WITH ARTIFICIAL AND FACTUAL IMAGES FOR GENERALIZABLE AND ROBUST SYNTHETIC IMAGE DETECTION
    Rahman, Md Awsafur
    Paul, Bishmoy
    Sarker, Najibul Haque
    Hakim, Zaber Ibn Abdul
    Fattah, Shaikh Anowarul
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2200 - 2204
  • [9] SegTex: A Large Scale Synthetic Face Dataset for Face Recognition
    Ambardi, Laudwika
    Hong, Sungeun
    Park, In Kyu
    IEEE ACCESS, 2023, 11 : 131939 - 131949
  • [10] AIFood: A Large Scale Food Images Dataset for Ingredient Recognition
    Lee, Gwo Giun
    Huang, Chin-Wei
    Chen, Jia-Hong
    Chen, Shih-Yu
    Chen, Hsiu-Ling
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 802 - 805