UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in Aerial Images

被引:9
作者
He, Boyong [1 ]
Li, Xianjiang [1 ]
Huang, Bo [1 ]
Gu, Enhui [2 ]
Guo, Weijie [3 ]
Wu, Liaoni [1 ]
机构
[1] Xiamen Univ, Sch Aerosp Engn, Xiamen 361102, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Aeronaut & Astronaut, Shanghai 200240, Peoples R China
[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; synthetic data; ship recognition; aerial imagery; VEHICLE DETECTION; TARGET DETECTION;
D O I
10.3390/rs13244999
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
As a data-driven approach, deep learning requires a large amount of annotated data for training to obtain a sufficiently accurate and generalized model, especially in the field of computer vision. However, when compared with generic object recognition datasets, aerial image datasets are more challenging to acquire and more expensive to label. Obtaining a large amount of high-quality aerial image data for object recognition and image understanding is an urgent problem. Existing studies show that synthetic data can effectively reduce the amount of training data required. Therefore, in this paper, we propose the first synthetic aerial image dataset for ship recognition, called UnityShip. This dataset contains over 100,000 synthetic images and 194,054 ship instances, including 79 different ship models in ten categories and six different large virtual scenes with different time periods, weather environments, and altitudes. The annotations include environmental information, instance-level horizontal bounding boxes, oriented bounding boxes, and the type and ID of each ship. This provides the basis for object detection, oriented object detection, fine-grained recognition, and scene recognition. To investigate the applications of UnityShip, the synthetic data were validated for model pre-training and data augmentation using three different object detection algorithms and six existing real-world ship detection datasets. Our experimental results show that for small-sized and medium-sized real-world datasets, the synthetic data achieve an improvement in model pre-training and data augmentation, showing the value and potential of synthetic data in aerial image recognition and understanding tasks.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] A large-scale container dataset and a baseline method for container hole localization
    Yunfeng Diao
    Xin Tang
    He Wang
    Emma Christophine Florence Taylor
    Shirui Xiao
    Mengtian Xie
    Wenming Cheng
    Journal of Real-Time Image Processing, 2022, 19 : 577 - 589
  • [42] PESTD: a large-scale Persian-English scene text dataset
    Rashtehroudi, Atefeh Ranjkesh
    Akoushideh, Alireza
    Shahbahrami, Asadollah
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34793 - 34808
  • [43] Large-scale multiview 3D hand pose dataset
    Gomez-Donoso, Francisco
    Orts-Escolano, Sergio
    Cazorla, Miguel
    IMAGE AND VISION COMPUTING, 2019, 81 : 25 - 33
  • [44] MSIF: Multisize Inference Fusion-Based False Alarm Elimination for Ship Detection in Large-Scale SAR Images
    Zhang, Chao
    Yang, Chule
    Cheng, Kaihui
    Guan, Naiyang
    Dong, Hongbin
    Deng, Baosong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [45] Building Instance Change Detection from Large-Scale Aerial Images using Convolutional Neural Networks and Simulated Samples
    Ji, Shunping
    Shen, Yanyun
    Lu, Meng
    Zhang, Yongjun
    REMOTE SENSING, 2019, 11 (11)
  • [46] Classifying for a Mixture of Object Images and Character Patterns by Using CNN Pre-trained for Large-scale Object Image Dataset
    Shima, Yoshihiro
    Nakashima, Yumi
    Yasuda, Michio
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 2360 - 2365
  • [47] ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network
    Min, Weiqing
    Liu, Linhu
    Wang, Zhiling
    Luo, Zhengdong
    Wei, Xiaoming
    Wei, Xiaolin
    Jiang, Shuqiang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 393 - 401
  • [48] RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection
    Ying, Qichao
    Liu, Jiaxin
    Li, Sheng
    Xu, Haisheng
    Qian, Zhenxing
    Zhang, Xinpeng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 737 - 746
  • [49] MCMOD: The Multi-Category Large-Scale Dataset for Maritime Object Detection
    Sun, Zihao
    Hu, Xiao
    Qi, Yining
    Huang, Yongfeng
    Li, Songbin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 1657 - 1669
  • [50] OLKAVS: AN OPEN LARGE-SCALE KOREAN AUDIO-VISUAL SPEECH DATASET
    Park, Jeongkyun
    Hwang, Jung-Wook
    Choi, Kwanghee
    Lee, Seung-Hyeon
    Ahn, Jun Hwan
    Park, Rae-Hong
    Park, Hyung-Min
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6385 - 6389