Objects365: A Large-scale, High-quality Dataset for Object Detection

Cited by: 347
Authors
Shao, Shuai [1]
Li, Zeming [1]
Zhang, Tianyuan [1]
Peng, Chao [1]
Yu, Gang [1]
Zhang, Xiangyu [1]
Li, Jing [1]
Sun, Jian [1]
Affiliations
[1] Megvii Technology, Beijing, People's Republic of China
Source
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019
DOI
10.1109/ICCV.2019.00852
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce a new large-scale object detection dataset, Objects365, which has 365 object categories over 600K training images. More than 10 million high-quality bounding boxes are manually labeled through a carefully designed three-step annotation pipeline. It is the largest object detection dataset (with full annotation) so far and establishes a more challenging benchmark for the community. Objects365 can serve as a better feature learning dataset for localization-sensitive tasks like object detection and semantic segmentation. The Objects365 pretrained models significantly outperform ImageNet pretrained models, with a 5.6-point gain (42 vs 36.4) under the standard setting of 90K iterations on the COCO benchmark. Even compared with a much longer training schedule of 540K iterations, our Objects365 pretrained model with 90K iterations still has a 2.7-point gain (42 vs 39.3). Meanwhile, the finetuning time can be greatly reduced (up to 10 times) when reaching the same accuracy. The better generalization ability of Objects365 has also been verified on CityPersons, VOC segmentation, and ADE tasks. The dataset as well as the pretrained models have been released at www.objects365.org.
Pages: 8429-8438
Page count: 10