BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

被引:27
|
作者
Li, Daiqing [1 ]
Ling, Huan [1 ,2 ,3 ]
Kim, Seung Wook [1 ,2 ,3 ]
Kreis, Karsten [1 ]
Fidler, Sanja [1 ,2 ,3 ]
Torralba, Antonio [4 ]
机构
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Univ Toronto, Toronto, ON, Canada
[3] Vector Inst, Bengaluru, India
[4] MIT, Cambridge, MA 02139 USA
关键词
D O I
10.1109/CVPR52688.2022.02064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Annotating images with pixel-wise labels is a time consuming and costly process. Recently, DatasetGAN [78] showcased a promising alternative - to synthesize a large labeled dataset via a generative adversarial network (GAN) by exploiting a small set of manually labeled, GAN generated images. Here, we scale DatasetGAN to ImageNet scale of class diversity. We take image samples from the class-conditional generative model BigGAN [5] trained on ImageNet, and manually annotate only 5 images per class, for all 1k classes. By training an effective feature segmentation architecture on top of BigGAN, we turn Big GAN into a labeled dataset generator. We further show that VQGAN [18] can similarly serve as a dataset generator, leveraging the already annotated data. We create a new ImageNet benchmark by labeling an additional set of real images and evaluate segmentation performance in a variety of settings. Through an extensive ablation study, we show big gains in leveraging a large generated dataset to train different supervised and self-supervised backbone models on pixel-wise tasks. Furthermore, we demonstrate that using our synthesized datasets for pre-training leads to improvements over standard ImageNet pre-training on several downstream datasets, such as PASCAL-VOC, MS-COCO, Cityscapes and chest X-ray, as well as tasks (detection, segmentation). Our benchmark will be made public and maintain a leaderboard for this challenging task. Project Page: https://nv-tlabs.github.io/big-datasetgan/
引用
收藏
页码:21298 / 21308
页数:11
相关论文
共 50 条
  • [1] Pixel-wise video stabilization
    Wang, Zhongqiang
    Huang, Hua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (23) : 15939 - 15954
  • [2] Pixel-wise video stabilization
    Zhongqiang Wang
    Hua Huang
    Multimedia Tools and Applications, 2016, 75 : 15939 - 15954
  • [3] Pixel-Wise Contrastive Distillation
    Huang, Junqiang
    Guo, Zichao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16313 - 16323
  • [4] Bioinspired pixel-wise adaptive imaging
    Brinkworth, Russell S. A.
    Mah, Eng-Leng
    O'Carroll, David C.
    SMART STRUCTURES, DEVICES, AND SYSTEMS III, 2007, 6414
  • [5] Tracking Objects as Pixel-Wise Distributions
    Zhao, Zelin
    Wu, Ze
    Zhuang, Yueqing
    Li, Boxun
    Jia, Jiaya
    COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 76 - 94
  • [6] Pixel-Wise Quantization for Image Compression
    Wei, Liang
    Chen, Fangdong
    Wang, Li
    Wu, Xiaoyang
    Pu, Shiliang
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 372 - 372
  • [7] A New Pixel-Wise Mask for Watermarking
    Nafornita, Corina
    MM&SEC'07: PROCEEDINGS OF THE MULTIMEDIA & SECURITY WORKSHOP 2007, 2007, : 221 - +
  • [8] Abstraction in Pixel-wise Noisy Annotations Can Guide Attention to Improve Prostate Cancer Grade Assessment
    Kim, Hyeongsub
    Kong, Seo Taek
    Lee, Hongseok
    Kim, Kyungdoc
    Jung, Kyu-Hwan
    MEDICAL IMAGE LEARNING WITH LIMITED AND NOISY DATA (MILLAND 2022), 2022, 13559 : 23 - 31
  • [9] Pixel-wise Segmentation of Right Ventricle of Heart
    Dang, Yaman
    Anand, Deepak
    Sethi, Amit
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 1797 - 1802
  • [10] Pixel-Wise Classification in Hippocampus Histological Images
    Vizcaino, Alfonso
    Sanchez-Cruz, Hermilo
    Sossa, Humberto
    Quintanar, J. Luis
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021