Simple Multi-dataset Detection

被引:11
|
作者
Zhou, Xingyi [1 ]
Koltun, Vladlen [2 ]
Krahenbuhl, Philipp [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Apple, Cupertino, CA USA
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022年
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR52688.2022.00742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How do we build a general and broad object detection system? We use all labels of all concepts ever annotated. These labels span diverse datasets with potentially inconsistent taxonomies. In this paper, we present a simple method for training a unified detector on multiple large-scale datasets. We use dataset-specific training protocols and losses, but share a common detection architecture with dataset-specific outputs. We show how to automatically integrate these dataset-specific outputs into a common semantic taxonomy. In contrast to prior work, our approach does not require manual taxonomy reconciliation. Experiments show our learned taxonomy outperforms a expert-designed taxonomy in all datasets. Our multi-dataset detector performs as well as dataset-specific models on each training domain, and can generalize to new unseen dataset without fine-tuning on them. Code is available at https://github.com/xingyizhou/UniDet.
引用
收藏
页码:7561 / 7570
页数:10
相关论文
共 50 条
  • [1] Multi-dataset Detection with Transformers
    Ke, Bo
    Qiao, Ruizhi
    Sun, Xing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2443 - 2449
  • [2] Improving Stance Detection with Multi-Dataset Learning and Knowledge Distillation
    Li, Yingjie
    Zhao, Chenye
    Caragea, Cornelia
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6332 - 6345
  • [3] Longitudinal Multi-Dataset PET Image Reconstruction
    Ellis, Sam
    Reader, Andrew J.
    2017 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2017,
  • [4] Single-dataset Experts for Multi-dataset Question Answering
    Friedman, Dan
    Dodge, Ben
    Chen, Danqi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6128 - 6137
  • [5] Progressive Pseudo Labeling for Multi-Dataset Detection Over Unified Label Space
    Ye, Kai
    Huang, Zepeng
    Xiong, Yilei
    Gao, Yu
    Xie, Jinheng
    Shen, Linlin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 531 - 543
  • [6] ScaleDet: A Scalable Multi-Dataset Object Detector
    Chen, Yanbei
    Wang, Manchen
    Mittal, Abhay
    Xu, Zhenlin
    Favaro, Paolo
    Tighe, Joseph
    Modolo, Davide
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7288 - 7297
  • [7] Interpretable Multi-dataset Evaluation for Named Entity Recognition
    Fu, Jinlan
    Liu, Pengfei
    Neubig, Graham
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6058 - 6069
  • [8] Multi-dataset Training of Transformers for Robust Action Recognition
    Liang, Junwei
    Zhang, Enwei
    Zhang, Jun
    Shen, Chunhua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [9] Multi-Dataset, Multitask Learning of Egocentric Vision Tasks
    Kapidis, Georgios
    Poppe, Ronald
    Veltkamp, Remco C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6618 - 6630
  • [10] An Unsupervised Methodology for the Detection of Epileptic Seizures Using EEG Signals: A Multi-Dataset Evaluation
    Tsiouris, Kostas M.
    Konitsiotis, Spiros
    Markoula, Sofia
    Koutsouris, Dimitrios D.
    Fotiadis, Dimitrios, I
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 3390 - 3393