Plain-Det: A Plain Multi-dataset Object Detector

被引:0
|
作者
Shi, Cheng [1 ]
Zhu, Yuchen [1 ]
Yang, Sibei [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Object detection; Multiple datasets; Proposal generation;
D O I
10.1007/978-3-031-72652-1_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advancements in large-scale foundational models have sparked widespread interest in training highly proficient large vision models. A common consensus revolves around the necessity of aggregating extensive, high-quality annotated data. However, given the inherent challenges in annotating dense tasks in computer vision, such as object detection and segmentation, a practical strategy is to combine and leverage all available data for training purposes. In this work, we propose Plain-Det, which offers flexibility to accommodate new datasets, robustness in performance across diverse datasets, training efficiency, and compatibility with various detection architectures. We utilize Def-DETR, with the assistance of Plain-Det, to achieve a mAP of 51.9 on COCO, matching the current state-of-the-art detectors. We conduct extensive experiments on 13 downstream datasets and Plain-Det demonstrates strong generalization capability. Code is release at https://github.com/ChengShiest/Plain-Det.
引用
收藏
页码:210 / 226
页数:17
相关论文
共 50 条
  • [1] ScaleDet: A Scalable Multi-Dataset Object Detector
    Chen, Yanbei
    Wang, Manchen
    Mittal, Abhay
    Xu, Zhenlin
    Favaro, Paolo
    Tighe, Joseph
    Modolo, Davide
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7288 - 7297
  • [2] Multi-dataset Detection with Transformers
    Ke, Bo
    Qiao, Ruizhi
    Sun, Xing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2443 - 2449
  • [3] Simple Multi-dataset Detection
    Zhou, Xingyi
    Koltun, Vladlen
    Krahenbuhl, Philipp
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7561 - 7570
  • [4] Longitudinal Multi-Dataset PET Image Reconstruction
    Ellis, Sam
    Reader, Andrew J.
    2017 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2017,
  • [5] Single-dataset Experts for Multi-dataset Question Answering
    Friedman, Dan
    Dodge, Ben
    Chen, Danqi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6128 - 6137
  • [6] Interpretable Multi-dataset Evaluation for Named Entity Recognition
    Fu, Jinlan
    Liu, Pengfei
    Neubig, Graham
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6058 - 6069
  • [7] Multi-dataset Training of Transformers for Robust Action Recognition
    Liang, Junwei
    Zhang, Enwei
    Zhang, Jun
    Shen, Chunhua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [8] Multi-Dataset, Multitask Learning of Egocentric Vision Tasks
    Kapidis, Georgios
    Poppe, Ronald
    Veltkamp, Remco C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6618 - 6630
  • [9] Multi-dataset Low-rank Matrix Factorization
    Valavi, Hossein
    Ramadge, Peter J.
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
  • [10] Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection
    Zhang, Bo
    Yuan, Jiakang
    Shi, Botian
    Chen, Tao
    Li, Yikang
    Qiao, Yu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9253 - 9262