Semi-Supervised and Long-Tailed Object Detection with CascadeMatch

被引:0
作者
Yuhang Zang
Kaiyang Zhou
Chen Huang
Chen Change Loy
机构
[1] Nanyang Technological University,S
[2] Apple Inc.,Lab
来源
International Journal of Computer Vision | 2023年 / 131卷
关键词
Object detection; Long-tailed learning; Semi-supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature. We propose a novel pseudo-labeling-based detector called CascadeMatch. Our detector features a cascade network architecture, which has multi-stage detection heads with progressive confidence thresholds. To avoid manually tuning the thresholds, we design a new adaptive pseudo-label mining mechanism to automatically identify suitable values from data . To mitigate confirmation bias, where a model is negatively reinforced by incorrect pseudo-labels produced by itself, each detection head is trained by the ensemble pseudo-labels of all detection heads. Experiments on two long-tailed datasets, i.e., LVIS and COCO-LT, demonstrate that CascadeMatch surpasses existing state-of-the-art semi-supervised approaches—across a wide range of detection architectures—in handling long-tailed object detection. For instance, CascadeMatch outperforms Unbiased Teacher by 1.9 APFix\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {AP}^{{\text {Fix}}}$$\end{document} on LVIS when using a ResNet50-based Cascade R-CNN structure, and by 1.7 APFix\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {AP}^{{\text {Fix}}}$$\end{document} when using Sparse R-CNN with a Transformer encoder. We also show that CascadeMatch can even handle the challenging sparsely annotated object detection problem. Code: https://github.com/yuhangzang/CascadeMatch.
引用
收藏
页码:987 / 1001
页数:14
相关论文
empty
未找到相关数据