OW-Adapter: Human-Assisted Open-World Object Detection with a Few Examples

被引:0
作者
Jamonnak, Suphanut [1 ]
Guo, Jiajing [1 ]
He, Wenbin [1 ]
Gou, Liang [1 ]
Ren, Liu [1 ]
机构
[1] Bosch Res North Amer, Sunnyvale, CA 94085 USA
关键词
Detectors; Automobiles; Proposals; Object detection; Object recognition; Visual analytics; Training; Open world learning; object detection; continuous learning; human-assisted AI; NOVELTY DETECTION; VISUAL ANALYSIS;
D O I
10.1109/TVCG.2023.3326577
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Open-world object detection (OWOD) is an emerging computer vision problem that involves not only the identification of predefined object classes, like what general object detectors do, but also detects new unknown objects simultaneously. Recently, several end-to-end deep learning models have been proposed to address the OWOD problem. However, these approaches face several challenges: a) significant changes in both network architecture and training procedure are required; b) they are trained from scratch, which can not leverage existing pre-trained general detectors; c) costly annotations for all unknown classes are needed. To overcome these challenges, we present a visual analytic framework called OW-Adapter. It acts as an adaptor to enable pre-trained general object detectors to handle the OWOD problem. Specifically, OW-Adapter is designed to identify, summarize, and annotate unknown examples with minimal human effort. Moreover, we introduce a lightweight classifier to learn newly annotated unknown classes and plug the classifier into pre-trained general detectors to detect unknown objects. We demonstrate the effectiveness of our framework through two case studies of different domains, including common object recognition and autonomous driving. The studies show that a simple yet powerful adaptor can extend the capability of pre-trained general detectors to detect unknown objects and improve the performance on known classes simultaneously.
引用
收藏
页码:694 / 704
页数:11
相关论文
共 75 条
  • [1] Do Convolutional Neural Networks Learn Class Hierarchy?
    Alsallakh, Bilal
    Jourabloo, Amin
    Ye, Mao
    Liu, Xiaoming
    Ren, Liu
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 152 - 162
  • [2] Classifier-Guided Visual Correction of Noisy Labels for Image Classification Tasks
    Baeuerle, A.
    Neumann, H.
    Ropinski, T.
    [J]. COMPUTER GRAPHICS FORUM, 2020, 39 (03) : 195 - 205
  • [3] Behrisch M, 2014, IEEE CONF VIS ANAL, P43, DOI 10.1109/VAST.2014.7042480
  • [4] Towards Open Set Deep Networks
    Bendale, Abhijit
    Boult, Terrance E.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1563 - 1572
  • [5] Bendale A, 2015, PROC CVPR IEEE, P1893, DOI 10.1109/CVPR.2015.7298799
  • [6] Bruneau Pierrick, 2013, 2013 17th International Conference on Information Visualisation, P168, DOI 10.1109/IV.2013.21
  • [7] Caesar H, 2020, PROC CVPR IEEE, P11618, DOI 10.1109/CVPR42600.2020.01164
  • [8] TargetVue: Visual Analysis of Anomalous User Behaviors in Online Communication Systems
    Cao, Nan
    Shi, Conglei
    Lin, Sabrina
    Lu, Jie
    Lin, Yu-Ru
    Lin, Ching-Yung
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) : 280 - 289
  • [9] Chalapathy R, 2019, Arxiv, DOI arXiv:1901.03407
  • [10] Chen C., 2021, IEEE Transactions on Visualization and Computer Graphics, V27, P3