OW-Adapter: Human-Assisted Open-World Object Detection with a Few Examples

被引：0

作者：

Jamonnak, Suphanut ^{[1
]}

Guo, Jiajing ^{[1
]}

He, Wenbin ^{[1
]}

Gou, Liang ^{[1
]}

Ren, Liu ^{[1
]}

机构：

[1] Bosch Res North Amer, Sunnyvale, CA 94085 USA

来源：

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2024年 / 30卷 / 01期

关键词：

Detectors; Automobiles; Proposals; Object detection; Object recognition; Visual analytics; Training; Open world learning; object detection; continuous learning; human-assisted AI; NOVELTY DETECTION; VISUAL ANALYSIS;

D O I：

10.1109/TVCG.2023.3326577

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Open-world object detection (OWOD) is an emerging computer vision problem that involves not only the identification of predefined object classes, like what general object detectors do, but also detects new unknown objects simultaneously. Recently, several end-to-end deep learning models have been proposed to address the OWOD problem. However, these approaches face several challenges: a) significant changes in both network architecture and training procedure are required; b) they are trained from scratch, which can not leverage existing pre-trained general detectors; c) costly annotations for all unknown classes are needed. To overcome these challenges, we present a visual analytic framework called OW-Adapter. It acts as an adaptor to enable pre-trained general object detectors to handle the OWOD problem. Specifically, OW-Adapter is designed to identify, summarize, and annotate unknown examples with minimal human effort. Moreover, we introduce a lightweight classifier to learn newly annotated unknown classes and plug the classifier into pre-trained general detectors to detect unknown objects. We demonstrate the effectiveness of our framework through two case studies of different domains, including common object recognition and autonomous driving. The studies show that a simple yet powerful adaptor can extend the capability of pre-trained general detectors to detect unknown objects and improve the performance on known classes simultaneously.

引用

页码：694 / 704

页数：11

共 75 条

[1] Do Convolutional Neural Networks Learn Class Hierarchy?
Alsallakh, Bilal
Jourabloo, Amin
Ye, Mao
Liu, Xiaoming
Ren, Liu
[J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 152 - 162
[2] Classifier-Guided Visual Correction of Noisy Labels for Image Classification Tasks
Baeuerle, A.
Neumann, H.
Ropinski, T.
[J]. COMPUTER GRAPHICS FORUM, 2020, 39 (03) : 195 - 205
[3] Behrisch M, 2014, IEEE CONF VIS ANAL, P43, DOI 10.1109/VAST.2014.7042480
[4] Towards Open Set Deep Networks
Bendale, Abhijit
Boult, Terrance E.
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1563 - 1572
[5] Bendale A, 2015, PROC CVPR IEEE, P1893, DOI 10.1109/CVPR.2015.7298799
[6] Bruneau Pierrick, 2013, 2013 17th International Conference on Information Visualisation, P168, DOI 10.1109/IV.2013.21
[7] Caesar H, 2020, PROC CVPR IEEE, P11618, DOI 10.1109/CVPR42600.2020.01164
[8] TargetVue: Visual Analysis of Anomalous User Behaviors in Online Communication Systems
Cao, Nan
Shi, Conglei
Lin, Sabrina
Lu, Jie
Lin, Yu-Ru
Lin, Ching-Yung
[J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) : 280 - 289
[9] Chalapathy R, 2019, Arxiv, DOI arXiv:1901.03407
[10] Chen C., 2021, IEEE Transactions on Visualization and Computer Graphics, V27, P3

← 1 2 3 4 5 6 7 8 →