Unveiling the unseen: novel strategies for object detection beyond known distributions

Cited by: 0

Authors:

Devi, S. ^{[1]}

Dayana, R. ^{[1]}

Malarvezhi, P. ^{[2]}

Affiliations:

[1] SRM Inst Sci & Technol, Dept Elect & Commun Engn, Kattankulathur 603203, Tamil Nadu, India

[2] ADP India, iHCM Res & Dev, Chennai 600032, Tamil Nadu, India

Source:

PATTERN ANALYSIS AND APPLICATIONS | 2024 / Vol. 27 / Issue 04

Keywords:

Object detection; OOD generalization; Covariate shifts; Semantic shifts; Model regularization; Robustness

DOI:

10.1007/s10044-024-01334-4

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In contemporary machine learning, models often struggle with variations in data distribution, which severely impair their out-of-distribution (OOD) generalization and detection capabilities. Current object detection methods, which rely on virtual outlier synthesis and class-conditional density estimation, struggle to distinguish OOD samples effectively. They often depend on accurate density estimation and may produce virtual outliers that lack realism, particularly in complex or dynamic environments. Furthermore, previous research has typically addressed covariate and semantic shifts independently, resulting in fragmented solutions that fail to tackle OOD generalization comprehensively. This study addresses these gaps by introducing a unified approach to enhancing OOD generalization in object recognition models. The strategy applies adversarial perturbations to the in-distribution (ID) dataset, simulating real-world scenarios characterized by imperceptible variations and thereby improving the model's resilience to distribution shifts. Additionally, Maximum Mean Discrepancy (MMD) is integrated at the object level to discriminate between ID and OOD samples by quantifying distributional differences. For precise OOD detection, a K-nearest neighbors (KNN) algorithm is used during inference to measure the similarity between each sample and its closest neighbors in the training data. Evaluations on benchmark datasets, with PASCAL VOC and BDD100K as ID and subsets of COCO and Open Images as OOD, demonstrate significant improvements in OOD generalization compared to existing methods. These findings underscore the framework's potential to improve the dependability and flexibility of object recognition systems in practical scenarios, particularly in autonomous vehicles, where accurate object detection under diverse conditions is critical for safety. This research contributes to advancing OOD generalization techniques and lays the groundwork for future refinement to address evolving challenges in machine learning applications. The code can be accessed from https://github.com/DeviSPhd/OODG_OD
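The two scoring ideas named in the abstract, MMD between feature sets and a KNN distance at inference, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the RBF kernel, the bandwidth heuristic `gamma = 1 / (2 * dim)`, the biased MMD estimator, the plain Euclidean distance, and the function names `rbf_kernel`, `mmd2`, and `knn_score` are all assumptions made for illustration, and the features are synthetic Gaussians rather than detector outputs.

```python
import numpy as np


def rbf_kernel(a, b, gamma):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    sq = (a ** 2).sum(axis=1)[:, None] + (b ** 2).sum(axis=1)[None, :] - 2.0 * a @ b.T
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq, 0.0))


def mmd2(x, y, gamma=None):
    """Biased estimate of the squared MMD between samples x and y.

    Assumption: RBF kernel with gamma = 1 / (2 * dim) when gamma is None.
    """
    if gamma is None:
        gamma = 1.0 / (2.0 * x.shape[1])
    k_xx = rbf_kernel(x, x, gamma).mean()
    k_yy = rbf_kernel(y, y, gamma).mean()
    k_xy = rbf_kernel(x, y, gamma).mean()
    return k_xx + k_yy - 2.0 * k_xy


def knn_score(train, queries, k=5):
    """Euclidean distance from each query to its k-th nearest training feature.

    Larger values mean the query lies farther from the training data,
    which is read here as "more OOD-like".
    """
    dists = np.linalg.norm(queries[:, None, :] - train[None, :, :], axis=2)
    return np.sort(dists, axis=1)[:, k - 1]


# Synthetic stand-ins for object-level features (hypothetical data).
rng = np.random.default_rng(0)
id_train = rng.normal(0.0, 1.0, size=(200, 16))  # in-distribution training features
id_test = rng.normal(0.0, 1.0, size=(200, 16))   # held-out in-distribution features
ood_test = rng.normal(3.0, 1.0, size=(200, 16))  # mean-shifted features treated as OOD

print("MMD^2 ID vs ID :", mmd2(id_train, id_test))
print("MMD^2 ID vs OOD:", mmd2(id_train, ood_test))
print("mean KNN score, ID :", knn_score(id_train, id_test).mean())
print("mean KNN score, OOD:", knn_score(id_train, ood_test).mean())
```

In a detector, a sample would typically be flagged as OOD when its KNN score exceeds a threshold chosen on held-out ID data; how the paper sets that threshold is not stated in the abstract, so it is left out of the sketch.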

References

Pages: 15

46 entries in total

