A Likelihood Ratio-Based Approach to Segmenting Unknown Objects

Citations: 0
Authors
Nayal, Nazir [1,2]
Shoeb, Youssef [3,4]
Guney, Fatma [1,2]
Affiliations
[1] Koc Univ, Comp Engn Dept, Istanbul, Turkiye
[2] KUIS AI Ctr, Istanbul, Turkiye
[3] Continental AG, Hannover, Germany
[4] Tech Univ Berlin, Berlin, Germany
Funding
European Research Council
Keywords
Anomaly Segmentation; Out-of-Distribution Detection; Likelihood Ratio; Unknown Segmentation; OoD Segmentation; Foundational Models for OoD; Uncertainty
DOI
10.1007/s11263-025-02509-0
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Addressing the Out-of-Distribution (OoD) segmentation task is a prerequisite for perception systems operating in an open-world environment. Large foundational models are frequently used in downstream tasks; however, their potential for OoD segmentation remains mostly unexplored. We seek to leverage a large foundational model to achieve a robust representation. Outlier supervision is a widely used strategy for improving the OoD detection of existing segmentation networks. However, current approaches to outlier supervision involve retraining parts of the original network, which typically disrupts the model's learned feature representation. Furthermore, retraining becomes infeasible in the case of large foundational models. Our goal is to retrain for outlier segmentation without compromising the strong representation space of the foundational model. To this end, we propose an adaptive, lightweight unknown estimation module (UEM) for outlier supervision that significantly enhances OoD segmentation performance without affecting the learned feature representation of the original network. UEM learns a distribution for outliers and a generic distribution for known classes. Using the learned distributions, we propose a likelihood-ratio-based outlier scoring function that fuses the confidence of UEM with that of the pixel-wise segmentation inlier network to detect unknown objects. We also propose an objective to optimize this score directly. Our approach achieves a new state of the art across multiple datasets, outperforming the previous best method by 5.74 percentage points in average precision while having a lower false-positive rate. Importantly, strong inlier performance remains unaffected. The code and pre-trained models are available at: https://github.com/NazirNayal8/UEM-likelihood-ratio.
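As a rough illustration of what a likelihood-ratio-based outlier score can look like, the following minimal sketch computes a per-pixel log-likelihood ratio from two sets of logits. It is not taken from the paper or its repository: the function name `likelihood_ratio_score`, the two-channel layout of the UEM output, and the particular way the two confidences are fused (adding the segmentation network's maximum class log-probability to the inlier term) are all assumptions made for illustration only.

```python
import numpy as np


def log_softmax(logits, axis=-1):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def likelihood_ratio_score(seg_logits, uem_logits):
    """Hypothetical per-pixel outlier score; higher means more likely unknown.

    seg_logits: (H, W, K) class logits from the inlier segmentation network.
    uem_logits: (H, W, 2) logits assumed to be ordered [inlier, outlier].
    """
    uem_logp = log_softmax(uem_logits)                # log p(inlier), log p(outlier)
    seg_conf = log_softmax(seg_logits).max(axis=-1)   # max class log-probability
    log_p_out = uem_logp[..., 1]
    # Assumed fusion: inlier evidence combines both networks' confidence.
    log_p_in = uem_logp[..., 0] + seg_conf
    return log_p_out - log_p_in                       # log-likelihood ratio


# Two pixels: one confidently known, one that both networks find unfamiliar.
seg = np.array([[[10.0, 0.0, 0.0], [0.0, 0.0, 0.0]]])
uem = np.array([[[5.0, -5.0], [-5.0, 5.0]]])
scores = likelihood_ratio_score(seg, uem)
print(scores.shape)
```

Under these assumptions, the first pixel receives a negative score and the second a positive one, so thresholding the score at some value separates known from unknown pixels.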
Pages: 13