Robust pixel-wise detection of road obstacles by integrating composite and real images

被引:0
作者
Munoz, Carlos David Ardon [1 ]
Nishiyama, Masashi [1 ]
Iwai, Yoshio [1 ]
Aoki, Kota [1 ]
机构
[1] Tottori Univ, Grad Sch Engn, 101 Minami 4 Chome,Koyama Cho, Tottori 6808550, Japan
关键词
Computer vision; Anomaly detection; Semantic segmentation;
D O I
10.1016/j.patrec.2025.03.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Safe navigation systems for vehicles use semantic segmentation and other object identification technologies. However, these neural networks are trained on a closed set of known classes, and because of overconfidence in their predictions, it is difficult for them to identify unseen objects. In this study, we propose a framework to identify road obstacles in the drivable area of a driving scene. We built our model on the Synboost framework, integrating a maximized entropy module to generate high-quality priors and image resynthesis as an attention mechanism for a spatial-dissimilarity network. The architecture of our dissimilarity network uses prior images to create a single-channel modulation map at each level of the decoder to weigh features. Moreover, we used a mixture of real and synthetic anomalies during training and fine-tuning. For evaluation and testing, we used only real anomalies. Our experiments demonstrated the importance of using multiple sources of anomalies during training for robust pixel-wise anomaly detection, as all models consistently yielded better results when trained on a combination of real and synthetic images. Additionally, the experimental results demonstrated that, at a 95% true positive rate, our proposed model outperforms previous models in average precision and false positive rate on two test sets.
引用
收藏
页码:106 / 112
页数:7
相关论文
共 33 条
  • [1] Ardon Munoz Carlos David, 2023, Pattern Recognition: 7th Asian Conference, ACPR 2023, Proceedings. Lecture Notes in Computer Science (14408), P150, DOI 10.1007/978-3-031-47665-5_13
  • [2] Dense open-set recognition based on training with noisy negative images
    Bevandic, Petra
    Kreso, Ivan
    Orsic, Marin
    Segvic, Sinisa
    [J]. IMAGE AND VISION COMPUTING, 2022, 124
  • [3] Simultaneous Semantic Segmentation and Outlier Detection in Presence of Domain Shift
    Bevandic, Petra
    Kreso, Ivan
    Orsic, Marin
    Segvic, Sinisa
    [J]. PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 33 - 47
  • [4] Chan R., 2021, P NEUR INF PROC SYST, V1
  • [5] Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation
    Chan, Robin
    Rottmann, Matthias
    Gottschalk, Hanno
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5108 - 5117
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Is Segmentation Uncertainty Useful?
    Czolbe, Steffen
    Arnavaz, Kasra
    Krause, Oswin
    Feragen, Aasa
    [J]. INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 : 715 - 726
  • [8] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [9] Pixel-wise Anomaly Detection in Complex Driving Scenes
    Di Biase, Giancarlo
    Blum, Hermann
    Siegwart, Roland
    Cadena, Cesar
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16913 - 16922
  • [10] Modeling Visual Context Is Key to Augmenting Object Detection Datasets
    Dvornik, Nikita
    Mairal, Julien
    Schmid, Cordelia
    [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 375 - 391