Weakly-supervised Semantic Segmentation in Cityscape via Hyperspectral Image

Cited by: 8
Authors
Huang, Yuxing [1 ]
Shen, Qiu [1 ]
Fu, Ying [2 ]
You, Shaodi [3 ]
Affiliations
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Univ Amsterdam, Comp Vis Res Grp, Amsterdam, Netherlands
Source
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021
Keywords
VIDEO;
DOI
10.1109/ICCVW54120.2021.00131
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hyperspectral images (HSIs) contain the response of each pixel in different spectral bands, which can be used to effectively distinguish various objects in complex scenes. Although HSI cameras have become low-cost, algorithms based on them have not been well explored. In this paper, we focus on a novel topic: weakly-supervised semantic segmentation in cityscapes via HSIs. It is based on the idea that high-resolution HSIs of city scenes contain rich spectral information that can be easily associated with semantics without manual labeling, which enables low-cost, highly reliable semantic segmentation in complex scenes. Specifically, we theoretically analyze the HSIs and introduce a weakly-supervised HSI semantic segmentation framework that uses spectral information to refine coarse labels into finer ones. The experimental results show that our method obtains highly competitive labels and, in some classes, even finer edges than manually annotated fine labels. The results also show that the refined labels effectively improve the performance of existing semantic segmentation algorithms. The combination of HSIs and semantic segmentation shows that HSIs have great potential in high-level vision tasks for autonomous driving.
Pages: 1117-1126
Number of pages: 10