ACTIVE REINFORCEMENT LEARNING FOR THE SEMANTIC SEGMENTATION OF IMAGES CAPTURED BY MOBILE SENSORS

被引:0
作者
Rad, M. Jodeiri [1 ]
Armenakis, C. [1 ]
机构
[1] York Univ, Lassonde Sch Engn, Dept Earth & Space Sci, Geomat Engn,GeoICT Lab, Toronto, ON, Canada
来源
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II | 2022年 / 43-B2卷
基金
加拿大自然科学与工程研究理事会;
关键词
Semantic Segmentation; Active Learning; Reinforcement Learning; Deep Query Network; Deep Neural Network;
D O I
10.5194/isprs-archives-XLIII-B2-2022-593-2022
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In recent years, various Convolutional Neural Networks (CNN) have been used to achieve acceptable performance on semantic segmentation tasks. However, these supervised learning methods require an extensive amount of annotated training data to perform well. Additionally, the model would need to be trained on the same kind of dataset to generalize well for other tasks. Further, commonly real world datasets are usually highly imbalanced. This problem leads to poor performance in the detection of underrepresented classes, which could be the most critical for some applications. The annotation task is time-consuming human labour that creates an obstacle to utilizing supervised learning methods on vision tasks. In this work, we experiment with implementing a reinforced active learning method with a weighted performance metric to reduce human labour while achieving competitive results. A deep Q-network (DQN) is used to find the optimal policy, which would be choosing the most informative regions of the image to be labelled from the unlabelled set. Then, the neural network would be trained with newly labelled data, and its performance would be evaluated. A weighted Intersection over Union (IoU) is used to calculate the rewards for the DQN network. By using weighted IoU, we target to bring more attention to underrepresented classes.
引用
收藏
页码:593 / 599
页数:7
相关论文
共 33 条
  • [1] [Anonymous], large-scale environments based on pose graph optimization
  • [2] [Anonymous], 2019, 29 BRIT MACH VIS C B
  • [3] [Anonymous], 2016, FULLY CONVOLUTIONAL
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Bousias Alexakis E., 2021, INT ARCH PHOTOGRAMM, V43, P829
  • [6] Casanova A., 2020, ARXIV201204027
  • [7] Chen BK, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1504
  • [8] Domain Adaptation for Semantic Segmentation with Maximum Squares Loss
    Chen, Minghao
    Xue, Hongyang
    Cai, Deng
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2090 - 2099
  • [9] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [10] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223