ACTIVE REINFORCEMENT LEARNING FOR THE SEMANTIC SEGMENTATION OF IMAGES CAPTURED BY MOBILE SENSORS

被引：0

作者：

Rad, M. Jodeiri ^{[1
]}

Armenakis, C. ^{[1
]}

机构：

[1] York Univ, Lassonde Sch Engn, Dept Earth & Space Sci, Geomat Engn,GeoICT Lab, Toronto, ON, Canada

来源：

XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II | 2022年 / 43-B2卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Semantic Segmentation; Active Learning; Reinforcement Learning; Deep Query Network; Deep Neural Network;

D O I：

10.5194/isprs-archives-XLIII-B2-2022-593-2022

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

In recent years, various Convolutional Neural Networks (CNN) have been used to achieve acceptable performance on semantic segmentation tasks. However, these supervised learning methods require an extensive amount of annotated training data to perform well. Additionally, the model would need to be trained on the same kind of dataset to generalize well for other tasks. Further, commonly real world datasets are usually highly imbalanced. This problem leads to poor performance in the detection of underrepresented classes, which could be the most critical for some applications. The annotation task is time-consuming human labour that creates an obstacle to utilizing supervised learning methods on vision tasks. In this work, we experiment with implementing a reinforced active learning method with a weighted performance metric to reduce human labour while achieving competitive results. A deep Q-network (DQN) is used to find the optimal policy, which would be choosing the most informative regions of the image to be labelled from the unlabelled set. Then, the neural network would be trained with newly labelled data, and its performance would be evaluated. A weighted Intersection over Union (IoU) is used to calculate the rewards for the DQN network. By using weighted IoU, we target to bring more attention to underrepresented classes.

引用

页码：593 / 599

页数：7

共 33 条

[1] [Anonymous], large-scale environments based on pose graph optimization
[2] [Anonymous], 2019, 29 BRIT MACH VIS C B
[3] [Anonymous], 2016, FULLY CONVOLUTIONAL
[4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[5] Bousias Alexakis E., 2021, INT ARCH PHOTOGRAMM, V43, P829
[6] Casanova A., 2020, ARXIV201204027
[7] Chen BK, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1504
[8] Domain Adaptation for Semantic Segmentation with Maximum Squares Loss
Chen, Minghao
Xue, Hongyang
Cai, Deng
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2090 - 2099
[9] Xception: Deep Learning with Depthwise Separable Convolutions
Chollet, Francois
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
[10] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223

← 1 2 3 4 →