Actor-Critic Instance Segmentation

被引:9
作者
Araslanov, Nikita [1 ]
Rothkopf, Constantin A. [2 ,3 ]
Roth, Stefan [1 ,3 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] Tech Univ Darmstadt, Inst Psychol, Darmstadt, Germany
[3] Tech Univ Darmstadt, Ctr Cognit Sci, Darmstadt, Germany
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00843
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most approaches to visual scene analysis have emphasised parallel processing of the image elements. However one area in which the sequential nature of vision is apparent, is that of segmenting multiple, potentially similar and partially occluded objects in a scene. In this work, we revisit the recurrent formulation of this challenging problem in the context of reinforcement learning. Motivated by the limitations of the global max-matching assignment of the ground-truth segments to the recurrent states, we develop an actor-critic approach in which the actor recurrently predicts one instance mask at a time and utilises the gradient from a concurrently trained critic network. We formulate the state, action, and the reward such as to let the critic model long-term effects of the current prediction and incorporate this information into the gradient signal. Furthermore, to enable effective exploration in the inherently high-dimensional action space of instance masks, we learn a compact representation using a conditional variational auto-encoder We show that our actor-critic model consistently provides accuracy benefits over the recurrent baseline on standard instance segmentation benchmarks.
引用
收藏
页码:8229 / 8238
页数:10
相关论文
共 42 条
[1]  
[Anonymous], NIPS 2015
[2]  
[Anonymous], 1987, Readings in computer vision
[3]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.39
[4]  
[Anonymous], 2013, P ADV NEUR INF PROC
[5]  
[Anonymous], 2016, PROC INT C MACH LEAR
[6]  
[Anonymous], 2017, INT C LEARNING REPRE
[7]  
[Anonymous], 2016, ICLR
[8]  
[Anonymous], CVPR WORKSH DEEP LEA
[9]  
[Anonymous], 2015, PROC CVPR IEEE
[10]  
[Anonymous], 1998, ADAPTIVE COMPUTATION