ColorRL: Reinforced Coloring for End-to-End Instance Segmentation

被引:0
作者
Tuan, Tran Anh [1 ]
Khoa, Nguyen Tuan [1 ]
Tran Minh Quan [2 ,3 ]
Jeong, Won-Ki [4 ]
机构
[1] UNIST, Dept Comp Sci & Engn, Ulsan, South Korea
[2] VinBrain, Dept Appl Sci, Hanoi, Vietnam
[3] VinUniversity, Hanoi, Vietnam
[4] Korea Univ, Dept Comp Sci & Engn, Seoul, South Korea
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
新加坡国家研究基金会;
关键词
NETWORKS;
D O I
10.1109/CVPR46437.2021.01645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance segmentation, the task of identifying and separating each individual object of interest in the image, is one of the actively studied research topics in computer vision. Although many feed-forward networks produce high-quality binary segmentation on different types of images, their final result heavily relies on the post-processing step, which separates instances from the binary mask. In comparison, the existing iterative methods extract a single object at a time using discriminative knowledge-based properties (e.g., shapes, boundaries, etc.) without relying on postprocessing. However, they do not scale well with a large number of objects. To exploit the advantages of conventional sequential segmentation methods without impairing the scalability, we propose a novel iterative deep reinforcement learning agent that learns how to differentiate multiple objects in parallel. By constructing a relational graph between pixels, we design a reward function that encourages separating pixels of different objects and grouping pixels that belong to the same instance. We demonstrate that the proposed method can efficiently perform instance segmentation of many objects without heavy post-processing.
引用
收藏
页码:16722 / 16731
页数:10
相关论文
共 50 条
  • [41] On the Impact of Scale-Free Structure on End-to-End TCP Performance
    Sakumoto, Yusuke
    Ohsaki, Hiroyuki
    IEEE 39TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC 2015), VOL 3, 2015, : 652 - 653
  • [42] END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER
    Yang, Xuesong
    Chen, Yun-Nung
    Hakkani-Tur, Dilek
    Crook, Paul
    Li, Xiujun
    Gao, Jianfeng
    Deng, Li
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5690 - 5694
  • [43] An End-to-End Trainable Deep Convolutional Neuro-Fuzzy Classifier
    Yeganejou, Mojtaba
    Kluzinski, Ryan
    Dick, Scott
    Miller, James
    2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2022,
  • [44] Risk-balanced dimensioning and pricing of End-to-End differentiated services
    Gaivoronski, Alexei A.
    Nesse, Per-Jonny
    Osterbo, Olav-Norvald
    Lonsethagen, Hakon
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 254 (02) : 644 - 655
  • [45] Intelligent end-to-end resource virtualization using Service Oriented Architecture
    Onur, E.
    Sfakianakis, E.
    Papagianni, C.
    Karagiannis, G.
    Kontos, T.
    Niemegeers, I.
    Chochliouros, I. P.
    de Groot, S. Heemstra
    Sjodin, P.
    Hidell, M.
    Cinkler, T.
    Maliosz, M.
    Kaklamani, D. I.
    Carapinha, J.
    Belesioti, M.
    Fytros, E.
    2009 IEEE GLOBECOM WORKSHOPS, 2009, : 345 - +
  • [46] Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming
    Ochiai, Tsubasa
    Watanabe, Shinji
    Hori, Takaaki
    Hershey, John R.
    Xiao, Xiong
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1274 - 1288
  • [47] NEURAL DYNAMIC MODE DECOMPOSITION FOR END-TO-END MODELING OF NONLINEAR DYNAMICS
    Iwata, Tomoharu
    Kawahara, Yoshinobu
    JOURNAL OF COMPUTATIONAL DYNAMICS, 2023, 10 (02): : 268 - 280
  • [48] Searching for Life: End-to-End Automated Detection and Characterization of Ediacaran Biosignatures
    Jonnalagedda, Padmaja
    Surprenant, Rachel L.
    Droser, Mary L.
    Bhanu, Bir
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 18
  • [49] End-to-end driving model based on deep learning and attention mechanism
    Zhu, Wuqiang
    Lu, Yang
    Zhang, Yongliang
    Wei, Xing
    Wei, Zhen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3337 - 3348
  • [50] FROM AUDIO TO SEMANTICS: APPROACHES TO END-TO-END SPOKEN LANGUAGE UNDERSTANDING
    Haghani, Parisa
    Narayanan, Arun
    Bacchiani, Michiel
    Chuang, Galen
    Gaur, Neeraj
    Moreno, Pedro
    Prabhavalkar, Rohit
    Qu, Zhongdi
    Waters, Austin
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 720 - 726