ColorRL: Reinforced Coloring for End-to-End Instance Segmentation

被引：0

作者：

Tuan, Tran Anh ^{[1
]}

Khoa, Nguyen Tuan ^{[1
]}

Tran Minh Quan ^{[2
,3
]}

Jeong, Won-Ki ^{[4
]}

机构：

[1] UNIST, Dept Comp Sci & Engn, Ulsan, South Korea

[2] VinBrain, Dept Appl Sci, Hanoi, Vietnam

[3] VinUniversity, Hanoi, Vietnam

[4] Korea Univ, Dept Comp Sci & Engn, Seoul, South Korea

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

新加坡国家研究基金会;

关键词：

NETWORKS;

D O I：

10.1109/CVPR46437.2021.01645

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Instance segmentation, the task of identifying and separating each individual object of interest in the image, is one of the actively studied research topics in computer vision. Although many feed-forward networks produce high-quality binary segmentation on different types of images, their final result heavily relies on the post-processing step, which separates instances from the binary mask. In comparison, the existing iterative methods extract a single object at a time using discriminative knowledge-based properties (e.g., shapes, boundaries, etc.) without relying on postprocessing. However, they do not scale well with a large number of objects. To exploit the advantages of conventional sequential segmentation methods without impairing the scalability, we propose a novel iterative deep reinforcement learning agent that learns how to differentiate multiple objects in parallel. By constructing a relational graph between pixels, we design a reward function that encourages separating pixels of different objects and grouping pixels that belong to the same instance. We demonstrate that the proposed method can efficiently perform instance segmentation of many objects without heavy post-processing.

引用

页码：16722 / 16731

页数：10

共 50 条

[41] On the Impact of Scale-Free Structure on End-to-End TCP Performance
Sakumoto, Yusuke
Ohsaki, Hiroyuki
IEEE 39TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC 2015), VOL 3, 2015, : 652 - 653
[42] END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER
Yang, Xuesong
Chen, Yun-Nung
Hakkani-Tur, Dilek
Crook, Paul
Li, Xiujun
Gao, Jianfeng
Deng, Li
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5690 - 5694
[43] An End-to-End Trainable Deep Convolutional Neuro-Fuzzy Classifier
Yeganejou, Mojtaba
Kluzinski, Ryan
Dick, Scott
Miller, James
2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2022,
[44] Risk-balanced dimensioning and pricing of End-to-End differentiated services
Gaivoronski, Alexei A.
Nesse, Per-Jonny
Osterbo, Olav-Norvald
Lonsethagen, Hakon
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 254 (02) : 644 - 655
[45] Intelligent end-to-end resource virtualization using Service Oriented Architecture
Onur, E.
Sfakianakis, E.
Papagianni, C.
Karagiannis, G.
Kontos, T.
Niemegeers, I.
Chochliouros, I. P.
de Groot, S. Heemstra
Sjodin, P.
Hidell, M.
Cinkler, T.
Maliosz, M.
Kaklamani, D. I.
Carapinha, J.
Belesioti, M.
Fytros, E.
2009 IEEE GLOBECOM WORKSHOPS, 2009, : 345 - +
[46] Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming
Ochiai, Tsubasa
Watanabe, Shinji
Hori, Takaaki
Hershey, John R.
Xiao, Xiong
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1274 - 1288
[47] NEURAL DYNAMIC MODE DECOMPOSITION FOR END-TO-END MODELING OF NONLINEAR DYNAMICS
Iwata, Tomoharu
Kawahara, Yoshinobu
JOURNAL OF COMPUTATIONAL DYNAMICS, 2023, 10 (02): : 268 - 280
[48] Searching for Life: End-to-End Automated Detection and Characterization of Ediacaran Biosignatures
Jonnalagedda, Padmaja
Surprenant, Rachel L.
Droser, Mary L.
Bhanu, Bir
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 18
[49] End-to-end driving model based on deep learning and attention mechanism
Zhu, Wuqiang
Lu, Yang
Zhang, Yongliang
Wei, Xing
Wei, Zhen
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3337 - 3348
[50] FROM AUDIO TO SEMANTICS: APPROACHES TO END-TO-END SPOKEN LANGUAGE UNDERSTANDING
Haghani, Parisa
Narayanan, Arun
Bacchiani, Michiel
Chuang, Galen
Gaur, Neeraj
Moreno, Pedro
Prabhavalkar, Rohit
Qu, Zhongdi
Waters, Austin
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 720 - 726

← 1 2 3 4 5 →