Multi-Scale Guided Mask Refinement for Coarse-to-Fine RGB-D Perception

被引：3

作者：

Chen, Chongyu ^{[1
]}

Huang, Haoguang ^{[1
]}

Chen, Chuangrong ^{[1
]}

Zheng, Zhuoqi ^{[1
]}

Cheng, Hui ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2019年 / 26卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Image segmentation; sensor fusion; edge-preserving filtering; OBJECT CLASSIFICATION; SEGMENTATION; COLOR; DEPTH;

D O I：

10.1109/LSP.2018.2886470

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Pixel-level object segmentation is highly desired in many vision applications. Although segmentation methods purely based on visual input have achieved great success in the past decade, their further improvement is still hindered by the intrinsic drawback of color camouflage. With the rapid development and wide deployment of depth sensors, depth assisted methods are increasingly popular in visual perception systems. It is expected that RGB-D based methods can lead to significant performance improvements because color and depth are naturally complementary. However, how to merge color and depth modalities for segmentation with both high efficiency and high accuracy remains an open problem to be addressed. In this letter, we propose to divide the segmentation process into "coarse" and "refining" stages because a coarse segmentation can be easily obtained by various light-weight methods. In this way, we can tackle this problem by focusing on the refinement of coarse segmentations. In particular, we propose a multi-scale approach that selectively inherits the effective features of both edge-preserving filtering and deep neural networks. The proposed approach is evaluated on several bench-mark datasets, respectively, using the coarse segmentations from background subtraction and object detection as the input. Numerous results indicate that our approach can achieve significant accuracy improvements compared to other alternatives, demonstrating superior edge-preserving capability. Besides an effective method for merging RGB-D information, our study on the capability of coarse-to-fine refinement also brings new inspirations for designing light-weight perception systems.

引用

页码：217 / 221

页数：5

共 50 条

[1] Feature enhancement and coarse-to-fine detection for RGB-D tracking
Zhu, Xue-Feng
Xu, Tianyang
Wu, Xiao-Jun
Kittler, Josef
PATTERN RECOGNITION LETTERS, 2024, 179 : 130 - 136
[2] Coarse-to-Fine semantic parsing method for RGB-D indoor scenes
Liu T.
Feng X.
Gu Y.
Dai X.
Luo J.
Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2016, 46 (04): : 681 - 687
[3] Multi-scale iterative refinement network for RGB-D salient object detection
Liu, Ze-Yu
Liu, Jian-Wei
Zuo, Xin
Hu, Ming-Fei
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
[4] Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation
Li, Chen
Song, Li
Zou, Xueyi
Guo, Jiaming
Yan, Youliang
Zhang, Wenjun
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5201 - 5209
[5] Mapping Indoor Spaces by Adaptive Coarse-to-Fine Registration of RGB-D Data
dos Santos, Daniel R.
Basso, Marcos A.
Khoshelham, Kourosh
de Oliveira, Elizeu, Jr.
Pavan, Nadisson L.
Vosselman, George
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (02) : 262 - 266
[6] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
Hao Zhao
Jingrun Zheng
Xiaoke Shang
Wei Zhong
Jinyuan Liu
The Visual Computer, 2024, 40 : 1697 - 1710
[7] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
Zhao, Hao
Zheng, Jingrun
Shang, Xiaoke
Zhong, Wei
Liu, Jinyuan
VISUAL COMPUTER, 2024, 40 (03): : 1697 - 1710
[8] Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection
Li, Zekun
Liu, Yufan
Li, Bing
Hu, Weiming
Zhou, Xue
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[9] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
Zhang, Fan
Liu, Na
Duan, Fuqing
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2621 - 2633
[10] Regression Forest Based RGB-D Visual Relocalization Using Coarse-to-Fine Strategy
Wang, Jikai
Wang, Peng
Dai, Deyun
Xu, Meng
Chen, Zonghai
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4431 - 4438

← 1 2 3 4 5 →