Multi-Scale Guided Mask Refinement for Coarse-to-Fine RGB-D Perception

被引:3
|
作者
Chen, Chongyu [1 ]
Huang, Haoguang [1 ]
Chen, Chuangrong [1 ]
Zheng, Zhuoqi [1 ]
Cheng, Hui [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; sensor fusion; edge-preserving filtering; OBJECT CLASSIFICATION; SEGMENTATION; COLOR; DEPTH;
D O I
10.1109/LSP.2018.2886470
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Pixel-level object segmentation is highly desired in many vision applications. Although segmentation methods purely based on visual input have achieved great success in the past decade, their further improvement is still hindered by the intrinsic drawback of color camouflage. With the rapid development and wide deployment of depth sensors, depth assisted methods are increasingly popular in visual perception systems. It is expected that RGB-D based methods can lead to significant performance improvements because color and depth are naturally complementary. However, how to merge color and depth modalities for segmentation with both high efficiency and high accuracy remains an open problem to be addressed. In this letter, we propose to divide the segmentation process into "coarse" and "refining" stages because a coarse segmentation can be easily obtained by various light-weight methods. In this way, we can tackle this problem by focusing on the refinement of coarse segmentations. In particular, we propose a multi-scale approach that selectively inherits the effective features of both edge-preserving filtering and deep neural networks. The proposed approach is evaluated on several bench-mark datasets, respectively, using the coarse segmentations from background subtraction and object detection as the input. Numerous results indicate that our approach can achieve significant accuracy improvements compared to other alternatives, demonstrating superior edge-preserving capability. Besides an effective method for merging RGB-D information, our study on the capability of coarse-to-fine refinement also brings new inspirations for designing light-weight perception systems.
引用
收藏
页码:217 / 221
页数:5
相关论文
共 50 条
  • [1] Feature enhancement and coarse-to-fine detection for RGB-D tracking
    Zhu, Xue-Feng
    Xu, Tianyang
    Wu, Xiao-Jun
    Kittler, Josef
    PATTERN RECOGNITION LETTERS, 2024, 179 : 130 - 136
  • [2] Coarse-to-Fine semantic parsing method for RGB-D indoor scenes
    Liu T.
    Feng X.
    Gu Y.
    Dai X.
    Luo J.
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2016, 46 (04): : 681 - 687
  • [3] Multi-scale iterative refinement network for RGB-D salient object detection
    Liu, Ze-Yu
    Liu, Jian-Wei
    Zuo, Xin
    Hu, Ming-Fei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [4] Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation
    Li, Chen
    Song, Li
    Zou, Xueyi
    Guo, Jiaming
    Yan, Youliang
    Zhang, Wenjun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5201 - 5209
  • [5] Mapping Indoor Spaces by Adaptive Coarse-to-Fine Registration of RGB-D Data
    dos Santos, Daniel R.
    Basso, Marcos A.
    Khoshelham, Kourosh
    de Oliveira, Elizeu, Jr.
    Pavan, Nadisson L.
    Vosselman, George
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (02) : 262 - 266
  • [6] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
    Hao Zhao
    Jingrun Zheng
    Xiaoke Shang
    Wei Zhong
    Jinyuan Liu
    The Visual Computer, 2024, 40 : 1697 - 1710
  • [7] Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion
    Zhao, Hao
    Zheng, Jingrun
    Shang, Xiaoke
    Zhong, Wei
    Liu, Jinyuan
    VISUAL COMPUTER, 2024, 40 (03): : 1697 - 1710
  • [8] Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection
    Li, Zekun
    Liu, Yufan
    Li, Bing
    Hu, Weiming
    Zhou, Xue
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
    Zhang, Fan
    Liu, Na
    Duan, Fuqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2621 - 2633
  • [10] Regression Forest Based RGB-D Visual Relocalization Using Coarse-to-Fine Strategy
    Wang, Jikai
    Wang, Peng
    Dai, Deyun
    Xu, Meng
    Chen, Zonghai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4431 - 4438