3D LayoutCRF for multi-view object class recognition and segmentation

Cited by: 0
Authors
Hoiem, Derek [1]
Rother, Carsten [2]
Winn, John [2]
Affiliations
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[2] Microsoft Res Cambridge, Cambridge, England
Keywords
DOI
Not available
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
We introduce an approach to accurately detect and segment partially occluded objects in various viewpoints and scales. Our main contribution is a novel framework for combining object-level descriptions (such as position, shape, and color) with pixel-level appearance, boundary, and occlusion reasoning. In training, we exploit a rough 3D object model to learn physically localized part appearances. To find and segment objects in an image, we generate proposals based on the appearance and layout of local parts. The proposals are then refined after incorporating object-level information, and overlapping objects compete for pixels to produce a final description and segmentation of objects in the scene. A further contribution is a novel instance penalty, which is handled very efficiently during inference. We experimentally validate our approach on the challenging PASCAL'06 car database.
Pages: 580 / +
Page count: 2