Data-Driven 3D Voxel Patterns for Object Category Recognition

被引:0
|
作者
Xiang, Yu [1 ,2 ]
Choi, Wongun [3 ]
Lin, Yuanqing [3 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] NEC Labs Amer Inc, Irving, TX USA
来源
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2015年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the great progress achieved in recognizing objects as 2D bounding boxes in images, it is still very challenging to detect occluded objects and estimate the 3D properties of multiple objects from a single image. In this paper, we propose a novel object representation, 3D Voxel Pattern (3DVP), that jointly encodes the key properties of objects including appearance, 3D shape, viewpoint, occlusion and truncation. We discover 3DVPs in a data-driven way, and train a bank of specialized detectors for a dictionary of 3DVPs. The 3DVP detectors are capable of detecting objects with specific visibility patterns and transferring the meta-data from the 3DVPs to the detected objects, such as 2D segmentation mask, 3D pose as well as occlusion or truncation boundaries. The transferred meta-data allows us to infer the occlusion relationship among objects, which in turn provides improved object recognition results. Experiments are conducted on the KITTI detection benchmark [17] and the outdoor-scene dataset [41]. We improve stateof-the-art results on car detection and pose estimation with notable margins (6% in difficult data of KITTI). We also verify the ability of our method in accurately segmenting objects from the background and localizing them in 3D.
引用
收藏
页码:1903 / 1911
页数:9
相关论文
共 50 条
  • [1] Three-dimensional template correlation: Object recognition in 3D voxel data
    VanCourt, T
    Gu, Y
    Herbordt, MC
    CAMP 2005: SEVENTH INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION , PROCEEDINGS, 2005, : 153 - 158
  • [2] A 3D OBJECT RECOGNITION SYSTEM USING VOXEL REPRESENTATION
    QIAN, K
    LU, X
    BHATTACHARYA, P
    PATTERN RECOGNITION LETTERS, 1992, 13 (10) : 725 - 733
  • [3] Flexible object models for category-level 3D object recognition
    Kushal, Akash
    Schmid, Cordelia
    Ponce, Jean
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1370 - +
  • [4] Good Appearance and 3D Shape Descriptors for Object Category Recognition
    Proenca, Pedro F.
    Gaspar, Filipe
    Dias, Miguel Sales
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2015, 24 (04)
  • [5] Category Level 3D Object Recognition using Depth Images
    Kayim, Guney
    Akgul, Ceyhun Burak
    Sankur, Bulent
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [6] Voxel Transformer for 3D Object Detection
    Mao, Jiageng
    Xue, Yujing
    Niu, Minzhe
    Bai, Haoyue
    Feng, Jiashi
    Liang, Xiaodan
    Xu, Hang
    Xu, Chunjing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3144 - 3153
  • [7] 3D object recognition in TOF data sets
    Hess, H
    Albrecht, M
    Grothof, M
    Hussmann, S
    Oikonomidis, N
    Schwarte, R
    LASER RADAR TECHNOLOGY AND APPLICATIONS VIII, 2003, 5086 : 221 - 228
  • [8] Data-Driven 3D Neck Modeling and Animation
    Liu, Yilong
    Zheng, Chengwei
    Xu, Feng
    Tong, Xin
    Guo, Baining
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (07) : 3226 - 3237
  • [9] A Survey on Data-Driven 3D Shape Descriptors
    Rostami, R.
    Bashiri, F. S.
    Rostami, B.
    Yu, Z.
    COMPUTER GRAPHICS FORUM, 2019, 38 (01) : 356 - 393
  • [10] Data-driven optimization of 3D battery design
    Miyamoto, Kaito
    Broderick, Scott R.
    Rajan, Krishna
    JOURNAL OF POWER SOURCES, 2022, 536