3D Pose Regression using Convolutional Neural Networks

被引:80
作者
Mahendran, Siddharth [1 ]
Ali, Haider [1 ]
Vidal, Rene [1 ]
机构
[1] Johns Hopkins Univ, Ctr Imaging Sci, Baltimore, MD 21218 USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017) | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCVW.2017.254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D pose estimation is a key component of many important computer vision tasks such as autonomous navigation and 3D scene understanding. Most state-of-the-art approaches to 3D pose estimation solve this problem as a pose-classification problem in which the pose space is discretized into bins and a CNN classifier is used to predict a pose bin. We argue that the 3D pose space is continuous and propose to solve the pose estimation problem in a CNN regression framework with a suitable representation, data augmentation and loss function that captures the geometry of the pose space. Experiments on PASCAL3D+ show that the proposed 3D pose regression approach achieves competitive performance compared to the state-of-the-art.
引用
收藏
页码:2174 / 2182
页数:9
相关论文
共 21 条
  • [1] [Anonymous], IEEE C COMP VIS PATT
  • [2] [Anonymous], IEEE C COMP VIS PATT
  • [3] [Anonymous], IEEE C COMP VIS PATT
  • [4] [Anonymous], 2017, IEEE INT C ROB AUT
  • [5] [Anonymous], 2014, P BRIT MACH VIS C 20
  • [6] [Anonymous], 2015, TENSORFLOW LARGE SCA
  • [7] [Anonymous], 2015, P IEEE INT C COMP VI
  • [8] [Anonymous], INT C LEARN REPR
  • [9] [Anonymous], Construction and Analysis of a Large Scale Image Ontology
  • [10] Chollet Francois., 2015, Keras