Multi-view Deep Learning for Image-based Pose Recovery

被引:0
作者
Hong, Chaoqun [1 ]
Yu, Jun [2 ]
Xie, Yong [1 ]
Chen, Xuhui [1 ]
机构
[1] Xiamen Univ Technol, Xiamen 361024, Peoples R China
[2] Hangzhou Dianzi Univ, Hangzhou 310018, Zhejiang, Peoples R China
来源
2015 IEEE 16TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT) | 2015年
关键词
Pose recovery; Multi-view fusion; Manifold alignment; Deep learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
lmage-based human pose recovery is usually conducted by retrieving relevant poses with image features. However, semantic gap exists for current feature extractors, which limits recovery performance. In this paper, we propose a novel method to recover 3D human poses from silhouettes. It is based on multiple feature fusion and deep learning. First, to fuse different types of features, we introduce manifold alignment with hypergraph Laplacian. Hypergraph Laplacian matrix is constructed with patch alignment framework. Second, multi-view description is applied to deep neural networks. In this way, the non-linear mapping from 2D images to 3D poses is learned and pose recovery can be achieved. Experimental results on the widely-used Human3.6m dataset show that the recovery error has been reduced by 10% to 20%, which demonstrates the effectiveness of the proposed method.
引用
收藏
页码:897 / 902
页数:6
相关论文
共 21 条
[1]   Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[3]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[4]   Twin Gaussian Processes for Structured Prediction [J].
Bo, Liefeng ;
Sminchisescu, Cristian .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 87 (1-2) :28-52
[5]  
Brand M., 1999, ICCV, V2, P1237, DOI DOI 10.1109/ICCV.1999.790422
[6]   3D human pose recovery from image by efficient visual feature selection [J].
Chen, Cheng ;
Yang, Yi ;
Nie, Feiping ;
Odobez, Jean-Marc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (03) :290-299
[7]  
Cheng B, 2011, IEEE I CONF COMP VIS, P2439, DOI 10.1109/ICCV.2011.6126528
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval [J].
Hong, Chaoqun ;
Yu, Jun ;
Tao, Dacheng ;
Wang, Meng .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (06) :3742-3751
[10]   Multi-view hypergraph learning by patch alignment framework [J].
Hong, Chaoqun ;
Yu, Jun ;
Li, Jonathan ;
Chen, Xuhui .
NEUROCOMPUTING, 2013, 118 :79-86