DEEP REGRESSION FOREST WITH SOFT-ATTENTION FOR HEAD POSE ESTIMATION

Times cited: 0
Authors
Ma, Xiangtian [1]
Sang, Nan [1]
Wang, Xupeng [1]
Xiao, Shihua [1]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, 4,Sect 2,North Jianshe Rd, Chengdu, Sichuan, Peoples R China
Source
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020
Keywords
head pose estimation; point cloud; multi-task learning; deep regression forest; soft attention;
DOI
10.1109/icip40778.2020.9191082
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
Head pose estimation from a single depth image is challenging because of large pose variations, occlusions, and an inhomogeneous facial feature space. To address the problem, we propose a Deep Regression Forest with Soft-Attention (SA-DRF) in a multi-task learning setup. It can be integrated with a general feature learning network and trained jointly in an end-to-end manner. The soft-attention module learns soft masks from the general features and feeds the forest with task-specific features to regress head poses. Experiments on the Biwi Head Pose and Pandora datasets demonstrate superior performance compared with current state-of-the-art methods.
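The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it describes: a soft mask is computed from shared features, the masked (task-specific) features drive a softly routed regression tree, and one such branch exists per pose angle. The mask form (a sigmoid of a linear map), the depth-2 tree, the leaf values, and all names (`soft_mask`, `soft_tree_predict`, `predict_pose`, `make_task_params`) are assumptions made for illustration and are not taken from the paper.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def soft_mask(features, weights, bias):
    """Assumed mask form: m = sigmoid(W f + b), one value in (0, 1) per feature."""
    return [
        sigmoid(sum(w * f for w, f in zip(row, features)) + b)
        for row, b in zip(weights, bias)
    ]


def apply_mask(features, mask):
    """Task-specific features: element-wise product of mask and shared features."""
    return [m * f for m, f in zip(mask, features)]


def soft_tree_predict(task_features, leaf_values):
    """Depth-2 soft tree (3 split nodes, 4 leaves).

    Split node i routes left with probability sigmoid(task_features[i]);
    the prediction is the routing-probability-weighted mean of the leaf values.
    """
    d0, d1, d2 = (sigmoid(task_features[i]) for i in range(3))
    leaf_probs = [d0 * d1, d0 * (1 - d1), (1 - d0) * d2, (1 - d0) * (1 - d2)]
    prediction = sum(p * v for p, v in zip(leaf_probs, leaf_values))
    return prediction, leaf_probs


def make_task_params(dim, rng):
    """Hypothetical random parameters; in practice these would be learned."""
    return {
        "weights": [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)],
        "bias": [0.0] * dim,
        "leaf_values": [rng.uniform(-90, 90) for _ in range(4)],  # degrees
    }


def predict_pose(features, params_by_task):
    """One attention branch and one soft tree per pose angle (multi-task setup)."""
    pose = {}
    for task, p in params_by_task.items():
        mask = soft_mask(features, p["weights"], p["bias"])
        task_features = apply_mask(features, mask)
        pose[task], _ = soft_tree_predict(task_features, p["leaf_values"])
    return pose


rng = random.Random(0)
shared = [0.5, -1.2, 0.3, 2.0]  # stand-in for features from a backbone network
params = {t: make_task_params(len(shared), rng) for t in ("yaw", "pitch", "roll")}
pose = predict_pose(shared, params)
print(pose)
```

Because every operation here is differentiable, a real system could train the mask parameters and the feature network jointly by gradient descent, which is consistent with the end-to-end training the abstract mentions.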
Pages: 2840-2844
Number of pages: 5
Related papers
18 in total
[1]  
[Anonymous], 2016, P 25 INT JOINT C ART
[2]  
Borghi G., 2018, IEEE T PATTERN ANAL, P1
[3]   POSEidon: Face-from-Depth for Driver Pose Estimation [J].
Borghi, Guido ;
Venturelli, Marco ;
Vezzani, Roberto ;
Cucchiara, Rita .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5494-5503
[4]  
Breitenstein Michael D., 2008, 2008 IEEE COMP SOC C
[5]  
Chi CH, 2002, IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, P1, DOI 10.1109/ICME.2002.1035703
[6]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[7]  
Drouard V, 2015, IEEE IMAGE PROC, P4624, DOI 10.1109/ICIP.2015.7351683
[8]   Random Forests for Real Time 3D Face Analysis [J].
Fanelli, Gabriele ;
Dantone, Matthias ;
Gall, Juergen ;
Fossati, Andrea ;
Van Gool, Luc .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (03) :437-458
[9]   QuatNet: Quaternion-Based Head Pose Estimation With Multiregression Loss [J].
Hsu, Heng-Wei ;
Wu, Tung-Yu ;
Wan, Sheng ;
Wong, Wing Hung ;
Lee, Chen-Yi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) :1035-1046
[10]   High Performance Visual Tracking with Siamese Region Proposal Network [J].
Li, Bo ;
Yan, Junjie ;
Wu, Wei ;
Zhu, Zheng ;
Hu, Xiaolin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8971-8980