Asymmetry-aware bilinear pooling in multi-modal data for head pose estimation

被引:4
作者
Chen, Jiazhong [1 ]
Li, Qingqing [1 ]
Ren, Dakai [2 ]
Cao, Hua [1 ]
Ling, Hefei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Head pose estimation; Asymmetry-aware; Bilinear pooling; ATTENTION; REPRESENTATION; NETWORK;
D O I
10.1016/j.image.2022.116895
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The head pose on roll and yaw directions is decided by the asymmetric appearance in human faces, and the contextual information of asymmetric appearance is encoded in a head pose related neighborhood. However, CNNs used in existing head pose estimation methods often evenly performs on the features of full image. Thus it is hard to collect the contextual information of such asymmetric appearance by those methods. To address this issue, this paper proposes a novel head pose estimation method that could perceive the asymmetric appearance in human faces. Specifically, the awareness of such asymmetry is undertaken by the local pairwise feature interaction in head pose related neighborhood via bilinear pooling. Evaluations on two public datasets demonstrate that our method could achieve promising results.
引用
收藏
页数:10
相关论文
共 72 条
  • [1] Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network
    Ahn, Byungtae
    Park, Jaesik
    Kweon, In So
    [J]. COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 82 - 96
  • [2] [Anonymous], 2014, BMVC
  • [3] Baltrusaitis T, 2012, PROC CVPR IEEE, P2610, DOI 10.1109/CVPR.2012.6247980
  • [4] Bengio S, 2015, ADV NEUR IN, V28
  • [5] Biternion Nets: Continuous Head Pose Regression from Discrete Training Labels
    Beyer, Lucas
    Hermans, Alexander
    Leibe, Bastian
    [J]. PATTERN RECOGNITION, GCPR 2015, 2015, 9358 : 157 - 168
  • [6] POSEidon: Face-from-Depth for Driver Pose Estimation
    Borghi, Guido
    Venturelli, Marco
    Vezzani, Roberto
    Cucchiara, Rita
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5494 - 5503
  • [7] Breitenstein MD, 2008, PROC CVPR IEEE, P3613
  • [8] Carreira J, 2012, LECT NOTES COMPUT SC, V7578, P430, DOI 10.1007/978-3-642-33786-4_32
  • [9] ABD-Net: Attentive but Diverse Person Re-Identification
    Chen, Tianlong
    Ding, Shaojin
    Xie, Jingyi
    Yuan, Ye
    Chen, Wuyang
    Yang, Yang
    Ren, Zhou
    Wang, Zhangyang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8350 - 8360
  • [10] Active appearance models
    Cootes, TF
    Edwards, GJ
    Taylor, CJ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) : 681 - 685