Unconstrained head pose estimation based on bilateral attention

Cited by: 0
Authors
Zhang, Xiao [1]
Yan, Chunman [2, 3]
Affiliations
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730070, Peoples R China
[2] Northwest Normal Univ, Coll Phys, Lanzhou 730070, Peoples R China
[3] Northwest Normal Univ, Elect Engn Res Ctr Gansu Prov Intelligent Informat, Lanzhou 730070, Peoples R China
Keywords
Head pose estimation; Ghost module; Attention mechanism; Bilinear pooling; Multivariate loss function;
DOI
10.1007/s11760-025-03925-y
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
Head pose estimation is a critical research topic, and existing models still face significant challenges. First, common representations for head pose estimation exhibit discontinuities. Second, recognition rates are low in complex scenes, and models tend to have high parameter counts and substantial computational demands. To address these problems, this paper proposes an unconstrained head pose estimation model based on bilinear attention. It introduces a 6D rotation matrix for pose angle representation and a P-Ghost module that enhances the lightweight GhostNetV2 framework for feature extraction. A bilinear attention network integrates spatial and channel information, enabling the model to learn feature correlations, prioritize key channels, and suppress redundant ones. Multiple loss function strategies are also used to improve the model's accuracy. The proposed network is tested extensively on three datasets, and the experimental results show superior performance in head pose estimation.
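The abstract does not say how the 6D representation is mapped back to a rotation. As a minimal sketch, assuming the commonly used Gram-Schmidt construction for 6D rotation representations (the paper's actual mapping may differ), a 6-vector can be turned into a 3x3 rotation matrix as follows. The function name `rotation_from_6d` and its helpers are hypothetical and do not come from the paper.

```python
import math


def _normalize(v):
    """Scale a 3-vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    if n == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / n for x in v]


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return [
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    ]


def rotation_from_6d(x):
    """Map a 6-vector to a 3x3 rotation matrix (assumed Gram-Schmidt scheme).

    The first three values give the first column; the last three are
    orthogonalized against it to give the second column; the third column
    is their cross product, which keeps the determinant at +1.
    """
    a1, a2 = list(x[:3]), list(x[3:6])
    b1 = _normalize(a1)
    proj = _dot(b1, a2)
    b2 = _normalize([a2[i] - proj * b1[i] for i in range(3)])
    b3 = _cross(b1, b2)
    # Rows of the matrix; b1, b2, b3 are its columns.
    return [[b1[i], b2[i], b3[i]] for i in range(3)]
```

Because any two non-parallel 3-vectors map to a valid rotation under this construction, a network can regress the six values directly without the wrap-around discontinuities of angle-based outputs.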
Pages: 11
Related papers
31 in total
  • [1] Real-time 6DoF full-range markerless head pose estimation
    Algabri, Redhwan
    Shin, Hyunsoo
    Lee, Sungon
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 239
  • [2] Web-Shaped Model for Head Pose Estimation: An Approach for Best Exemplar Selection
    Barra, Paola
    Barra, Silvio
    Bisogni, Carmen
    De Marsico, Maria
    Nappi, Michele
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5457 - 5468
  • [3] How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)
    Bulat, Adrian
    Tzimiropoulos, Georgios
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 1021 - 1030
  • [4] A Vector-based Representation to Enhance Head Pose Estimation
    Cao, Zhiwen
    Chu, Zongcheng
    Liu, Dongfang
    Chen, Yingjie
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021 : 1187 - 1196
  • [5] Asymmetry-aware bilinear pooling in multi-modal data for head pose estimation
    Chen, Jiazhong
    Li, Qingqing
    Ren, Dakai
    Cao, Hua
    Ling, Hefei
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 110
  • [6] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023 : 12021 - 12031
  • [7] Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation
    Hempel, Thorsten
    Abdelrahman, Ahmed A.
    Al-Hamadi, Ayoub
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2377 - 2387
  • [8] 6D ROTATION REPRESENTATION FOR UNCONSTRAINED HEAD POSE ESTIMATION
    Hempel, Thorsten
    Abdelrahman, Ahmed A.
    Al-Hamadi, Ayoub
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022 : 2496 - 2500
  • [9] QuatNet: Quaternion-Based Head Pose Estimation With Multiregression Loss
    Hsu, Heng-Wei
    Wu, Tung-Yu
    Wan, Sheng
    Wong, Wing Hung
    Lee, Chen-Yi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) : 1035 - 1046
  • [10] Improving head pose estimation using two-stage ensembles with top-k regression
    Huang, Bin
    Chen, Renwen
    Xu, Wang
    Zhou, Qinbang
    [J]. IMAGE AND VISION COMPUTING, 2020, 93 (93)