Joint graph convolution networks and transformer for human pose estimation in sports technique analysis

Cited by: 6
Authors
Cheng, Hongren [1, 2]
Wang, Jing [3]
Zhao, Anran [4]
Zhong, Yaping [1, 2]
Li, Jingli [5]
Dong, Liangshan [6]
Affiliations
[1] Wuhan Sports Univ, Sports Big Data Res Ctr, Wuhan 430079, Peoples R China
[2] Hubei Prov Sports & Hlth Innovat Dev Res Ctr, Wuhan 430079, Hubei, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Sch Automat, Chongqing 400065, Peoples R China
[4] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[5] Huazhong Univ Sci & Technol, Sch Phys Educ, Wuhan 430074, Peoples R China
[6] China Univ Geosci, Sch Phys Educ, Wuhan 430074, Peoples R China
Keywords
Human pose estimation; Graph convolutional network; Transformer; The topological structure between; IMAGE STEGANOGRAPHY METHOD;
DOI
10.1016/j.jksuci.2023.101819
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Human pose estimation has applications in domains such as sports technique analysis, virtual reality, and education. However, most previous studies focus on the feature representation of each keypoint individually and disregard the topological relationships among keypoints. To address this limitation, we propose GTPose, a network that integrates graph convolutional networks (GCN) and the Transformer. First, a set of multi-scale convolution operations is applied to extract local feature maps from the image. Second, a Transformer processes the sequential relations between the feature maps to produce coarse estimates of the keypoint positions. Finally, a GCN models the topological structure between keypoints to localize them accurately and to learn their feature representations. GTPose is evaluated on two real-world datasets, MS COCO and MPII. Experimental results show that GTPose outperforms other methods on human pose estimation tasks, and that the spatial relationships between keypoints are effective for accurately characterizing keypoints.
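The final stage described in the abstract, in which a GCN refines coarse keypoint estimates using the skeleton topology, can be illustrated with a minimal sketch. The abstract does not give the GTPose formulation, so everything here is an assumption: the propagation rule is the commonly used symmetric normalization ReLU(D^{-1/2}(A + I)D^{-1/2} X W), the five-keypoint skeleton in `EDGES` is hypothetical (it is neither the MS COCO nor the MPII layout), and the coarse coordinates and identity weight matrix are placeholder values.

```python
import math

# Hypothetical 5-keypoint skeleton, for illustration only:
# 0 head, 1 neck, 2 left shoulder, 3 right shoulder, 4 hip
EDGES = [(0, 1), (1, 2), (1, 3), (1, 4)]
NUM_KEYPOINTS = 5


def normalized_adjacency(num_nodes, edges):
    """Return D^{-1/2} (A + I) D^{-1/2} as a nested list."""
    a = [[0.0] * num_nodes for _ in range(num_nodes)]
    for i in range(num_nodes):
        a[i][i] = 1.0  # self-loop so each keypoint keeps its own features
    for i, j in edges:
        a[i][j] = 1.0  # undirected edge between connected keypoints
        a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [
        [a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_nodes)]
        for i in range(num_nodes)
    ]


def matmul(x, y):
    """Plain matrix product of two nested lists."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


def gcn_layer(a_hat, features, weight):
    """One graph convolution: ReLU(a_hat @ features @ weight)."""
    out = matmul(matmul(a_hat, features), weight)
    return [[max(0.0, v) for v in row] for row in out]


# Placeholder coarse (x, y) estimates per keypoint, standing in for the
# output of an earlier stage; the values are made up.
coarse = [[0.5, 0.1], [0.5, 0.3], [0.3, 0.35], [0.7, 0.35], [0.5, 0.7]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # assumed weight; a trained layer learns this

a_hat = normalized_adjacency(NUM_KEYPOINTS, EDGES)
refined = gcn_layer(a_hat, coarse, identity)
```

Each row of `refined` mixes a keypoint's own features with those of its skeleton neighbours, which is the sense in which a graph convolution exposes topological relationships that per-keypoint features alone do not carry.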
Pages: 8