Gaussian guided frame sequence encoder network for action quality assessment

被引:3
作者
Li, Ming-Zhe [1 ]
Zhang, Hong-Bo [1 ]
Dong, Li-Jia [1 ]
Lei, Qing [2 ]
Du, Ji-Xiang [3 ]
机构
[1] Huaqiao Univ, Sch Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
基金
中国国家自然科学基金;
关键词
Action quality assessment; Frame sequence encoder network; Gaussian loss function; Regression analysis;
D O I
10.1007/s40747-022-00892-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Can a computer evaluate an athlete's performance automatically? Many action quality assessment (AQA) methods have been proposed in recent years. Limited by the randomness of video sampling and the simple strategy of model training, the performance of the existing AQA methods can still be further improved. To achieve this goal, a Gaussian guided frame sequence encoder network is proposed in this paper. In the proposed method, the image feature of each video frame is extracted by Resnet model. And then, a frame sequence encoder network is applied to model temporal information and generate action quality feature. Finally, a fully connected network is designed to predict action quality score. To train the proposed method effectively, inspired by the final score calculation rule in Olympic game, Gaussian loss function is employed to compute the error between the predicted score and the label score. The proposed method is implemented on the AQA-7 and MTL-AQA datasets. The experimental results confirm that compared with the state-of-the-art methods, our proposed method achieves the better performance. And detailed ablation experiments are conducted to verify the effectiveness of each component in the module.
引用
收藏
页码:1963 / 1974
页数:12
相关论文
共 47 条
  • [41] GYMetricPose: A light-weight angle-based graph adaptation for action quality assessment
    Gallardo, Ulises
    Caro, Fernando
    Hernandez, Eluney
    Espinosa, Ricardo
    Ochoa-Ruiz, Gilberto
    2024 IEEE 37TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS 2024, 2024, : 43 - 50
  • [42] I3D-AE-LSTM: Combining action representations using a 2-stream autoencoder for Action Quality Assessment
    Moodley, Tevin
    van der Haar, Dustin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 278
  • [43] Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos
    Zeng, Ling-An
    Hong, Fa-Ting
    Zheng, Wei-Shi
    Yu, Qi-Zhi
    Zeng, Wei
    Wang, Yao-Wei
    Lai, Jian-Huang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2526 - 2534
  • [44] Label-reconstruction-based pseudo-subscore learning for action quality assessment in sporting events
    Hong-Bo Zhang
    Li-Jia Dong
    Qing Lei
    Li-Jie Yang
    Ji-Xiang Du
    Applied Intelligence, 2023, 53 : 10053 - 10067
  • [45] Label-reconstruction-based pseudo-subscore learning for action quality assessment in sporting events
    Zhang, Hong-Bo
    Dong, Li-Jia
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10053 - 10067
  • [46] Learning Effective Skeletal Representations on RGB Video for Fine-Grained Human Action Quality Assessment
    Lei, Qing
    Zhang, Hong-Bo
    Du, Ji-Xiang
    Hsiao, Tsung-Chih
    Chen, Chih-Cheng
    ELECTRONICS, 2020, 9 (04)
  • [47] Skeleton-based deep pose feature learning for action quality assessment on figure skating videos
    Li, Huiying
    Lei, Qing
    Zhang, Hongbo
    Du, Jixiang
    Gao, Shangce
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89