Learning and fusing multiple hidden substages for action quality assessment

被引:20
作者
Dong, Li-Jia [1 ]
Zhang, Hong-Bo [1 ]
Shi, Qinghongya [2 ]
Lei, Qing [3 ]
Du, Ji-Xiang [2 ]
Gao, Shangce [4 ]
机构
[1] Huaqiao Univ, Sch Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
[4] Univ Toyama, Fac Engn, Toyama 9308555, Japan
关键词
Action quality assessment; Regression network; Hidden substage; Objective evaluation rule; Field experience;
D O I
10.1016/j.knosys.2021.107388
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many of the existing methods for action quality assessment implement single-stage score regression networks that lack pertinence and rationality for the evaluation task. In this work, our target is to find a reasonable action quality assessment method for sports competitions that conforms to objective evaluation rules and field experience. To achieve this goal, three assessment scenarios, i.e., the overall-score-guided scenario, execution-score-guided scenario, and difficulty-level-based overall-score-guided scenario, are defined. A learning and fusion network of multiple hidden substages is proposed to assess athletic performance by segmenting videos into five substages by a temporal semantic segmentation. The feature of each video segment is extracted from the five feature backbone networks with shared weights, and a fully-connected-network-based hidden regression model is built to predict the score of each substage, fusing these scores into the overall score. We evaluate the proposed method on the UNLV-Diving dataset. The comparison results show that the proposed method based on objective evaluation rules of sports competitions outperforms the regression model directly trained on the overall score. The proposed multiple-substage network is more accurate than the single-stage score regression network and achieves state-of-the-art performance by leveraging objective evaluation rules and field experience that are beneficial for building an accurate and reasonable action quality assessment model. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 47 条
  • [1] Baptista Renato, 2017, 12 INT JOINT C COMP
  • [2] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
  • [3] "Keep me in, Coach!": a Computer Vision Perspective on Assessing ACL Injury Risk in Female Athletes
    Blanchard, Nathaniel
    Skinner, Kyle
    Kemp, Aden
    Scheirer, Walter J.
    Flynn, Patrick J.
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1366 - 1374
  • [4] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [5] The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos
    Doughty, Hazel
    Mayol-Cuevas, Walterio
    Damen, Dima
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7854 - 7863
  • [6] Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination
    Doughty, Hazel
    Damen, Dima
    Mayol-Cuevas, Walterio
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6057 - 6066
  • [7] Learning Spatiotemporal Features with 3D Convolutional Networks
    Du Tran
    Bourdev, Lubomir
    Fergus, Rob
    Torresani, Lorenzo
    Paluri, Manohar
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
  • [8] Automated robot-assisted surgical skill evaluation: Predictive analytics approach
    Fard, Mahtab J.
    Ameri, Sattar
    Ellis, R. Darin
    Chinnam, Ratna B.
    Pandya, Abhilash K.
    Klein, Michael D.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL ROBOTICS AND COMPUTER ASSISTED SURGERY, 2018, 14 (01)
  • [9] Evaluating Surgical Skills from Kinematic Data Using Convolutional Neural Networks
    Fawaz, Hassan Ismail
    Forestier, Germain
    Weber, Jonathan
    Idoumghar, Lhassane
    Muller, Pierre-Alain
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT IV, 2018, 11073 : 214 - 221
  • [10] Federation Internationale de Natation, 2006, RUL REG FINA DIV RUL