Dual-referenced assistive network for action quality assessment

Cited by: 0
Authors
Huang, Keyi [1]
Tian, Yi [1]
Yu, Chen [1]
Huang, Yaping [1]
Affiliation
[1] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Beijing Key Lab Traff Data Anal & Min, Beijing 100044, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action quality assessment; Human action understanding; VIDEO;
DOI
10.1016/j.neucom.2024.128786
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Action quality assessment (AQA) aims to evaluate how well a specific action is performed. It is a challenging task because it requires identifying subtle differences between videos that contain the same action. Most existing AQA methods directly adopt a pretrained network designed for other tasks to extract video features, which are too coarse to describe the fine-grained details of action quality. In this paper, we propose a novel Dual-Referenced Assistive (DuRA) network to refine the original coarse-grained features into fine-grained, quality-oriented representations. Specifically, we introduce two levels of referenced assistants to highlight discriminative quality-related content by comparing a target video with referenced objects, instead of directly estimating the quality score from an individual video. First, we design a Rating-guided Attention module, which uses a series of semantic-level referenced assistants to acquire implicit hierarchical semantic knowledge and progressively emphasize quality-focused features embedded in the original information. Second, we design a couple of Consistency Preserving constraints, which introduce a set of individual-level referenced assistants to further eliminate score-unrelated information through more detailed comparisons of the differences between actions. Experiments show that the proposed method achieves promising performance on the AQA-7 and MTL-AQA datasets.
Pages: 10