High-precision skeleton-based human repetitive action counting

被引：2

作者：

Li, Chengxian ^{[1
]}

Shao, Ming ^{[2
]}

Yang, Qirui ^{[1
]}

Xia, Siyu ^{[1
]}

机构：

[1] Southeast Univ, Sch Automat, Nanjing, Peoples R China

[2] Univ Massachusetts, Dept Comp & Informat Sci, Dartmouth, MA USA

来源：

IET COMPUTER VISION | 2023年 / 17卷 / 06期

关键词：

computer vision; convolutional neural nets; MOTION;

D O I：

10.1049/cvi2.12193

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A novel counting model is presented by the authors to estimate the number of repetitive actions in temporal 3D skeleton data. As per the authors' knowledge, this is the first work of this kind using skeleton data for high-precision repetitive action counting. Different from existing works on RGB video data, the authors' model follows a bottom-up pipeline to clip the sub-action first followed by robust aggregation in inference. First, novel counting loss functions and robust inference with backtracking is proposed to pursue precise per-frame count as well as overall count with boundary frames. Second, an efficient synthetic approach is proposed to augment skeleton data in training and thus avoid time-consuming repetitive action data collection work. Finally, a challenging human repetitive action counting dataset named VSRep is collected with various types of action to evaluate the proposed model. Experiments demonstrate that the proposed counting model outperforms existing video-based methods by a large margin in terms of accuracy in real-time inference.

引用

页码：700 / 709

页数：10

共 33 条

[1]

Azy O, 2008, INT C PATT RECOG, P5

[2] Extraction and analysis of multiple periodic motions in video sequences [J].

Briassouli, Alexia ;

Ahuja, Narendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (07) :1244-1261

[3] Robust real-time periodic motion detection, analysis, and applications [J].

Cutler, R ;

Davis, LS .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :781-796

[4] MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction [J].

Dang, Lingwei ;

Nie, Yongwei ;

Long, Chengjiang ;

Zhang, Qing ;

Li, Guiqing .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :11447-11456

[5]

Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714

[6] Counting Out Time: Class Agnostic Video Repetition Counting in the Wild [J].

Dwibedi, Debidatta ;

Aytar, Yusuf ;

Tompson, Jonathan ;

Sermanet, Pierre ;

Zisserman, Andrew .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10384-10393

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8]

Hu Huazhang, 2022, P IEEECVF C COMPUTER, P19013

[9] VISUAL-PERCEPTION OF BIOLOGICAL MOTION AND A MODEL FOR ITS ANALYSIS [J].

JOHANSSON, G .

PERCEPTION & PSYCHOPHYSICS, 1973, 14 (02) :201-211

[10]

Josyula R., 2021, ARXIV PREPRINT ARXIV

← 1 2 3 4 →