Posture and sequence recognition for Bharatanatyam dance performances using machine learning approaches

被引:15
作者
Mallick, Tanwi [1 ]
Das, Partha Pratim [2 ]
Majumdar, Arun Kumar [3 ]
机构
[1] Argonne Natl Lab, 9700 S Cass Ave, Lemont, IL 60439 USA
[2] Indian Inst Technol Kharagpur, Kharagpur 721302, West Bengal, India
[3] JIS Inst Adv Studies & Res, Kolkata 700091, West Bengal, India
关键词
Posturerecognition; Sequencerecognition; Dancesegmentation; Multi-modaldancemodeling; Machinelearning; BharatanatyamDanceanalysis; INDIAN CLASSICAL DANCE; HIDDEN MARKOV-MODELS;
D O I
10.1016/j.jvcir.2022.103548
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the underlying semantics of performing arts like dance is a challenging task. Analysis of dance is useful to preserve cultural heritage, make video recommendation systems, and build tutoring systems. To create such a dance analysis application, three aspects of dance analysis must be addressed: (1) segment the dance video to find representative action elements, (2) recognize the detected action elements, and (3) recognize sequences formed by combining action elements according to specific rules. This paper attempts to address the three fundamental problems of dance analysis raised above, with a focus on Indian Classical Dance, em Bharatanatyam. Since dance is driven by music, we use both musical and motion information to extract action elements. The action elements are then recognized using machine learning and deep learning techniques. Finally, the Hidden Markov Model (HMM) and Long Short-Term Memory (LSTM) are used to recognize the dance sequence.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
Aich A., 2018, NCVPRIPG 2017, P481, DOI [10.1007/978-981-13-0020-2_42, DOI 10.1007/978-981-13-0020-242]
[2]  
Aich A., 2018, COMPUT VIS PATTERN R, V841
[3]   Quaternionic Signal Processing Techniques for Automatic Evaluation of Dance Performances From MoCap Data [J].
Alexiadis, Dimitrios S. ;
Daras, Petros .
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (05) :1391-1406
[4]  
Andriluka M, 2009, PROC CVPR IEEE, P1014, DOI 10.1109/CVPRW.2009.5206754
[5]  
CAMPBELL LW, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P624, DOI 10.1109/ICCV.1995.466880
[6]   Human action recognition using two-stream attention based LSTM networks [J].
Dai, Cheng ;
Liu, Xingang ;
Lai, Jinfeng .
APPLIED SOFT COMPUTING, 2020, 86
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]   Human Pose Estimation using Body Parts Dependent Joint Regressors [J].
Dantone, Matthias ;
Gall, Juergen ;
Leistner, Christian ;
Van Gool, Luc .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3041-3048
[9]  
Guo F, 2006, PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, P481
[10]   Tensor Manifold Discriminant Projections for Acceleration-Based Human Activity Recognition [J].
Guo, Yanan ;
Tao, Dapeng ;
Cheng, Jun ;
Dougherty, Alan ;
Li, Yaotang ;
Yue, Kun ;
Zhang, Bob .
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (10) :1977-1987