Real-Time Musical Conducting Gesture Recognition Based on a Dynamic Time Warping Classifier Using a Single-Depth Camera

被引:18
作者
Chin-Shyurng, Fahn [1 ]
Lee, Shih-En [1 ]
Wu, Meng-Luen [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 10607, Taiwan
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 03期
关键词
human-computer interaction; dynamic gesture recognition; depth camera; palm tracking; dynamic time warping; musical gesture; musical conductor; KINECT; DTW;
D O I
10.3390/app9030528
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Gesture recognition is a human-computer interaction method, which is widely used for educational, medical, and entertainment purposes. Humans also use gestures to communicate with each other, and musical conducting uses gestures in this way. In musical conducting, conductors wave their hands to control the speed and strength of the music played. However, beginners may have a limited comprehension of the gestures and might not be able to properly follow the ensembles. Therefore, this paper proposes a real-time musical conducting gesture recognition system to help music players improve their performance. We used a single-depth camera to capture image inputs and establish a real-time dynamic gesture recognition system. The Kinect software development kit created a skeleton model by capturing the palm position. Different palm gestures were collected to develop training templates for musical conducting. The dynamic time warping algorithm was applied to recognize the different conducting gestures at various conducting speeds, thereby achieving real-time dynamic musical conducting gesture recognition. In the experiment, we used 5600 examples of three basic types of musical conducting gestures, including seven capturing angles and five performing speeds for evaluation. The experimental result showed that the average accuracy was 89.17% in 30 frames per second.
引用
收藏
页数:17
相关论文
共 35 条
  • [1] Ahmed W, 2016, PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), P120, DOI 10.1109/INFOSCI.2016.7845312
  • [2] [Anonymous], THESIS
  • [3] [Anonymous], 2013, GUIDE COMPUTING EXPR
  • [4] Tracking Conductors Hand Movements Using Multiple Wiimotes
    Bradshaw, David
    Ng, Kia
    [J]. FOURTH INTERNATIONAL CONFERENCE ON AUTOMATED SOLUTIONS FOR CROSS MEDIA CONTENT AND MULTI-CHANNEL DISTRIBUTION, PROCEEDINGS, 2008, : 93 - +
  • [5] Cavalieri Lorenzo, 2016, Human-Computer Interaction. Interaction Platforms and Techniques. 18th International Conference, HCI International 2016. Proceedings: LNCS 9732, P25, DOI 10.1007/978-3-319-39516-6_3
  • [6] Chavez Hector, 2013, 2013 IEEE Power & Energy Society General Meeting, DOI 10.1109/PESMG.2013.6672259
  • [7] Chen S, 2014, IEEE INT FUZZY SYST, P1287, DOI 10.1109/FUZZ-IEEE.2014.6891811
  • [8] Costagliola G, 2006, IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING, PROCEEDINGS, P103
  • [9] Mood Conductor: Emotion-Driven Interactive Music Performance
    Fazekas, Gyoergy
    Barthet, Mathieu
    Sandler, Mark B.
    [J]. 2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 726 - 726
  • [10] Music Teacher Knowledge: An Examination of the Intersections Between Instrumental Music Teaching and Conducting
    Forrester, Sommer H.
    [J]. JOURNAL OF RESEARCH IN MUSIC EDUCATION, 2018, 65 (04) : 461 - 482