Automated Violin Bowing Gesture Recognition Using FMCW-Radar and Machine Learning

被引:3
作者
Gao, Hannah [1 ]
Li, Changzhi [2 ]
机构
[1] Harriton High Sch, Rosemont, PA 19010 USA
[2] Texas Tech Univ, Dept Elect & Comp Engn, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Feature extraction; Sensors; Radar; Spectrogram; Time-frequency analysis; Machine learning; Instruments; Bowing gestures; feature extraction; frequency-modulated-continuous-wave (FMCW) radar; machine learning; time-Doppler; violin; ARCHITECTURE; DOPPLER;
D O I
10.1109/JSEN.2023.3263513
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A key step in learning the violin is mastering control over various bowing techniques since the drawing of the violin bow directly influences the sound quality produced. As it is important for violinists to receive frequent feedback on their bowing motions, there is a need for digital means of providing automated feedback to musicians. This study uses a 60-GHz frequency-modulated-continuous-wave (FMCW) radar to gather data on the violinist's bowing arm for a total of seven bowing gestures: detache up, detache down, spiccato up, spiccato down, staccato up, staccato down, and tremolo. A total of 1200 bowing gestures for three different violinists are recorded using radar. The raw signal data from the radar is processed to generate time-Doppler spectrograms of the gestures. Features are extracted from the time-Doppler data using two different methods and fed into machine-learning models for the automated classification of bowing gestures. The first method involves manually engineering features to be extracted from the signal data matrix. The second method leverages the power of convolutional neural networks (CNNs) to automatically extract features from images of the time-Doppler spectrograms. Comparing model performances reveals that fine-tuning a pretrained SqueezeNet CNN model yields the highest classification accuracy (95.00%). This study also analyzes the influence of fluctuations in the overall user-to-radar range on the time-Doppler spectrograms produced.
引用
收藏
页码:9262 / 9270
页数:9
相关论文
共 34 条
[1]   Audio classification of violin bowing techniques: An aid for beginners [J].
Alar, Hernan S. ;
Mamaril, Ramil O. ;
Villegas, Lex P. ;
Cabarrubias, Jhon Roe D. .
MACHINE LEARNING WITH APPLICATIONS, 2021, 4
[2]  
[Anonymous], 2007, PROC AXMEDIS
[3]  
[Anonymous], 2008, Proceedings of the International Conference on New Interfaces for Musical Expression
[4]   Robust Cardiac Rate Estimation of an Individual [J].
Choi, Inoh ;
Kim, Min ;
Choi, Jaeho ;
Park, Jeongki ;
Park, Sanghong ;
Kim, Kyungtae .
IEEE SENSORS JOURNAL, 2021, 21 (13) :15053-15064
[5]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[6]   Visual perception of expressiveness in musicians' body movements [J].
Dahl, Sofia ;
Friberg, Anders .
MUSIC PERCEPTION, 2007, 24 (05) :433-454
[7]   Bowing Gestures Classification in Violin Performance: A Machine Learning Approach [J].
Dalmazzo, David ;
Ramirez, Rafael .
FRONTIERS IN PSYCHOLOGY, 2019, 10
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   Inattentive Driving Behavior Detection Based on Portable FMCW Radar [J].
Ding, Chuanwei ;
Chae, Rachel ;
Wang, Jing ;
Zhang, Li ;
Hong, Hong ;
Zhu, Xiaohua ;
Li, Changzhi .
IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2019, 67 (10) :4031-4041
[10]   Gesture Recognition System Using 24 GHz FMCW Radar Sensor Realized on Real-Time Edge Computing Platform [J].
Gan, Liangyu ;
Liu, Yuan ;
Li, Yanzhong ;
Zhang, Runxi ;
Huang, Leilei ;
Shi, Chunqi .
IEEE SENSORS JOURNAL, 2022, 22 (09) :8904-8914