Multi-Modal Deep Learning-Based Violin Bowing Action Recognition

被引:3
作者
Liu, Bao-Yun [1 ]
Jen, Yi-Hsin [2 ,3 ]
Sun, Shih-Wei [4 ]
Su, Li [2 ]
Chang, Pao-Chi [1 ]
机构
[1] Natl Cent Univ, Dept Commun Engn, Taoyuan, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[4] Taipei Natl Univ Arts, Dept New Media Art, Taipei, Taiwan
来源
2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN) | 2020年
关键词
D O I
10.1109/icce-taiwan49838.2020.9257995
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a deep learning-based violin action recognition is proposed. By fusing the sensing signals from depth camera modality and inertial sensor modalities, violin bowing actions can be recognized by the proposed deep learning scheme. The actions performed by a violinist are captured by a depth camera, and recorded by wearable sensors on the forearm of a violinist. In the proposed system, 3D convolution neural network (3D-CNN) and long short-term memory (LSTM) deep learning algorithms are adopted to generate the action models from depth camera modality and inertial sensor modalities. The features and models obtained from multi-modalities are used to classify different violin bowing actions. A fusion process from different modalities can achieve satisfactory recognition accuracy. In this paper, we generate a violin bowing actions dataset for the preliminary study and the system performance evaluation.
引用
收藏
页数:2
相关论文
共 4 条
[1]  
Chen, IEEE ICIP 2015
[2]   Bowing Gestures Classification in Violin Performance: A Machine Learning Approach [J].
Dalmazzo, David ;
Ramirez, Rafael .
FRONTIERS IN PSYCHOLOGY, 2019, 10
[3]   3D Convolutional Neural Networks for Human Action Recognition [J].
Ji, Shuiwang ;
Xu, Wei ;
Yang, Ming ;
Yu, Kai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :221-231
[4]  
Li Y., 2017, ARXIV170307475CSCV