Real-Time Recognition of Percussive Sounds by a Model-Based Method

被引:5
作者
Simsekli, Umut [2 ]
Jylha, Antti [1 ]
Erkut, Cumhur [1 ]
Cemgil, Taylan [2 ]
机构
[1] Aalto Univ, Sch Sci & Technol, Dept Signal Proc & Acoust, Aalto 00076, Finland
[2] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2011年
基金
芬兰科学院;
关键词
Markov Model; Classification Accuracy; Hide Markov Model; Expectation Maximization; Event Detection;
D O I
10.1155/2011/291860
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Interactive musical systems require real-time, low-latency, accurate, and reliable event detection and classification algorithms. In this paper, we introduce a model-based algorithm for detection of percussive events and test the algorithm on the detection and classification of different percussive sounds. We focus on tuning the algorithm for a good compromise between temporal precision, classification accuracy and low latency. The model is trained offline on different percussive sounds using the expectation maximization approach for learning spectral templates for each sound and is able to run online to detect and classify sounds from audio stream input by a Hidden Markov Model. Our results indicate that the approach is promising and applicable in design and development of interactive musical systems.
引用
收藏
页数:14
相关论文
共 24 条
  • [1] [Anonymous], 2004, ADAPTIVE COMPUTATION
  • [2] Ball J, 2007, PROCEEDINGS OF THE 10TH CONGRESS OF THE INTERNATIONAL FEDERATION OF SOCIETIES FOR SURGERY OF THE HAND & 7TH CONGRESS OF THE INTERNATIONAL FEDERATION OF SOCIETIES FOR HAND THERAPY, P1
  • [3] CAPPE O, 2005, SPR S STAT, P1
  • [4] Cemgil Ali Taylan, 2009, Comput Intell Neurosci, P785152, DOI 10.1155/2009/785152
  • [5] FitzGerald D., 2006, SIGNAL PROCESSING ME, V1st
  • [6] GILLET O, 2003, P INT C MUS INF RETR
  • [7] Herrera P., 2003, P 114 CONV AUD ENG S, P1
  • [8] Jylha A., 2009, CHI'09 Extended Abstracts on Human Factors in Computing Systems, P3175, DOI [https://doi.org/10.1145/1520340.1520452, DOI 10.1145/1520340.1520452]
  • [9] Jylha A., 2008, Proceedings of the Digital Audio Effects Workshop, P301
  • [10] JYLHA A, 2009, P AUD MOSTL, P69