Multidimensional humming transcription using a statistical approach for query by humming systems

被引:0
作者
Shih, HH [1 ]
Narayanan, SS [1 ]
Kuo, CCJ [1 ]
机构
[1] Univ So Calif, Integrated Media Syst Ctr, Los Angeles, CA 90089 USA
来源
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING | 2003年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new statistical pattern recognition approach applied to human humming transcription is proposed in this research. A music note has two important attributes, i.e. pitch and duration. The proposed algorithm generates multidimensional humming transcriptions, which contain both pitch and duration information. Query by humming provides a natural means for content-based retrieval from music databases, and this research provides a robust front-end for such an application. The segment of a note in the humming waveform is modeled by a hidden Markov model (HMM) while the pitch of the note is modeled by a pitch model using a Gaussian mixture model. Preliminary real-time recognition experiments are carried out with models trained by data obtained from eight human objects. and an overall correct recognition rate of around 80% is demonstrated.
引用
收藏
页码:541 / 544
页数:4
相关论文
共 10 条
[1]  
DUREY AS, 2001, INT S MUS INF RETR I, P109
[2]  
DUREY AS, 2002, ICASSP 2002
[3]  
GHIAS A, 1995, P ACM MULT C 95 SAN
[4]   PARALLEL PROCESSING TECHNIQUES FOR ESTIMATING PITCH PERIODS OF SPEECH IN TIME DOMAIN [J].
GOLD, B ;
RABINER, L .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1969, 46 (2P2) :442-&
[5]  
Jong J.-S. R., 2001, 2001 IEEE INT C MULT, P405
[6]  
McNab R. J., 1996, PROC 19 AUSTRALAS CO, P1247
[7]  
MCNAB RJ, 1996, DIG LIBR C
[8]   Automatic segmentation of acoustic musical signals using hidden Markov models [J].
Raphael, C .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (04) :360-370
[9]  
RAPHAEL C, 2001, INT S MUS INF RETR I
[10]  
SHIH HH, 2002, 2002 IEEE INT C MULT