ON THE REDUCTION OF FALSE POSITIVES IN SINGING VOICE DETECTION

Cited by: 0
Authors
Lehner, Bernhard [1]
Widmer, Gerhard [1]
Sonnleitner, Reinhard [1]
Affiliation
[1] Johannes Kepler Univ Linz, Dept Computat Percept, Linz, Austria
Source
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014
Keywords
SPEECH;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
Motivated by the observation that one of the biggest problems in automatic singing voice detection is the confusion of vocals with other pitch-continuous and pitch-varying instruments, we propose a set of three new audio features designed to reduce the number of false vocal detections. Comparative experiments on three different musical corpora bear out this reduction. The resulting singing voice detector appears to be at least on par with more complex state-of-the-art methods. The new features and the classifier are very lightweight and, in principle, suitable for online use.
Pages: 5