Separation of synchronous pitched notes by spectral filtering of harmonics

被引:40
作者
Every, Mark R. [1 ]
Szymanski, John E.
机构
[1] Univ Surrey, SEPS, CVSSP, Guildford GU2 7XH, Surrey, England
[2] Univ York, Dept Elect, Media Engn Res Grp, York YO10 5DD, N Yorkshire, England
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 05期
关键词
music note separation; partial extraction; separation of overlapping harmonics;
D O I
10.1109/TSA.2005.858528
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper discusses the separation of two or more simultaneously excited pitched notes from a mono sound file into separate tracks. In fact, this is an intermediate stage in the longer-term goal of separating out at least two interweaving melodies of different sound sources from a mono file. The approach is essentially to filter the set of harmonics of each note from the mixed spectrum in each time frame of audio. A major consideration has been the separation of overlapping harmonics, and three filter designs are proposed for splitting a spectral peak into its constituent partials given the rough frequency and amplitude estimates of each partial contained within. The overall quality of separation has been good for mixes of up to seven orchestral notes and has been confirmed by measured average signal-to-residual ratios of around 10-20 dB.
引用
收藏
页码:1845 / 1856
页数:12
相关论文
共 19 条
[1]  
Cappe O., 1995, WORKSH APPL SIGN PRO
[2]   The auditory organization of speech and other sources in listeners and computational models [J].
Cooke, M ;
Ellis, DPW .
SPEECH COMMUNICATION, 2001, 35 (3-4) :141-177
[3]  
DEPALLE P, 1997, IEEE WORKSH APP SIGN
[4]  
Desainte-Catherine M, 2000, J AUDIO ENG SOC, V48, P654
[5]  
DONALDIO, 1999, INTERPOLATE FREQUENC
[6]  
EVERY MR, 2004, NOTE SEPARATION DEMO
[7]  
EVERY MR, 2004, 7 INT C DIG AUD EFF
[8]  
Fletcher NevilleH., 1998, PHYS MUSICAL INSTRUM, V2d
[9]   Monaural speech segregation based on pitch tracking and amplitude modulation [J].
Hu, GN ;
Wang, DL .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (05) :1135-1150
[10]  
KLAPURI A, 2000, COST G6 C DIG AUD EF