Online Score-Informed Source Separation with Adaptive Instrument Models

被引:11
作者
Rodriguez-Serrano, Francisco J. [1 ]
Duan, Zhiyao [2 ]
Vera-Candeas, Pedro [1 ]
Pardo, Bryan [3 ]
Carabias-Orti, Julio J. [4 ]
机构
[1] Univ Jaen, Dept Telecommun Engn, Jaen, Spain
[2] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
[3] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL USA
[4] Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain
关键词
NMF; online; score-informed; instrument-models; adaptive; score alignment; source separation; NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; CONSTRAINTS; AUDIO;
D O I
10.1080/09298215.2014.989174
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, an online score-informed source separation system is proposed under the Non-negative Matrix Factorization (NMF) framework, using parametric instrument models. Each instrument is modelled using a multi-excitation source-filter model, which provides the flexibility to model different instruments. The instrument models are initially learned on training excerpts of the same kinds of instruments, and are then adapted, during the separation, to the specific instruments used in the audio being separated. The model adaptation method needs to access the musical score content for each instrument, which is provided by an online audio-score alignment method. Source separation is improved by adapting the instrument models using score alignment. Experiments are performed to evaluate the proposed system and its individual components. Results show that it outperforms a state-of-the-art comparison method.
引用
收藏
页码:83 / 96
页数:14
相关论文
共 45 条
  • [1] [Anonymous], 2004, 18 INT C AC
  • [2] [Anonymous], 2009, P 10 INT SOC MUS INF
  • [3] Babaie-zadeh M., 2006, IEEE T AUDIO SPEECH, V18, P538
  • [4] Badeau R., 2009, INT C ACOUST SPEECH
  • [5] Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription
    Bertin, Nancy
    Badeau, Roland
    Vincent, Emmanuel
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 538 - 549
  • [6] Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization
    Carabias-Orti, J. J.
    Virtanen, T.
    Vera-Candeas, P.
    Ruiz-Reyes, N.
    Canadas-Quesada, F. J.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1144 - 1158
  • [7] Cichocki A, 2007, LECT NOTES COMPUT SC, V4493, P793
  • [8] Comon P, 2010, HANDBOOK OF BLIND SOURCE SEPARATION: INDEPENDENT COMPONENT ANALYSIS AND APPLICATIONS, P1
  • [9] CONT A, 2006, INT CONF ACOUST SPEE, P245
  • [10] A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment
    Cont, Arshia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (06) : 974 - 987