Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation

被引:3
|
作者
Laroche, Clement [1 ,2 ]
Kowalski, Matthieu [2 ]
Papadopoulos, Helene [2 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, Telecom ParisTech, LTCI, F-75013 Paris, France
[2] Univ Paris Sud, Cent Supelec, CNRS, UMR 8506,Lab Signaux & Syst, F-91192 Gif Sur Yvette, France
关键词
Nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition; POLYPHONIC MUSIC; MELODY EXTRACTION; SPEECH SIGNALS; TRANSCRIPTION; DECOMPOSITION; ALGORITHMS;
D O I
10.1109/TASLP.2018.2830116
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
One of the most general models of music signals considers that such signals can be represented as a sum of two distinct components: a tonal part that is sparse in frequency and temporally stable and a transient (or percussive) part that is composed of short-term broadband sounds. In this paper, we propose a novel hybrid method built upon nonnegative matrix factorization (NMF) that decomposes the time frequency representation of an audio signal into such two components. The tonal part is estimated by a sparse and orthogonal nonnegative decomposition, and the transient part is estimated by a straightforward NMF decomposition constrained by a pre-learned dictionary of smooth spectra. The optimization problem at the heart of our method remains simple with very few hyperparameters and can be solved thanks to simple multiplicative update rules. The extensive benchmark on a large and varied music database against four state of the art harmonic/percussive source separation algorithms demonstrate the merit of the proposed approach.
引用
收藏
页码:1499 / 1511
页数:13
相关论文
共 50 条
  • [21] DRUM TRANSCRIPTION USING PARTIALLY FIXED NONNEGATIVE MATRIX FACTORIZATION
    Wu, Chih-Wei
    Lerch, Alexander
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1281 - 1285
  • [22] Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints
    Francisco Jesus Canadas-Quesada
    Pedro Vera-Candeas
    Nicolas Ruiz-Reyes
    Julio Carabias-Orti
    Pablo Cabanas-Molero
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [23] Geometric Source Separation Method Using Nonnegative Matrix Factorization and Interference Suppression
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (11) : 2442 - 2447
  • [24] Source Separation Based On Nonnegative Matrix Factorization and Independent Component Correlation Algorithm
    Kong, Xiangwei
    Liang, Lin
    Yang, Tianshe
    Zhao, Jing
    Wang, Xuhua
    2015 8TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2015, : 1614 - 1619
  • [25] Joint Nonnegative Matrix Factorization for Underdetermined Blind Source Separation in Nonlinear Mixtures
    Kopriva, Ivica
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 107 - 115
  • [26] Blind Source Separation of Heart and Lung Sounds Based on Nonnegative Matrix Factorization
    Lin, ChingShun
    Hasting, Erwin
    2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 2013, : 731 - 736
  • [27] Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization
    Hossain, Md. Imran
    Islam, Md. Shohidul
    Khatun, Mst. Titasa
    Ullah, Rizwan
    Masood, Asim
    Ye, Zhongfu
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (04) : 1868 - 1891
  • [28] Multimodal Soft Nonnegative Matrix Co-Factorization for Convolutive Source Separation
    Sedighin, Farnaz
    Babaie-Zadeh, Massoud
    Rivet, Bertrand
    Jutten, Christian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (12) : 3179 - 3190
  • [29] STUDENT'S T MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Kitamura, Koichi
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [30] Blind source separation for groundwater pressure analysis based on nonnegative matrix factorization
    Alexandrov, Boian S.
    Vesselinov, Velimir V.
    WATER RESOURCES RESEARCH, 2014, 50 (09) : 7332 - 7347