Multi-Channel Audio Source Separation Using Multiple Deformed References

Cited by: 19
Authors
Souviraa-Labastie, Nathan [1]
Olivero, Anaik [1]
Vincent, Emmanuel [2]
Bimbot, Frederic [1]
Affiliations
[1] Univ Rennes 1, CNRS, Inria, PANAMA Project Team, IRISA, F-35000 Rennes, France
[2] Inria, F-54600 Villers Les Nancy, France
Funding
European Research Council
Keywords
Generalized expectation-maximization (GEM) algorithm; source separation; nonnegative matrix factorization; blind; information; models
DOI
10.1109/TASLP.2015.2450494
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We present a general multi-channel source separation framework where additional audio references are available for one (or more) source(s) of a given mixture. Each audio reference is another mixture which is supposed to contain at least one source similar to one of the target sources. Deformations between the sources of interest and their references are modeled in a linear manner using a generic formulation. This is done by adding transformation matrices to an excitation-filter model, hence affecting different axes, namely frequency, dictionary component or time. A nonnegative matrix co-factorization algorithm and a generalized expectation-maximization algorithm are used to estimate the parameters of the model. Different model parameterizations and different combinations of algorithms are tested on music plus voice mixtures guided by music and/or voice references and on professionally-produced music recordings guided by cover references. Our algorithms improve the signal-to-distortion ratio (SDR) of the sources with the lowest intensity by 9 to 15 decibels (dB) with respect to original mixtures.
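The co-factorization idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes single-channel magnitude spectrograms, a Euclidean cost (the abstract does not name the divergence), and a single known, fixed frequency-axis transformation matrix `T`, whereas the paper also estimates deformations and uses a GEM algorithm for the multi-channel case. The function name `cofactorize` and all parameter names are hypothetical. The mixture and the reference share the spectral dictionary `W`, and the reference is modeled as a linearly deformed version `T @ W` of it.

```python
import numpy as np

def cofactorize(V_mix, V_ref, T, n_components=4, n_iter=200, seed=0, eps=1e-12):
    """Jointly fit V_mix ~ W @ H_mix and V_ref ~ T @ W @ H_ref.

    Illustrative assumptions: Euclidean cost, T nonnegative and fixed,
    multiplicative updates of the Lee-Seung type.
    """
    rng = np.random.default_rng(seed)
    n_freq = V_mix.shape[0]
    # Random nonnegative initialization of the shared dictionary and activations.
    W = rng.random((n_freq, n_components)) + eps
    H_mix = rng.random((n_components, V_mix.shape[1])) + eps
    H_ref = rng.random((n_components, V_ref.shape[1])) + eps
    costs = []
    for _ in range(n_iter):
        TW = T @ W  # deformed dictionary seen by the reference
        # Activation updates: each spectrogram has its own activations.
        H_mix *= (W.T @ V_mix) / (W.T @ W @ H_mix + eps)
        H_ref *= (TW.T @ V_ref) / (TW.T @ TW @ H_ref + eps)
        # Shared dictionary update: both data terms contribute.
        num = V_mix @ H_mix.T + T.T @ V_ref @ H_ref.T
        den = W @ (H_mix @ H_mix.T) + T.T @ T @ W @ (H_ref @ H_ref.T) + eps
        W *= num / den
        costs.append(np.sum((V_mix - W @ H_mix) ** 2)
                     + np.sum((V_ref - T @ W @ H_ref) ** 2))
    return W, H_mix, H_ref, costs

# Synthetic example: the reference is the same dictionary shifted by one frequency bin.
rng = np.random.default_rng(1)
n_freq, n_comp = 16, 4
W_true = rng.random((n_freq, n_comp))
T = np.eye(n_freq, k=-1)  # hypothetical deformation: one-bin frequency shift
V_mix = W_true @ rng.random((n_comp, 30))
V_ref = T @ W_true @ rng.random((n_comp, 20))

W, H_mix, H_ref, costs = cofactorize(V_mix, V_ref, T)
print(f"cost: first iteration {costs[0]:.3f}, last iteration {costs[-1]:.3f}")
```

Sharing `W` is what lets the reference guide the separation: spectral patterns that explain the reference are pushed into the same components used to explain the mixture. In this sketch only the activations differ between the two signals.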
Pages: 1775-1787
Page count: 13
Related papers
50 in total
  • [31] SINGLE CHANNEL AUDIO SOURCE SEPARATION USING CONVOLUTIONAL DENOISING AUTOENCODERS
    Grais, Emad M.
    Plumbley, Mark D.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1265 - 1269
  • [32] UNSUPERVISED MULTI-CHANNEL SEPARATION AND ADAPTATION
    Han, Cong
    Wilson, Kevin
    Wisdom, Scott
    Hershey, John R.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 721 - 725
  • [33] Multi-Channel Signal Separation by Decorrelation
    Weinstein, Ehud
    Feder, Meir
    Oppenheim, Alan V.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (04): : 405 - 413
  • [34] MULTI-VIEW NETWORKS FOR MULTI-CHANNEL AUDIO CLASSIFICATION
    Casebeer, Jonah
    Wang, Zhepei
    Smaragdis, Paris
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 940 - 944
  • [35] BLIND SOURCE SEPARATION FROM MULTI-CHANNEL OBSERVATIONS WITH CHANNEL-VARIANT SPATIAL RESOLUTIONS
    Kayabol, Koray
    Salerno, Emanuele
    Luis Sanz, Jose
    Herranz, Diego
    Kuruoglu, Ercan E.
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1077 - 1081
  • [36] Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel Attention
    Hong, Qian-Bei
    Wu, Chung-Hsien
    Thanh Binh Nguyen
    Wang, Hsin-Min
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 619 - 623
  • [37] The Source Separation of Multi-channel Vibration Signal Based on Nonnegative Tensor Factorization
    Li, Guang
    Liang, Lin
    Liu, Dan
    Li, Maolin
    Wang, Bao
    Xu, Guanghua
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2018), 2018, : 359 - 363
  • [38] LOCATION AS SUPERVISION FOR WEAKLY SUPERVISED MULTI-CHANNEL SOURCE SEPARATION OF MACHINE SOUNDS
    Falcon-Perez, Ricardo
    Wichern, Gordon
    Germain, Francois G.
    Le Roux, Jonathan
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [39] Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation
    Nakagome, Yu
    Togami, Masahito
    Ogawa, Tetsuji
    Kobayashi, Tetsunori
    INTERSPEECH 2020, 2020, : 86 - 90
  • [40] Multi-channel intramuscular and surface EMG decomposition by convolutive blind source separation
    Negro, Francesco
    Muceli, Silvia
    Castronovo, Anna Margherita
    Holobar, Ales
    Farina, Dario
    JOURNAL OF NEURAL ENGINEERING, 2016, 13 (02)