Blind source separation with optimal transport non-negative matrix factorization

被引:9
|
作者
Rolet, Antoine [1 ]
Seguy, Vivien [1 ]
Blondel, Mathieu [2 ]
Sawada, Hiroshi [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Yoshida Honmachi, Kyoto, Japan
[2] NTT Commun Sci Labs, Kyoto, Japan
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2018年
关键词
NMF; Speech; BSS; Optimal transport; ALGORITHMS;
D O I
10.1186/s13634-018-0576-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Optimal transport as a loss for machine learning optimization problems has recently gained a lot of attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (SIFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport, NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where frequencies represented in the input spectrograms may be different from one spectrogram to another.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Non-negative Matrix Factorization for Binary Data
    Larsen, Jacob Sogaard
    Clemmensen, Line Katrine Harder
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 555 - 563
  • [22] Source separation and apportionment of surface water pollution in the Luanhe River Basin based on non-negative matrix factorization
    Leng, Peifang
    Zhang, Qiuying
    Li, Fadong
    Zhang, Yizhang
    Gu, Congke
    WATER SUPPLY, 2019, 19 (07) : 1945 - 1954
  • [23] WISHART LOCALIZATION PRIOR ON SPATIAL COVARIANCE MATRIX IN AMBISONIC SOURCE SEPARATION USING NON-NEGATIVE TENSOR FACTORIZATION
    Guzik, Mateusz
    Kowalczyk, Konrad
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 446 - 450
  • [24] Signal Separation using Non-negative Matrix Factorization Based on R1-norm
    Kider, W.
    Abd El Aziz, M. E.
    LIFE SCIENCE JOURNAL-ACTA ZHENGZHOU UNIVERSITY OVERSEAS EDITION, 2012, 9 (04): : 703 - 707
  • [25] Probabilistic Sparse Non-negative Matrix Factorization
    Hinrich, Jesper Love
    Morup, Morten
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 488 - 498
  • [26] Convergence Analysis of Non-Negative Matrix Factorization for BSS Algorithm
    Yang, Shangming
    Yi, Zhang
    NEURAL PROCESSING LETTERS, 2010, 31 (01) : 45 - 64
  • [27] Convergence Analysis of Non-Negative Matrix Factorization for BSS Algorithm
    Shangming Yang
    Zhang Yi
    Neural Processing Letters, 2010, 31 : 45 - 64
  • [28] Regularized Non-negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source Separation
    Zhang, Shaofei
    Huang, Dongyan
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    Dong, Minghui
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1498 - 1502
  • [29] Non-Negative Matrix Factorization with Averaged Kurtosis and Manifold Constraints for Blind Hyperspectral Unmixing
    Song, Chunli
    Lu, Linzhang
    Zeng, Chengbin
    SYMMETRY-BASEL, 2024, 16 (11):
  • [30] Mono-To-Stereo Blind Upmix Using Non-Negative Matrix Factorization and Decorrelator
    Choi, Keunwoo
    Chon, Sang Bae
    Lee, Seokjin
    Sung, Koeng-Mo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2010, 29 (08): : 509 - 515