Blind source separation with optimal transport non-negative matrix factorization

被引:9
|
作者
Rolet, Antoine [1 ]
Seguy, Vivien [1 ]
Blondel, Mathieu [2 ]
Sawada, Hiroshi [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Yoshida Honmachi, Kyoto, Japan
[2] NTT Commun Sci Labs, Kyoto, Japan
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2018年
关键词
NMF; Speech; BSS; Optimal transport; ALGORITHMS;
D O I
10.1186/s13634-018-0576-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Optimal transport as a loss for machine learning optimization problems has recently gained a lot of attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (SIFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport, NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where frequencies represented in the input spectrograms may be different from one spectrogram to another.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Sparse coding of human motion trajectories with non-negative matrix factorization
    Vollmer, Christian
    Hellbach, Sven
    Eggert, Julian
    Gross, Horst-Michael
    NEUROCOMPUTING, 2014, 124 : 22 - 32
  • [42] Intraday Trading Volume and Non-Negative Matrix Factorization
    Takada, Hellinton H.
    Stern, Julio M.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2016, 1757
  • [43] CUSTOM SIZED NON-NEGATIVE MATRIX FACTOR DECONVOLUTION FOR SOUND SOURCE SEPARATION
    Becker, Julian M.
    Rohlfing, Christian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [44] Detection of Sources in Non-Negative Blind Source Separation by Minimum Description Length Criterion
    Lin, Chia-Hsiang
    Chi, Chong-Yung
    Chen, Lulu
    Miller, David J.
    Wang, Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4022 - 4037
  • [45] Guided Semi-Supervised Non-Negative Matrix Factorization
    Li, Pengyu
    Tseng, Christine
    Zheng, Yaxuan
    Chew, Joyce A.
    Huang, Longxiu
    Jarman, Benjamin
    Needell, Deanna
    ALGORITHMS, 2022, 15 (05)
  • [46] Split Gradient Method for Informed Non-negative Matrix Factorization
    Chreiky, Robert
    Delmaire, Gilles
    Puigt, Matthieu
    Roussel, Gilles
    Courcot, Dominique
    Abche, Antoine
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 376 - 383
  • [47] Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization
    Kaya, Oguz
    Kannan, Ramakrishnan
    Ballard, Grey
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [48] Non-Negative Matrix Factorization with Auxiliary Information on Overlapping Groups
    Shiga, Motoki
    Mamitsuka, Hiroshi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1615 - 1628
  • [49] Intersecting Faces: Non-negative Matrix Factorization With New Guarantees
    Ge, Rong
    Zou, James
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2295 - 2303
  • [50] Speaker conversion using kernel non-negative matrix factorization
    Xu Qinyu
    Lu Guanming
    Yan Jingjie
    Li Haibo
    Cheng Xiao
    The Journal of China Universities of Posts and Telecommunications, 2017, (05) : 60 - 67