Audio Source Separation Based on Nonnegative Matrix Factorization with Graph Harmonic Structure

被引:0
|
作者
Ichita, Tomohiro [1 ]
Kyochi, Seisuke [1 ]
Imoto, Keisuke [2 ]
机构
[1] Univ Kitakyushu, Fukuoka, Fukuoka, Japan
[2] Ritsumeikan Univ, Kusatsu, Shiga, Japan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a novel single-channel audio source separation based on graph-regularized nonnegative matrix factorization (NMF) taking harmonic frequency structure of each instrument into account. Since the original NMF, which is regarded as unsupervised learning, cannot readily identify the corresponding basis matrix for each target source, supervised NMFs (SNMFs) using given basis matrices learned from training sources have been extensively studied. Although SNMFs usually separate a mixed source better than NMF, the performance is degraded when training sources different from the observed source. The proposed SNMF does not use learned basis matrices but uses learned graph Laplacian matrices characterizing a harmonic frequency structure of training sources for regularization. Even if training sources are different from target sources, the graph structures from observed and training sources are more correlated, thus, as experimental results show, it can separate more robustly.
引用
收藏
页码:1148 / 1152
页数:5
相关论文
共 50 条
  • [1] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [2] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [3] Supervised Audio Source Separation Based on Nonnegative Matrix Factorization with Cosine Similarity Penalty
    Iwase, Yuta
    Kitamura, Daichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (06) : 906 - 913
  • [4] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Pezzoli, Mirco
    Carabias-Orti, Julio Jose
    Cobos, Maximo
    Antonacci, Fabio
    Sarti, Augusto
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
  • [5] Temporal annotation-based audio source separation using weighted nonnegative matrix factorization
    Duong, Ngoc Q. K.
    Ozerov, Alexey
    Chevallier, Louis
    2014 IEEE Fourth International Conference on Consumer Electronics Berlin (ICCE-Berlin), 2014, : 220 - 224
  • [6] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [7] NONNEGATIVE TENSOR FACTORIZATION FOR SOURCE SEPARATION OF LOOPS IN AUDIO
    Smith, Jordan B. L.
    Goto, Masataka
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 171 - 175
  • [8] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [9] A COMPARATIVE STUDY OF EXAMPLE-GUIDED AUDIO SOURCE SEPARATION APPROACHES BASED ON NONNEGATIVE MATRIX FACTORIZATION
    Ozerov, Alexey
    Kitic, Srdan
    Perez, Patrick
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [10] β-Divergence Two-Dimensional Sparse Nonnegative Matrix Factorization for Audio Source Separation
    Darsono, A. M.
    Haron, N. Z.
    Jaafar, A. S.
    Ahmad, M. I.
    2013 IEEE CONFERENCE ON WIRELESS SENSOR (ICWISE), 2013, : 119 - 123