Harmonic/Percussive Sound Separation Based on Anisotropic Smoothness of Spectrograms

被引:15
|
作者
Tachibana, Hideyuki [1 ]
Ono, Nobutaka [2 ,3 ]
Kameoka, Hirokazu [1 ,4 ]
Sagayama, Shigeki [2 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
[2] Natl Inst Informat, Tokyo 1010003, Japan
[3] Grad Univ Adv Studies SOKENDAI, Tokyo 1018430, Japan
[4] NTT Commun Sci Lab, Atsugi, Kanagawa 2430198, Japan
基金
日本学术振兴会;
关键词
Audio source separation; harmonic instruments; music signal processing; percussion; NONNEGATIVE MATRIX FACTORIZATION; MUSIC; PATTERNS; SIGNALS;
D O I
10.1109/TASLP.2014.2351131
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a method to separate a monaural music signal into harmonic components e. g., a guitar and percussive components, e. g., a snare drum. Separation of these two components is a useful preprocessing for many music information retrieval applications, and in addition, it can be used as a new kind of music equalizer in itself, which enables a music listener to adjust the ratio of the volume of the guitar and the drum freely by themselves. Because of these potential applications, there have been many attempts to develop such a technique, especially in the last decade. However, some of the state-of-the-art techniques have a drawback that they are based on costly operations, such as the multiplications of large-sized matrix, Monte Carlo method, etc., which may constitute barriers to the practical use on some small computers such as smart phones. In this paper, an efficient method that does not depend on these costly operations is described. In formulating the methods, the authors basically assumed only the "anisotropic smoothness" of music spectrogram, which can be one of the minimalistic model that reflects the natures of these instruments. To be specific, the authors just assumed that harmonic instruments are smooth in time, while the percussive instruments are smooth in frequency on a music spectrogram. In this paper, on the basis of the assumption, source separation methods are formulated as optimization problems that optimize the "anisotropic smoothness" under some conditions. Because of the simplicity of the model, the derived algorithms are quite simple. Experimental results show that the methods were effective compared to a state-of-the-art technique, and the computation time was much shorter than an existing method; specifically, it can process a three-minute song in around 4-20 seconds on a laptop PC.
引用
收藏
页码:2059 / 2073
页数:15
相关论文
共 31 条
  • [1] Exploiting Continuity/Discontinuity of Basis Vectors in Spectrogram Decomposition for Harmonic-Percussive Sound Separation
    Park, Jeongsoo
    Shin, Jaeyoung
    Lee, Kyogu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1061 - 1074
  • [2] Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events
    Reddy, Gurunath M.
    Rao, K. Sreenivasa
    Das, Partha Pratim
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 831 - 835
  • [3] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511
  • [4] Mid-Level Audio Features Based on Cascaded Harmonic-Residual-Percussive Separation
    Lopez-Serrano, Patricio
    Dittmar, Christian
    Mueller, Meinard
    2017 AES INTERNATIONAL CONFERENCE ON SEMANTIC AUDIO, 2017,
  • [5] PHASE-AWARE HARMONIC/PERCUSSIVE SOURCE SEPARATION VIA CONVEX OPTIMIZATION
    Masuyama, Yoshiki
    Yatabe, Kohei
    Oikawa, Yasuhiro
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 985 - 989
  • [6] Phase-recovery algorithm for harmonic/percussive source separation based on observed phase information and analytic computation
    Kobayashi, Kenji
    Masuyama, Yoshiki
    Yatabe, Kohei
    Oikawa, Yasuhiro
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2021, 42 (05) : 261 - 269
  • [7] Heart Sound Classification Using Harmonic and Percussive Spectral Features from Phonocardiograms with a Deep ANN Approach
    Singh, Anupinder
    Arora, Vinay
    Singh, Mandeep
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [8] Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks
    Gonzalez-Martinez, F. D.
    Carabias-Orti, J. J.
    Canadas-Quesada, F. J.
    Ruiz-Reyes, N.
    Martinez-Munoz, D.
    Garcia-Galan, S.
    APPLIED ACOUSTICS, 2024, 216
  • [9] Incorporation of Localization Information for Sound Source Separation in Spherical Harmonic Domain
    Guzik, Mateusz
    Fras, Mieszko
    Kowalczyk, Konrad
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [10] NTF of Spectral and Spatial Features for Tracking and Separation of Moving Sound Sources in Spherical Harmonic Domain
    Guzik, Mateusz
    Kowalczyk, Konrad
    INTERSPEECH 2022, 2022, : 261 - 265