Harmonic/Percussive Sound Separation Based on Anisotropic Smoothness of Spectrograms

被引：15

作者：

Tachibana, Hideyuki ^{[1
]}

Ono, Nobutaka ^{[2
,3
]}

Kameoka, Hirokazu ^{[1
,4
]}

Sagayama, Shigeki ^{[2
]}

机构：

[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan

[2] Natl Inst Informat, Tokyo 1010003, Japan

[3] Grad Univ Adv Studies SOKENDAI, Tokyo 1018430, Japan

[4] NTT Commun Sci Lab, Atsugi, Kanagawa 2430198, Japan

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2014年 / 22卷 / 12期

基金：

日本学术振兴会;

关键词：

Audio source separation; harmonic instruments; music signal processing; percussion; NONNEGATIVE MATRIX FACTORIZATION; MUSIC; PATTERNS; SIGNALS;

D O I：

10.1109/TASLP.2014.2351131

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper describes a method to separate a monaural music signal into harmonic components e. g., a guitar and percussive components, e. g., a snare drum. Separation of these two components is a useful preprocessing for many music information retrieval applications, and in addition, it can be used as a new kind of music equalizer in itself, which enables a music listener to adjust the ratio of the volume of the guitar and the drum freely by themselves. Because of these potential applications, there have been many attempts to develop such a technique, especially in the last decade. However, some of the state-of-the-art techniques have a drawback that they are based on costly operations, such as the multiplications of large-sized matrix, Monte Carlo method, etc., which may constitute barriers to the practical use on some small computers such as smart phones. In this paper, an efficient method that does not depend on these costly operations is described. In formulating the methods, the authors basically assumed only the "anisotropic smoothness" of music spectrogram, which can be one of the minimalistic model that reflects the natures of these instruments. To be specific, the authors just assumed that harmonic instruments are smooth in time, while the percussive instruments are smooth in frequency on a music spectrogram. In this paper, on the basis of the assumption, source separation methods are formulated as optimization problems that optimize the "anisotropic smoothness" under some conditions. Because of the simplicity of the model, the derived algorithms are quite simple. Experimental results show that the methods were effective compared to a state-of-the-art technique, and the computation time was much shorter than an existing method; specifically, it can process a three-minute song in around 4-20 seconds on a laptop PC.

引用

页码：2059 / 2073

页数：15

共 31 条

[1] Exploiting Continuity/Discontinuity of Basis Vectors in Spectrogram Decomposition for Harmonic-Percussive Sound Separation
Park, Jeongsoo
Shin, Jaeyoung
Lee, Kyogu
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1061 - 1074
[2] Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events
Reddy, Gurunath M.
Rao, K. Sreenivasa
Das, Partha Pratim
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 831 - 835
[3] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
Laroche, Clement
Kowalski, Matthieu
Papadopoulos, Helene
Richard, Gael
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511
[4] Mid-Level Audio Features Based on Cascaded Harmonic-Residual-Percussive Separation
Lopez-Serrano, Patricio
Dittmar, Christian
Mueller, Meinard
2017 AES INTERNATIONAL CONFERENCE ON SEMANTIC AUDIO, 2017,
[5] PHASE-AWARE HARMONIC/PERCUSSIVE SOURCE SEPARATION VIA CONVEX OPTIMIZATION
Masuyama, Yoshiki
Yatabe, Kohei
Oikawa, Yasuhiro
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 985 - 989
[6] Phase-recovery algorithm for harmonic/percussive source separation based on observed phase information and analytic computation
Kobayashi, Kenji
Masuyama, Yoshiki
Yatabe, Kohei
Oikawa, Yasuhiro
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2021, 42 (05) : 261 - 269
[7] Heart Sound Classification Using Harmonic and Percussive Spectral Features from Phonocardiograms with a Deep ANN Approach
Singh, Anupinder
Arora, Vinay
Singh, Mandeep
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[8] Improving snore detection under limited dataset through harmonic/percussive source separation and convolutional neural networks
Gonzalez-Martinez, F. D.
Carabias-Orti, J. J.
Canadas-Quesada, F. J.
Ruiz-Reyes, N.
Martinez-Munoz, D.
Garcia-Galan, S.
APPLIED ACOUSTICS, 2024, 216
[9] Incorporation of Localization Information for Sound Source Separation in Spherical Harmonic Domain
Guzik, Mateusz
Fras, Mieszko
Kowalczyk, Konrad
IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
[10] NTF of Spectral and Spatial Features for Tracking and Separation of Moving Sound Sources in Spherical Harmonic Domain
Guzik, Mateusz
Kowalczyk, Konrad
INTERSPEECH 2022, 2022, : 261 - 265

← 1 2 3 4 →