CLASS-CONDITIONAL EMBEDDINGS FOR MUSIC SOURCE SEPARATION

被引：0

作者：

Seetharaman, Prem ^{[1
,2
]}

Wichern, Gordon ^{[1
]}

Venkataramani, Shrikant ^{[1
,3
]}

Le Roux, Jonathan ^{[1
]}

机构：

[1] MERL, Cambridge, MA 02139 USA

[2] Northwestern Univ, Evanston, IL 60208 USA

[3] Univ Illinois, Champaign, IL USA

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

source separation; deep clustering; music; classification; neural networks;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Isolating individual instruments in a musical mixture has a myriad of potential applications, and seems imminently achievable given the levels of performance reached by recent deep learning methods. While most musical source separation techniques learn an independent model for each instrument, we propose using a common embedding space for the time-frequency bins of all instruments in a mixture inspired by deep clustering and deep attractor networks. Additionally, an auxiliary network is used to generate parameters of a Gaussian mixture model (GMM) where the posterior distribution over GMM components in the embedding space can be used to create a mask that separates individual sources from a mixture. In addition to outperforming a mask-inference baseline on the MUSDB-18 dataset, our embedding space is easily interpretable and can be used for query-based separation.

引用

页码：301 / 305

页数：5

共 23 条

[11] Adaptive Pooling Operators for Weakly Labeled Sound Event Detection [J].

McFee, Brian ;

Salamon, Justin ;

Bello, Juan Pablo .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (11) :2180-2193

[12]

Nugraha A. A., 2016, SIGN PROC C EUSIPCO

[13]

Ozerov A., 2017, MACH LEARN SIGN PROC

[14]

Pardo B., 2006, IEEE SIGNAL PROCESSI, V23

[15]

Rafii Zafar, 2017, MUSDB18 CORPUS MUSIC

[16]

Salamon J, 2017, IEEE WORK APPL SIG, P344, DOI 10.1109/WASPAA.2017.8170052

[17] The 2018 Signal Separation Evaluation Campaign [J].

Stoter, Fabian-Robert ;

Liutkus, Antoine ;

Ito, Nobutaka .

LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 :293-305

[18]

Takahashi N, 2018, INT WORKSH ACOUSTIC, P106, DOI 10.1109/IWAENC.2018.8521383

[19]

Uhlich S, 2017, INT CONF ACOUST SPEE, P261, DOI 10.1109/ICASSP.2017.7952158

[20]

Wang D., 2017, ARXIV170807524

← 1 2 3 →