A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

被引：0

作者：

Bouvier, Damien ^{[1
]}

Obin, Nicolas ^{[1
]}

Liuni, Marco ^{[1
]}

Roebel, Axel ^{[1
]}

机构：

[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年

关键词：

speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, providing a physically meaningful speech separation. To do so, a source/filter model (indicated as Instantaneous Mixture Model or IMM) is integrated in the NMF. Furthermore, constraints are added to the IMM-NMF, in order to control the NMF behaviour during separation, and to enforce its physical meaning. In particular, a speech specific constraint-based on the source/filter coherence of speech - and a method for the automatic adaptation of constraints' weights during separation are presented. Also, the proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are then used in the constrained source/filter model. An experimental evaluation for speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. This evaluation showed that the use of adaptive constraints increases the performance of the source/filter model for speaker-dependent speech separation, and compares favorably to fully-supervised speech separation.

引用

页码：131 / 135

页数：5

共 50 条

[31] Upgrading Sparse NMF algorithm for blind source separation through Adaptive Parameterized Hybrid Kernel based approach [J].

ParimalaGandhi, A. ;

Vijayan, S. .

MEASUREMENT, 2019, 143 :11-21

[32] Single channel source separation using graph sparse NMF and adaptive dictionary learning [J].

Pham, Tuan ;

Lee, Yuan-Shan ;

Lin, Yan-Bo ;

Li, Yung-Hui ;

Tai, Tzu-Chiang ;

Wang, Jia-Ching .

INTELLIGENT DATA ANALYSIS, 2017, 21 :S5-S19

[33] TOWARDS SOURCE-FILTER BASED SINGLE SENSOR SPEECH SEPARATION [J].

Stark, Michael ;

Pernkopf, Franz .

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :97-100

[34] A Two-step NMF Based Algorithm for Single Channel Speech Separation [J].

Wang, Shuo ;

Wu, Wenjun .

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, :1987-1990

[35] EXPLOITING SPECTRO-TEMPORAL STRUCTURES USING NMF FOR DNN-BASED SUPERVISED SPEECH SEPARATION [J].

Nie, Shuai ;

Liang, Shan ;

Li, Hao ;

Zhang, XueLiang ;

Yang, ZhanLei ;

Liu, WenJu ;

Dong, LiKe .

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :469-473

[36] SPEECH-GUIDED SOURCE SEPARATION USING A PITCH-ADAPTIVE GUIDE SIGNAL MODEL [J].

Hennequin, Romain ;

Burred, Juan Jose ;

Maller, Simon ;

Leveau, Pierre .

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,

[37] Discriminative Training of NMF Model Based on Class Probabilities for Speech Enhancement [J].

Chung, Hanwook ;

Plourde, Eric ;

Champagne, Benoit .

IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) :502-506

[38] An adaptive autoregressive pre-whitener for speech and acoustic signals based on parametric NMF [J].

Jaramillo, Alfredo Esquivel ;

Nielsen, Jesper Kjaer ;

Christensen, Mads Graesboll .

SPEECH COMMUNICATION, 2023, 151 :9-23

[39] Frequency Selection Based Separation of Speech Signals with Reduced Computational Time Using Sparse NMF [J].

Varshney, Yash Vardhan ;

Abbasi, Zia Ahmad ;

Abidi, Musiur Raza ;

Farooq, Omar .

ARCHIVES OF ACOUSTICS, 2017, 42 (02) :287-295

[40] A Comparative Study of Blind Source Separation for Bioacoustics Sounds based on FastICA, PCA and NMF [J].

Hassan, Norsalina ;

Ramli, Dzati Athiar .

KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 :363-372

← 1 2 3 4 5 →