A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

被引:0
作者
Bouvier, Damien [1 ]
Obin, Nicolas [1 ]
Liuni, Marco [1 ]
Roebel, Axel [1 ]
机构
[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, providing a physically meaningful speech separation. To do so, a source/filter model (indicated as Instantaneous Mixture Model or IMM) is integrated in the NMF. Furthermore, constraints are added to the IMM-NMF, in order to control the NMF behaviour during separation, and to enforce its physical meaning. In particular, a speech specific constraint-based on the source/filter coherence of speech - and a method for the automatic adaptation of constraints' weights during separation are presented. Also, the proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are then used in the constrained source/filter model. An experimental evaluation for speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. This evaluation showed that the use of adaptive constraints increases the performance of the source/filter model for speaker-dependent speech separation, and compares favorably to fully-supervised speech separation.
引用
收藏
页码:131 / 135
页数:5
相关论文
共 50 条
[31]   Upgrading Sparse NMF algorithm for blind source separation through Adaptive Parameterized Hybrid Kernel based approach [J].
ParimalaGandhi, A. ;
Vijayan, S. .
MEASUREMENT, 2019, 143 :11-21
[32]   Single channel source separation using graph sparse NMF and adaptive dictionary learning [J].
Pham, Tuan ;
Lee, Yuan-Shan ;
Lin, Yan-Bo ;
Li, Yung-Hui ;
Tai, Tzu-Chiang ;
Wang, Jia-Ching .
INTELLIGENT DATA ANALYSIS, 2017, 21 :S5-S19
[33]   TOWARDS SOURCE-FILTER BASED SINGLE SENSOR SPEECH SEPARATION [J].
Stark, Michael ;
Pernkopf, Franz .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :97-100
[34]   A Two-step NMF Based Algorithm for Single Channel Speech Separation [J].
Wang, Shuo ;
Wu, Wenjun .
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, :1987-1990
[35]   EXPLOITING SPECTRO-TEMPORAL STRUCTURES USING NMF FOR DNN-BASED SUPERVISED SPEECH SEPARATION [J].
Nie, Shuai ;
Liang, Shan ;
Li, Hao ;
Zhang, XueLiang ;
Yang, ZhanLei ;
Liu, WenJu ;
Dong, LiKe .
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :469-473
[36]   SPEECH-GUIDED SOURCE SEPARATION USING A PITCH-ADAPTIVE GUIDE SIGNAL MODEL [J].
Hennequin, Romain ;
Burred, Juan Jose ;
Maller, Simon ;
Leveau, Pierre .
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[37]   Discriminative Training of NMF Model Based on Class Probabilities for Speech Enhancement [J].
Chung, Hanwook ;
Plourde, Eric ;
Champagne, Benoit .
IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) :502-506
[38]   An adaptive autoregressive pre-whitener for speech and acoustic signals based on parametric NMF [J].
Jaramillo, Alfredo Esquivel ;
Nielsen, Jesper Kjaer ;
Christensen, Mads Graesboll .
SPEECH COMMUNICATION, 2023, 151 :9-23
[39]   Frequency Selection Based Separation of Speech Signals with Reduced Computational Time Using Sparse NMF [J].
Varshney, Yash Vardhan ;
Abbasi, Zia Ahmad ;
Abidi, Musiur Raza ;
Farooq, Omar .
ARCHIVES OF ACOUSTICS, 2017, 42 (02) :287-295
[40]   A Comparative Study of Blind Source Separation for Bioacoustics Sounds based on FastICA, PCA and NMF [J].
Hassan, Norsalina ;
Ramli, Dzati Athiar .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 :363-372