Improving the Performance of Deep Learning Based Speech Enhancement System Using Fuzzy Restricted Boltzmann Machine

被引：6

作者：

Samui, Suman ^{[1
]}

Chakrabarti, Indrajit ^{[1
]}

Ghosh, Soumya K. ^{[1
]}

机构：

[1] Indian Inst Technol Kharagpur, Kharagpur 721302, W Bengal, India

来源：

PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017 | 2017年 / 10597卷

关键词：

Speech enhancement; Deep learning; Deep neural network; Restricted Boltzmann machine (RBM); Fuzzy restricted Boltzmann machine (FRBM); Unsupervised pre-training; Speech quality; Speech intelligibility;

D O I：

10.1007/978-3-319-69900-4_68

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Supervised speech enhancement based on machine learning is a new paradigm for segregating clean speech from background noise. The current work represents a supervised speech enhancement system based on a robust deep learning method where the pre-training phase of deep belief network (DBN) has been conducted by employing fuzzy restricted Boltzmann machines (FRBM) instead of regular RBM. It has been observed that the performance of FRBM model is superior to that of RBM model particularly when the training data is noisy. Our experimental results on various noise scenarios have shown that the proposed approach outperforms the conventional DNN-based speech enhancement methods which use regular RBM for unsupervised pre-training.

引用

页码：534 / 542

页数：9

共 13 条

[1]

[Anonymous], 2013, COMPUT REV

[2] Fuzzy Restricted Boltzmann Machine for the Enhancement of Deep Learning [J].