Monaural Source Separation Using a Random Forest Classifier

被引：1

作者：

Riday, Cosimo ^{[1
]}

Bhargava, Saurabh

Hahnloser, Richard H. R.

Liu, Shih-Chii

机构：

[1] Univ Zurich, Inst Neuroinformat, Zurich, Switzerland

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

基金：

瑞士国家科学基金会;

关键词：

monaural source separation; random forest; deep learning; CASA; IMPROVE SPEECH RECOGNITION; NOISE;

D O I：

10.21437/Interspeech.2016-252

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We address the problem of separating two audio sources from a single channel mixture recording. A novel method called Multi Layered Random Forest (MLRF) that learns a binary mask for both the sources is presented. Random Forest (RF) classifiers are trained for each frequency band of a source spectrogram. A specialized set of linear transformations are applied to a local time-frequency (T-F) neighborhood of the mixture that captures relevant local statistics. A sampling method is presented that efficiently samples T-F training bins in each frequency band. We draw equal numbers of dominant (more power) training samples from the two sources for RF classifiers that estimate the Ideal Binary Mask (IBM). An estimated IBM in a given layer is used to train a RF classifier in the next higher layer of the MLRF hierarchy. On average, MLRF performs better than deep Recurrent Neural Networks (RNNs) and Non-Negative Sparse Coding (NNSC) in signalto-noise ratio (SNR) of reconstructed audio, overall T-F bin classification accuracy, as well as PESQ and STOI scores. Additionally, we demonstrate the ability of the MLRF to correctly reconstruct T-F bins of the target even when the latter has lower power in that frequency band.

引用

页码：3344 / 3348

页数：5

共 50 条

[11] Congestive heart failure detection using random forest classifier [J].

Masetic, Zerina ;

Subasi, Abdulhamit .

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2016, 130 :54-64

[12] An Ensemble classifier approach for Disease Diagnosis using Random Forest [J].

Pachange, Sarika ;

Joglekar, Bela ;

Kulkarni, Parag .

2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,

[13] Default Risk Prediction Using Random Forest and XGBoosting Classifier [J].

Sharma, Alok Kumar ;

Li, Li-Hua ;

Ahmad, Ramli .

2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 :91-101

[14] Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks [J].

Sun, Yang ;

Wang, Wenwu ;

Chambers, Jonathon ;

Naqvi, Syed Mohsen .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) :125-139

[15] Defocus blur radius classification using Random Forest Classifier [J].

Gajjar, Ruchi ;

Zaveri, Tanish .

2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRONICS, SIGNAL PROCESSING AND COMMUNICATION (IESC), 2017, :219-223

[16] Brahmi Script Recognition Using Optimized Convolutional Neural Network with Random Forest Classifier [J].

Jain, Trang ;

Jain, V.K. ;

Jain, A. .

International Journal of High Speed Electronics and Systems, 2025, 34 (04)

[17] Deep neural network and random forest classifier for source tracking of chemical leaks using fence monitoring data [J].

Cho, Jaehoon ;

Kim, Hyunseung ;

Gebreselassie, Addis Lulu ;

Shin, Dongil .

JOURNAL OF LOSS PREVENTION IN THE PROCESS INDUSTRIES, 2018, 56 :548-558

[18] MONAURAL SOURCE SEPARATION: FROM ANECHOIC TO REVERBERANT ENVIRONMENTS [J].

Cord-Landwehr, Tobias ;

Boeddeker, Christoph ;

Von Neumann, Thilo ;

Zorila, Catalin ;

Doddipatla, Rama ;

Haeb-Umbach, Reinhold .

2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,

[19] Random Forest Twitter Bot Classifier [J].

Schnebly, James ;

Sengupta, Shamik .

2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, :506-512

[20] SEQUENTIALLY TRAINED DNNS BASED MONAURAL SOURCE SEPARATION IN REAL ROOM ENVIRONMENTS [J].

Li, Yi ;

Sun, Yang ;

Naqvi, Syed Mohsen .

2019 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2019,

← 1 2 3 4 5 →