Build A Module for Improvement Real Time Speech enhancement using Long Short-term Memory Approach

被引:0
作者
Van Vo [1 ]
Bach Le Son [2 ]
Huy Vo Phuc [2 ]
机构
[1] FPT Univ, Software Engn Dept, Hanoi, Vietnam
[2] FPT Univ, Informat Technol Specialized Dept, Hanoi, Vietnam
来源
PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2023 | 2023年
关键词
Speech enhancement; Noise suppression; Deep Learning; Long Short-term Memory; Virtual Call Center; Customer Relationship Management System;
D O I
10.1145/3591569.3591614
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An essential customer experience is required for all businesses today, and customer support as a service brings the right people and processes together. When designing a system for in the context of audio communication for transmission purposes, noise influences must be carefully considered. Improving the quality of phone calls for a smart virtual call center is essential for more effective customer care. This paper proposed a module for improving real-time speech enhancement of phone calls using Long short-term memory (LSTM), an artificial neural network used in the fields of artificial intelligence and deep learning. LSTMs are designed to revoke the long-term dependency issue, remembering information for long periods is generally their default way of behaving. The data set using for this approach is both in English and Vietnamese, the results also improve with evaluation metrics such as PESQ, SI-SDR, STOI.
引用
收藏
页码:259 / 264
页数:6
相关论文
共 50 条
[31]   Solid waste generation forecasts using long short-term memory approach [J].
Idrissi, Aya ;
Benabbou, Rajaa ;
Benhra, Jamal ;
El Haji, Mounia .
INTERNATIONAL JOURNAL OF APPLIED MANAGEMENT SCIENCE, 2025, 17 (02)
[32]   An Incremental Learning Approach Using Long Short-Term Memory Neural Networks [J].
Álvaro C. Lemos Neto ;
Rodrigo A. Coelho ;
Cristiano L. de Castro .
Journal of Control, Automation and Electrical Systems, 2022, 33 :1457-1465
[33]   Long Short-Term Memory for Speaker Generalization in Supervised Speech Separation [J].
Chen, Jitong ;
Wang, DeLiang .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :3314-3318
[34]   Audiovisual Speech Activity Detection with Advanced Long Short-Term Memory [J].
Tao, Fei ;
Busso, Carlos .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :1244-1248
[35]   Long Short-Term Memory Networks for Noise Robust Speech Recognition [J].
Woellmer, Martin ;
Sun, Yang ;
Eyben, Florian ;
Schuller, Bjoern .
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, :2966-2969
[36]   Predicting Perceptual Centers Located at Vowel Onset in German Speech Using Long Short-Term Memory Networks [J].
Schulz, Felicia ;
De Sisto, Mirella ;
Roncaglia-Denissen, M. Paula ;
Hendrix, Peter .
INTERSPEECH 2023, 2023, :1793-1797
[37]   A short-term water demand forecasting model using multivariate long short-term memory with meteorological data [J].
Zanfei, Ariele ;
Brentan, Bruno Melo ;
Menapace, Andrea ;
Righetti, Maurizio .
JOURNAL OF HYDROINFORMATICS, 2022, 24 (05) :1053-1065
[38]   Real-Time Short-Term Voltage Stability Assessment Using Combined Temporal Convolutional Neural Network and Long Short-Term Memory Neural Network [J].
Adhikari, Ananta ;
Naetiladdanon, Sumate ;
Sangswang, Anawach .
APPLIED SCIENCES-BASEL, 2022, 12 (13)
[39]   A review on the long short-term memory model [J].
Van Houdt, Greg ;
Mosquera, Carlos ;
Napoles, Gonzalo .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) :5929-5955
[40]   A review on the long short-term memory model [J].
Greg Van Houdt ;
Carlos Mosquera ;
Gonzalo Nápoles .
Artificial Intelligence Review, 2020, 53 :5929-5955