Build A Module for Improvement Real Time Speech enhancement using Long Short-term Memory Approach

被引:0
作者
Van Vo [1 ]
Bach Le Son [2 ]
Huy Vo Phuc [2 ]
机构
[1] FPT Univ, Software Engn Dept, Hanoi, Vietnam
[2] FPT Univ, Informat Technol Specialized Dept, Hanoi, Vietnam
来源
PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2023 | 2023年
关键词
Speech enhancement; Noise suppression; Deep Learning; Long Short-term Memory; Virtual Call Center; Customer Relationship Management System;
D O I
10.1145/3591569.3591614
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An essential customer experience is required for all businesses today, and customer support as a service brings the right people and processes together. When designing a system for in the context of audio communication for transmission purposes, noise influences must be carefully considered. Improving the quality of phone calls for a smart virtual call center is essential for more effective customer care. This paper proposed a module for improving real-time speech enhancement of phone calls using Long short-term memory (LSTM), an artificial neural network used in the fields of artificial intelligence and deep learning. LSTMs are designed to revoke the long-term dependency issue, remembering information for long periods is generally their default way of behaving. The data set using for this approach is both in English and Vietnamese, the results also improve with evaluation metrics such as PESQ, SI-SDR, STOI.
引用
收藏
页码:259 / 264
页数:6
相关论文
共 50 条
[21]   Part of Speech Tagging for Indonesian Language using Bidirectional Long Short-Term Memory [J].
Handrata, Dellon ;
Purwanto, Christian Nathaniel ;
Chandra, Fransisca Haryanti ;
Santoso, Joan ;
Gunawan .
2019 1ST INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2019, :85-88
[22]   A Speech Recognition Method Using Long Short-Term Memory Network in Low Resources [J].
Shu F. ;
Qu D. ;
Zhang W. ;
Zhou L. ;
Guo W. .
Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2017, 51 (10) :120-127
[23]   Speech Perception Improvement Algorithm Based on a Dual-Path Long Short-Term Memory Network [J].
Koh, Hyeong Il ;
Na, Sungdae ;
Kim, Myoung Nam ;
Ieracitano, Cosimo ;
Zhang, Xuejun .
BIOENGINEERING-BASEL, 2023, 10 (11)
[24]   A deep learning approach to predict significant wave height using long short-term memory [J].
Minuzzi, Felipe C. ;
Farina, Leandro .
OCEAN MODELLING, 2023, 181
[25]   Waste Prediction Approach Using Hybrid Long Short-Term Memory with Support Vector Machine [J].
Fatovatikhah, Farnaz ;
Ahmedy, Ismail ;
Noor, Rafidah Md .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
[26]   Short-Term Load Forecasting using Long Short Term Memory Optimized by Genetic Algorithm [J].
Zulfiqar, Muhammad ;
Rasheed, Muhammad Babar .
2022 IEEE SUSTAINABLE POWER AND ENERGY CONFERENCE (ISPEC), 2022,
[27]   Speech Inpainting Based on Multi-Layer Long Short-Term Memory Networks [J].
Shi, Haohan ;
Shi, Xiyu ;
Dogan, Safak .
FUTURE INTERNET, 2024, 16 (02)
[28]   An Incremental Learning Approach Using Long Short-Term Memory Neural Networks [J].
Lemos Neto, Alvaro C. ;
Coelho, Rodrigo A. ;
de Castro, Cristiano L. .
JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2022, 33 (05) :1457-1465
[29]   An Incremental Learning Approach Using Long Short-Term Memory Neural Networks [J].
Álvaro C. Lemos Neto ;
Rodrigo A. Coelho ;
Cristiano L. de Castro .
Journal of Control, Automation and Electrical Systems, 2022, 33 :1457-1465
[30]   Long Short-Term Memory for Speaker Generalization in Supervised Speech Separation [J].
Chen, Jitong ;
Wang, DeLiang .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :3314-3318