A Helium Speech Correction Method Based on Generative Adversarial Networks

被引:0
|
作者
Li, Hongjun [1 ]
Chen, Yuxiang [1 ]
Ji, Hongwei [2 ]
Zhang, Shibing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Shanghai Salvage Co, Shanghai 200090, Peoples R China
基金
中国国家自然科学基金;
关键词
helium speech correction; generative adversarial network; helium speech dataset; INTELLIGIBILITY; TRANSLATION;
D O I
10.3390/bdcc8110158
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The distortion of helium speech caused by helium-oxygen gas mixtures significantly impacts the safety and communication efficiency of saturation divers. Although existing correction methods have shown some effectiveness in improving the intelligibility of helium speech, challenges remain in enhancing clarity and high-pitch correction. To address the issue of degraded speech quality post-correction, a novel helium speech correction method based on generative adversarial networks (GANs) is proposed. Firstly, a new helium speech dataset is introduced, which includes isolated words and continuous speech in both Chinese and English. By training and testing on both isolated words and continuous passages, the correction capability of the model can be accurately evaluated. Secondly, a new evaluation system for helium speech correction is proposed, which partially fills the gap in current helium speech evaluation metrics. This system uses comprehensive similarity to evaluate the similarity of keywords at the sentence level, thus assessing the correction results of helium speech from both word and sentence dimensions. Lastly, a GAN-based helium speech correction method is designed. This method solves the problems of pitch period distortion and formant shift in helium speech by introducing an adaptive speech segmentation algorithm and a fusion loss function and significantly improves the clarity and intelligibility of corrected helium speech. The experimental results show that the corrected helium speech is improved in clarity and intelligibility, which shows its practical value and application potential.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A New Method for Improving Generative Adversarial Networks in Speech Enhancement
    Yang, Fan
    Li, Junfeng
    Yan, Yonghong
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [2] Speech Enhancement Based On Spectrogram Conditional Generative Adversarial Networks
    Han, Ru
    Liu, Jianming
    Wang, Mingwen
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [3] Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition
    Wang, Ke
    Zhang, Junbo
    Sun, Sining
    Wang, Yujun
    Xiang, Fei
    Xie, Lei
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1581 - 1585
  • [4] Generative adversarial networks for speech processing: A review
    Wali, Aamir
    Alamgir, Zareen
    Karim, Saira
    Fawaz, Ather
    Ali, Mubariz Barkat
    Adan, Muhammad
    Mujtaba, Malik
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [5] Speech Loss Compensation by Generative Adversarial Networks
    Shi, Yupeng
    Zheng, Nengheng
    Kang, Yuyong
    Rong, Weicong
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 347 - 351
  • [6] Texture synthesis method based on generative adversarial networks
    Yu S.
    Han Z.
    Tang Y.
    Wu C.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (02):
  • [7] Age Estimation Method Based on Generative Adversarial Networks
    Ning, Xin
    Li, Weijun
    Sun, Linjun
    2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE AND INTERNET TECHNOLOGY, CII 2017, 2017, : 333 - 340
  • [8] A New Steganography Method Based on Generative Adversarial Networks
    Naito, Hiroshi
    Zhao, Qiangfu
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 495 - 500
  • [9] Image registration method based on Generative Adversarial Networks
    Sun, Yujie
    Qi, Heping
    Wang, Chuanyou
    Tao, Lei
    2020 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2020), 2020, : 183 - 188
  • [10] A Model of Emotional Speech Generation Based on Conditional Generative Adversarial Networks
    Jia, Ning
    Zheng, Chunjun
    Sun, Wei
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 106 - 109