Sinusoidal-Based Lowband Synthesis for Artificial Speech Bandwidth Extension

被引:6
作者
Abel, Johannes [1 ]
Fingscheidt, Tim [1 ]
机构
[1] Tech Univ Carolo Wilhelmina Braunschweig, Inst Commun Technol, D-38106 Braunschweig, Germany
关键词
Artificial speech bandwidth extension; lowband; sinusoidal; TELEPHONE SPEECH; SPECTRAL ENVELOPE; BAND EXTENSION; NARROW-BAND; IMPLEMENTATION; QUALITY;
D O I
10.1109/TASLP.2019.2895969
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Conventional narrowband (NB) telephony suffers from limited acoustic bandwidth at the receiver side, leading to degraded speech quality and intelligibility. In this paper, artificial speech bandwidth extension (ABE) of NB speech toward missing frequencies below about 300 Hz (low-frequency band, LB) is proposed to enhance the speech quality. The LB-ABE in this paper is employed together with a preexisting ABE toward high-frequency components to obtain spectrally balanced speech signals. In an instrumental quality assessment, the spectral distance in the LB was improved by more than S dB compared to NB speech. In a subjective listening test, the gap of speech quality between wideband and NB speech was significantly reduced when employing the proposed ABE toward low frequencies. The LB extension was found to further improve the preexisting ABE toward higher frequencies by a significant 0.26 CMOS points.
引用
收藏
页码:765 / 776
页数:12
相关论文
共 69 条
  • [1] Abel J, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5469, DOI 10.1109/ICASSP.2018.8462362
  • [2] Abel J, 2017, IEEE WORK APPL SIG, P219, DOI 10.1109/WASPAA.2017.8170027
  • [3] Artificial Speech Bandwidth Extension Using Deep Neural Networks for Wideband Spectral Envelope Estimation
    Abel, Johannes
    Fingscheidt, Tim
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) : 71 - 83
  • [4] An Instrumental Quality Measure for Artificially Bandwidth-Extended Speech Signals
    Abel, Johannes
    Kaniewska, Magdalena
    Guillaume, Cyril
    Tirry, Wouter
    Fingscheidt, Tim
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (02) : 384 - 396
  • [5] Abel J, 2016, INT CONF ACOUST SPEE, P5915, DOI 10.1109/ICASSP.2016.7472812
  • [6] [Anonymous], ITU T REC P 800 METH
  • [7] [Anonymous], ITU T REC P 862 2 WI
  • [8] [Anonymous], ITU T REC P 863 PERC
  • [9] [Anonymous], 2016, IEEE INT WORKSH AC S
  • [10] [Anonymous], 2004, 26090 3GPP TS