On the evaluation of the conversational speech quality in telecommunications

Cited by: 20
Authors
Gueguin, Marie [1]
Le Bouquin-Jeannes, Regine [1]
Gautier-Turbin, Valerie [2]
Faucon, Gerard [1]
Barriac, Vincent [2]
Affiliations
[1] Univ Rennes 1, INSERM U642, Lab Traitement Signal Image, F-35042 Rennes, France
[2] France Telecom R&D, TECH SSTP MOV, F-22307 Lannion, France
DOI
10.1155/2008/185248
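Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809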
Abstract
We propose an objective method to assess speech quality in the conversational context by taking into account the talking and listening speech qualities and the impact of delay. This approach is applied to the results of four subjective tests on the effects of echo, delay, packet loss, and noise. The dataset is divided into training and validation sets. For the training set, a multiple linear regression is applied to determine a relationship between conversational, talking, and listening speech qualities and the delay value. The multiple linear regression leads to an accurate estimation of the conversational scores, with high correlation and low error between subjective and estimated scores on both the training and validation sets. In addition, a validation is performed on the data of a subjective test found in the literature, which confirms the reliability of the regression. The relationship is then applied at an objective level by replacing the talking and listening subjective scores with talking and listening objective scores provided by existing objective models, fed by speech signals recorded during the subjective tests. The conversational model achieves high performance, as revealed by comparison with the test results and with the existing standard methodology, the "E-model", presented in ITU-T (International Telecommunication Union) Recommendation G.107. Copyright (c) 2008 Marie Gueguin et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
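The regression step described in the abstract can be illustrated with a minimal sketch. It assumes a plain linear form (conversational score as an intercept plus weighted talking score, listening score, and delay) fitted by ordinary least squares. The function names, the synthetic scores, and the coefficients in `true_coeffs` are hypothetical and invented for illustration; they are not the paper's data or fitted values, and the paper's exact model form may differ.

```python
import numpy as np


def design_matrix(talking, listening, delay_ms):
    """Stack an intercept column with the three predictors."""
    return np.column_stack([np.ones(len(talking)), talking, listening, delay_ms])


def fit_conversational_model(talking, listening, delay_ms, conversational):
    """Ordinary least-squares fit of the assumed linear form:
    conversational = b0 + b1*talking + b2*listening + b3*delay_ms."""
    X = design_matrix(talking, listening, delay_ms)
    coeffs, *_ = np.linalg.lstsq(X, conversational, rcond=None)
    return coeffs


def predict(coeffs, talking, listening, delay_ms):
    """Estimate conversational scores from fitted coefficients."""
    return design_matrix(talking, listening, delay_ms) @ coeffs


def correlation_and_rmse(observed, estimated):
    """Pearson correlation and root-mean-square error between two score sets."""
    r = np.corrcoef(observed, estimated)[0, 1]
    rmse = np.sqrt(np.mean((observed - estimated) ** 2))
    return r, rmse


# Synthetic stand-in data (NOT from the paper): scores on a 1-5 scale,
# one-way delays in milliseconds, and made-up generating coefficients.
rng = np.random.default_rng(0)
n = 60
talking = rng.uniform(1.0, 5.0, n)
listening = rng.uniform(1.0, 5.0, n)
delay_ms = rng.choice([0.0, 200.0, 400.0, 600.0], n)
true_coeffs = np.array([0.5, 0.4, 0.5, -0.001])  # hypothetical values
conversational = predict(true_coeffs, talking, listening, delay_ms)
conversational = conversational + rng.normal(0.0, 0.1, n)

# Split into training and validation sets, fit on the former, score the latter.
train, valid = np.arange(0, 40), np.arange(40, n)
coeffs = fit_conversational_model(
    talking[train], listening[train], delay_ms[train], conversational[train]
)
estimated = predict(coeffs, talking[valid], listening[valid], delay_ms[valid])
r, rmse = correlation_and_rmse(conversational[valid], estimated)
print(f"validation correlation = {r:.3f}, RMSE = {rmse:.3f}")
```

Moving to the objective level, as the abstract describes, would amount to calling `predict` with talking and listening scores produced by objective models instead of subjective ratings, keeping the fitted coefficients unchanged.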
Pages: 15
Related papers
50 in total
  • [31] ECHOIC CONTROL IN CONVERSATIONAL SPEECH
    BOE, R
    WINOKUR, S
    JOURNAL OF GENERAL PSYCHOLOGY, 1978, 99 (02): : 299 - 304
  • [32] Hybridizing Conversational and Clear Speech
    Kusumoto, Akiko
    Kain, Alexander B.
    Hosom, John-Paul
    van Santen, Jan R. H.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 161 - 164
  • [33] Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech
    Tobin, Jimmy
    Nelson, Phillip
    MacDonald, Bob
    Heywood, Rus
    Cave, Richard
    Seaver, Katie
    Desjardins, Antoine
    Jiang, Pan-Pan
    Green, Jordan R.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2024, 67 (11): : 4176 - 4185
  • [34] Recognizing disfluencies in conversational speech
    Lease, Matthew
    Johnson, Mark
    Charniak, Eugene
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1566 - 1573
  • [35] USE OF PROFANITY IN CONVERSATIONAL SPEECH
    NERBONNE, GP
    HIPSKIND, NM
    JOURNAL OF COMMUNICATION DISORDERS, 1972, 5 (01) : 47 - 50
  • [36] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
    CHAPMAN, WD
    BEETLE, DH
    MECHANICAL ENGINEERING, 1971, 93 (07) : 45 - &
  • [37] Modeling disfluencies in conversational speech
    Siu, M
    Ostendorf, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 386 - 389
  • [38] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
    CHAPMAN, WD
    BEETLE, DH
    DESIGN NEWS, 1971, 26 (07) : 125 - &
  • [39] PERIODIC RHYTHMS IN CONVERSATIONAL SPEECH
    WARNER, RM
    LANGUAGE AND SPEECH, 1979, 22 (OCT-) : 381 - 396
  • [40] DESIGNING A CONVERSATIONAL SPEECH INTERFACE
    YOUNG, SJ
    IEE PROCEEDINGS-E COMPUTERS AND DIGITAL TECHNIQUES, 1986, 133 (06): : 305 - 311