On the evaluation of the conversational speech quality in telecommunications

被引：20

作者：

Gueguin, Marie ^{[1
]}

Le Bouquin-Jeannes, Regine ^{[1
]}

Gautier-Turbin, Valerie ^{[2
]}

Faucon, Gerard ^{[1
]}

Barriac, Vincent ^{[2
]}

机构：

[1] Univ Rennes 1, INSERM U642, Lab Traitment Signal Image, F-35042 Rennes, France

[2] France Telecom R&D, TECH SSTP MOV, F-22307 Lannion, France

来源：

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2008年 / 2008卷 / 1期

关键词：

D O I：

10.1155/2008/185248

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We propose an objective method to assess speech quality in the conversational context by taking into account the talking and listening speech qualities and the impact of delay. This approach is applied to the results of four subjective tests on the effects of echo, delay, packet loss, and noise. The dataset is divided into training and validation sets. For the training set, a multiple linear regression is applied to determine a relationship between conversational, talking, and listening speech qualities and the delay value. The multiple linear regression leads to an accurate estimation of the conversational scores with high correlation and low error between subjective and estimated scores, both on the training and validation sets. In addition, a validation is performed on the data of a subjective test found in the literature which confirms the reliability of the regression. The relationship is then applied to an objective level by replacing talking and listening subjective scores with talking and listening objective scores provided by existing objective models, fed by speech signals recorded during the subjective tests. The conversational model achieves high performance as revealed by comparison with the test results and with the existing standard methodology "E-model," presented in the ITU-T (International Telecommunication Union) Recommendation G. 107. Copyright (c) 2008 Marie Gueguin et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

引用

页数：15

共 50 条

[31] ECHOIC CONTROL IN CONVERSATIONAL SPEECH
BOE, R
WINOKUR, S
JOURNAL OF GENERAL PSYCHOLOGY, 1978, 99 (02): : 299 - 304
[32] Hybridizing Conversational and Clear Speech
Kusumoto, Akiko
Kain, Alexander B.
Hosom, John-Paul
van Santen, Jan R. H.
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 161 - 164
[33] Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech
Tobin, Jimmy
Nelson, Phillip
MacDonald, Bob
Heywood, Rus
Cave, Richard
Seaver, Katie
Desjardins, Antoine
Jiang, Pan-Pan
Green, Jordan R.
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2024, 67 (11): : 4176 - 4185
[34] Recognizing disfluencies in conversational speech
Lease, Matthew
Johnson, Mark
Charniak, Eugene
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1566 - 1573
[35] USE OF PROFANITY IN CONVERSATIONAL SPEECH
NERBONNE, GP
HIPSKIND, NM
JOURNAL OF COMMUNICATION DISORDERS, 1972, 5 (01) : 47 - 50
[36] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
CHAPMAN, WD
BEETLE, DH
MECHANICAL ENGINEERING, 1971, 93 (07) : 45 - &
[37] Modeling disfluencies in conversational speech
Siu, M
Ostendorf, M
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 386 - 389
[38] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
CHAPMAN, WD
BEETLE, DH
DESIGN NEWS, 1971, 26 (07) : 125 - &
[39] PERIODIC RHYTHMS IN CONVERSATIONAL SPEECH
WARNER, RM
LANGUAGE AND SPEECH, 1979, 22 (OCT-) : 381 - 396
[40] DESIGNING A CONVERSATIONAL SPEECH INTERFACE
YOUNG, SJ
IEE PROCEEDINGS-E COMPUTERS AND DIGITAL TECHNIQUES, 1986, 133 (06): : 305 - 311

← 1 2 3 4 5 →