Acoustic characteristics of disguised speech: speaker strategies and listener error patterns

Cited by: 1

Authors:

Smith, Allan B. [1]

Mason, Nealy [1]

Browne, Molly E. [1]

Affiliations:

[1] Univ New Hampshire, Durham, NH 03824 USA

Source:

INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW | 2019 / Vol. 26 / No. 01

Keywords:

SPEAKER IDENTIFICATION; VOICE IDENTIFICATION; ACOUSTIC CUES ROBUST TO DISGUISE; DISGUISED SPEECH; IDENTIFICATION; VOICES;

DOI:

10.1558/ijsll.38372

Chinese Library Classification (CLC) number:

DF [Law]; D9 [Law];

Discipline classification code:

0301 ;

Abstract:

A group of 13 participants were recorded in two conditions: 1) speaking normally and 2) altering speech to conceal their identity (i.e., disguised speech). Participants were not instructed how to disguise their speech because we were interested in which changes they would choose. A group of inexperienced listeners were largely inaccurate in matching participants' disguised speech to their normal speech. The largest changes between normal and disguised speech were in speaking rate, the first formant, fundamental frequency, and intensity. When listeners made correct matches, the pairs were similar in speaking rate and intensity, as shown by significant correlations. Incorrectly matched pairs were not significantly correlated, suggesting that listeners were not making good use of acoustic cues during those decisions. Overall, the third formant (F3) and speaking rate appeared to be useful acoustic indicators of identity when matching normal and disguised speech samples. Of those two variables, F3 was apparently underutilised by listeners. The implications for what spontaneous speakers do to disguise their speech and what naive listeners attend to when identifying disguised voice are discussed.


Pages: 85-95

Page count: 11
