Acoustic characteristics of disguised speech: speaker strategies and listener error patterns

Cited by: 1
Authors
Smith, Allan B. [1]
Mason, Nealy [1]
Browne, Molly E. [1]
Affiliation
[1] Univ New Hampshire, Durham, NH 03824 USA
Keywords
SPEAKER IDENTIFICATION; VOICE IDENTIFICATION; ACOUSTIC CUES ROBUST TO DISGUISE; DISGUISED SPEECH; IDENTIFICATION; VOICES
DOI
10.1558/ijsll.38372
Chinese Library Classification (CLC) number
DF [Law]; D9 [Law]
Discipline classification code
0301
Abstract
A group of 13 participants were recorded in two conditions: 1) speaking normally and 2) altering speech to conceal their identity (i.e., disguised speech). Participants were not instructed how to disguise their speech because we were interested in which changes they would choose. A group of inexperienced listeners were largely inaccurate in matching participants' disguised speech to their normal speech. The largest changes between normal and disguised speech were in speaking rate, the first formant, fundamental frequency, and intensity. When listeners made correct matches, the pairs were similar in speaking rate and intensity, as shown by significant correlations. Incorrectly matched pairs were not significantly correlated, suggesting that listeners were not making good use of acoustic cues during those decisions. Overall, the third formant (F3) and speaking rate appeared to be useful acoustic indicators of identity when matching normal and disguised speech samples. Of those two variables, F3 was apparently underutilised by listeners. The implications for what spontaneous speakers do to disguise their speech and what naive listeners attend to when identifying disguised voice are discussed.
Pages: 85-95
Page count: 11