EVALUATION OF MIMICKED SPEECH USING PROSODIC FEATURES

Cited by: 0
Authors
Mary, Leena [1]
Babu, Anish K. K. [1]
Joseph, Aju [1]
George, Gibin M. [1]
Affiliation
[1] Rajiv Gandhi Inst Technol, Kottayam 686501, Kerala, India
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
Prosody; intonation; mimicked speech; Legendre coefficients; dynamic time warping; LANGUAGE;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
In this paper, we describe a technique for evaluating the quality of mimicked speech; that is, mimicry artists are evaluated on their ability to mimic a particular person. The evaluation is based on prosodic characteristics in the text-dependent case. Prosodic characteristics are represented using features derived from the pitch contour, duration, and energy. Prosodic features are extracted from speech after it is automatically segmented into intonational phrases. The pitch contour of each phrase is approximated by a weighted sum of Legendre polynomials. The prosodic feature set includes the weights of the first four Legendre polynomials (w(0k), w(1k), w(2k), w(3k)), average jitter, average shimmer, voiced duration, total duration, and change in energy of each intonational phrase. The effectiveness of the technique is demonstrated using a text-dependent database of mimicked speech. Evaluation is done by dynamic time warping of prosodic features derived from the mimicked speech and the original speech. The scores obtained from this evaluation are compared with the results of manual perception (listening) tests, and the comparison indicates that the proposed technique is effective.
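The two core steps named in the abstract, approximating a phrase's pitch contour with Legendre polynomial weights and comparing feature sequences by dynamic time warping, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names `legendre_weights` and `dtw_distance`, the mapping of each phrase onto the interval [-1, 1], the least-squares fit, and the Euclidean local distance are all assumptions made for illustration; the abstract does not specify them. Jitter, shimmer, duration, and energy features are not computed here.

```python
import numpy as np
from numpy.polynomial import legendre


def legendre_weights(pitch, degree=3):
    """Least-squares weights of Legendre polynomials P0..P_degree.

    Assumption: the phrase is sampled uniformly and its time axis is
    mapped onto [-1, 1], the interval on which Legendre polynomials
    are orthogonal. degree=3 gives the first four weights.
    """
    pitch = np.asarray(pitch, dtype=float)
    t = np.linspace(-1.0, 1.0, len(pitch))
    return legendre.legfit(t, pitch, degree)


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    a and b are 2-D arrays with one feature vector per row, for example
    one row per intonational phrase. Assumption: Euclidean local
    distance and the basic step pattern (insertion, deletion, match).
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = local + min(
                cost[i - 1, j],      # skip a frame of b
                cost[i, j - 1],      # skip a frame of a
                cost[i - 1, j - 1],  # align the two frames
            )
    return cost[n, m]


if __name__ == "__main__":
    # Hypothetical pitch contours (Hz) for two phrases of each speaker.
    original = [np.linspace(180, 140, 60), np.linspace(150, 170, 45)]
    mimicked = [np.linspace(175, 150, 50), np.linspace(155, 165, 40)]
    feats_orig = np.array([legendre_weights(p) for p in original])
    feats_mimic = np.array([legendre_weights(p) for p in mimicked])
    print("DTW cost:", dtw_distance(feats_orig, feats_mimic))
```

Under these assumptions a lower DTW cost means the mimicked prosody tracks the original more closely. In practice the features would probably need normalization before comparison, since pitch weights, durations, and energy changes have different scales.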
Pages: 7189-7193
Page count: 5
Related papers
50 in total
  • [21] Expressive Speech Synthesis using Prosodic Modification for Marathi Language
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 126 - 130
  • [22] Prosodic analysis of child speech
    Panagos, JM
    Prelock, PA
    TOPICS IN LANGUAGE DISORDERS, 1997, 17 (04) : 1 - 10
  • [23] Prosodic Contrasts in Ironic Speech
    Bryant, Gregory A.
    DISCOURSE PROCESSES, 2010, 47 (07) : 545 - 566
  • [24] Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist
    Bone, Daniel
    Black, Matthew P.
    Lee, Chi-Chun
    Williams, Marian E.
    Levitt, Pat
    Lee, Sungbok
    Narayanan, Shrikanth
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1042 - 1045
  • [25] Evaluation of prosodic and voice quality features on automatic extraction of paralinguistic information
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    Hagita, Norihiro
    2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 374 - +
  • [26] Prosodic Temporal Alignment of Co-speech Gestures to Speech Facilitates Referent Resolution
    Jesse, Alexandra
    Johnson, Elizabeth K.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2012, 38 (06) : 1567 - 1581
  • [27] Towards automatic detection of reported speech in dialogue using prosodic cues
    Cervone, Alessandra
    Lai, Catherine
    Pareti, Silvia
    Bell, Peter
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3061 - 3065
  • [28] Reliable Detection of Important Word Boundaries Using Prosodic Features
    Kaufhold, Caroline
    Stemmer, Georg
    Noeth, Elmar
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 259 - 267
  • [29] Analysis of acoustic-prosodic features related to paralinguistic information carried by interjections in dialogue speech
    Ishi, Carlos T.
    Ishiguro, Hiroshi
    Hagita, Norihiro
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3140 - +
  • [30] Language Classification Using Prosodic Features: Comparing Intensity and Pitch
    Zulu, Peleira Nicholas
    2013 Pan African International Conference on Information Science, Computing and Telecommunications (PACT), 2013, : 116 - 121