EVALUATION OF MIMICKED SPEECH USING PROSODIC FEATURES

被引:0
作者
Mary, Leena [1 ]
Babu, Anish K. K. [1 ]
Joseph, Aju [1 ]
George, Gibin M. [1 ]
机构
[1] Rajiv Gandhi Inst Technol, Kottayam 686501, Kerala, India
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Prosody; intonation; mimicked speech; legendre coefficients; dynamic time warping; LANGUAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe a technique for evaluating the quality of mimicked speech. In other words, mimicry artists are evaluated based on their competences to mimic a particular person. This evaluation is done based on prosodic characteristics for the text dependent cases. Prosodic characteristics are represented using features derived from pitch contour, duration and energy. In this work, prosodic features are extracted from speech after automatically segmenting into intonational phrases. Pitch contour corresponding to each phrase is approximated using weighted sum of legendre polynomials. Prosodic feature set includes weights of first four legendre polynomials (w(0k), w(1k), w(2k), w(3k)), average jitter, average shimmer, voiced duration, total duration and change in energy of each intonation phrase. The effectiveness of the technique is demonstrated using a text dependent database of mimicked speeches. Evaluation is done by dynamic time warping of prosodic features derived from the mimicked speech and the original speech. The scores obtained from this evaluation is compared with the results of manual perception/listening tests, which clearly indicate the effectiveness of the proposed technique.
引用
收藏
页码:7189 / 7193
页数:5
相关论文
共 50 条
  • [41] Progress to a VOCA with Prosodic Synthesised Speech
    Wuelfing, Jan-Oliver Y.
    Andre, Elisabeth
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT I, 2018, 10896 : 539 - 546
  • [42] SYLLABLE-BASED PROSODIC ANALYSIS OF AMHARIC READ SPEECH
    Jokisch, Oliver
    Birhanu, Yitagessu
    Hoffmann, Ruediger
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 258 - 262
  • [43] Prosodic features of stances in conversation
    Freeman, Valerie
    LABORATORY PHONOLOGY, 2019, 10 (01):
  • [44] Extraction and representation of prosodic features for language and speaker recognition
    Mary, Leena
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2008, 50 (10) : 782 - 796
  • [45] Prosodic Features for Speaker Verification
    Mary, Leena
    Yegnanarayana, B.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
  • [46] AUTOMATIC FLUENCY EVALUATION OF SPONTANEOUS SPEECH USING DISFLUENCY-BASED FEATURES
    Deng, Huaijin
    Lin, Youchao
    Utsuro, Takehito
    Kobayashi, Akio
    Nishizaki, Hiromitsu
    Hoshino, Junichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 9239 - 9243
  • [47] Prosodic Features' Criterion for Hebrew
    Fishman, Ben
    Lapidot, Itshak
    Opher, Irit
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 482 - 491
  • [48] PROSODIC FEATURES IN SPANISH AUDIO DESCRIPTIONS OF THE VIW CORPUS
    Machuca, Maria J.
    Matamala, Anna
    Rios, Antonio
    MONTI, 2020, 12 : 53 - 77
  • [49] End-of-Utterance Prediction by Prosodic Features and Phrase-Dependency Structure in Spontaneous Japanese Speech
    Ishimoto, Yuichi
    Teraoka, Takehiro
    Enomoto, Mika
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1681 - 1685
  • [50] Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence
    Ananthakrishnan, Sankaranarayanan
    Ghosh, Prasanta
    Narayanan, Shrikanth
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5005 - 5008