Basic parameters in speech processing -: The need for evaluation

被引:0
|
作者
Hoege, Harald [1 ]
机构
[1] Siemens AG, Corp Technol, D-81739 Munich, Germany
关键词
prosodic parameters; VAD; strength of Lombard effect; evaluation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
As basic parameters in speech processing we regard pitch, duration, intensity, voice quality, signal to noise ratio, voice activity detection and strength of Lombard effect. Taking in account also adverse conditions the performance of many published algorithms to extract those parameters from the speech signal automatically is not known. A framework based on competitive evaluation is proposed to push algorithmic research and to make progress comparable.
引用
收藏
页码:67 / 74
页数:8
相关论文
共 50 条
  • [1] The need to develop guidelines for the evaluation of medical image processing procedures
    Buvat, I
    Chameroy, V
    Aubry, F
    Pélégrini, M
    El Fakhri, G
    Huguenin, C
    Benali, H
    Todd-Pokropek, A
    Di Paola, R
    MEDICAL IMAGING 1999: IMAGE PROCESSING, PTS 1 AND 2, 1999, 3661 : 1466 - 1477
  • [2] The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
    Gravier, Guillaume
    Adda, Gilles
    Paulsson, Niklas
    Carre, Matthieu
    Giraudel, Aude
    Galibert, Olivier
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 114 - 118
  • [3] Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems
    Ludusan, Bogdan
    Versteegh, Maarten
    Jansen, Aren
    Gravier, Guillaume
    Cao, Xuan-Nga
    Johnson, Mark
    Dupoux, Emmanuel
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 560 - 567
  • [4] ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION
    Salesky, Elizabeth
    Maeder, Julian
    Klinger, Severin
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 733 - 740
  • [5] Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis
    Ali, Ikbel Hadj
    Mnasri, Zied
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2016, 2016, 9918 : 57 - 67
  • [6] SUPERB: Speech processing Universal PERformance Benchmark
    Yang, Shu-wen
    Chi, Po-Han
    Chuang, Yung-Sung
    Lai, Cheng-I Jeff
    Lakhotia, Kushal
    Lin, Yist Y.
    Liu, Andy T.
    Shi, Jiatong
    Chang, Xuankai
    Lin, Guan-Ting
    Huang, Tzu-Hsien
    Tseng, Wei-Cheng
    Lee, Ko-tik
    Liu, Da-Rong
    Huang, Zili
    Done, Shuyan
    Li, Shang-Wen
    Watanabe, Shinji
    Mohamed, Abdelrahman
    Lee, Hung-yi
    INTERSPEECH 2021, 2021, : 1194 - 1198
  • [7] AN EVALUATION OF THE VISUAL SPEECH APPARATUS
    ARENDS, N
    POVEL, DJ
    VANOS, E
    MICHIELSEN, S
    CLAASSEN, J
    FEITER, I
    SPEECH COMMUNICATION, 1991, 10 (04) : 405 - 414
  • [8] Specialist speech and language therapists' use and evaluation of visual speech aids
    Coventry, KR
    Clibbens, J
    Cooper, M
    EUROPEAN JOURNAL OF DISORDERS OF COMMUNICATION, 1997, 32 (03): : 315 - 323
  • [9] Speech Transcript Evaluation for Information Retrieval
    van der Werff, Laurens
    Kraaij, Wessel
    de Jong, Franciska
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1536 - +
  • [10] Cognitive factors in the evaluation of synthetic speech
    Delogu, C
    Conte, S
    Sementina, C
    SPEECH COMMUNICATION, 1998, 24 (02) : 153 - 168