Using pitch as prior knowledge in template-based speech recognition

Cited by: 0
Authors
Aradilla, Guillermo [1]
Vepa, Jithendra [1]
Bourlard, Herve [1]
Affiliations
[1] IDIAP Res Inst, Lausanne, Switzerland
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
In a previous paper on speech recognition, we showed that templates can capture the dynamics of the speech signal better than parametric models such as hidden Markov models. The key step in template-matching approaches is finding the templates most similar to the test utterance. Traditionally, this selection is driven by a distortion measure on the acoustic features. In this work, we propose to improve template selection by using meta-linguistic information as prior knowledge. Similarity is then based not only on acoustic features but also on other sources of information present in the speech signal. Results on a continuous digit recognition task support the claim that similarity between words does not depend on acoustic features alone: we obtained a 24% relative improvement over the baseline. Interestingly, the results are also better than those of a system that uses no prior information but a larger number of templates.
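The abstract does not say how the prior knowledge enters the template selection, so the following is only a minimal sketch under stated assumptions, not the authors' method. It ranks templates by a length-normalised dynamic time warping (DTW) distortion on the acoustic features and adds a hypothetical pitch-mismatch penalty. The function names, the penalty `|log(f0_test / f0_template)|`, the `pitch_weight` parameter, and the toy data are all illustrative assumptions.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distortion(test, template):
    """Length-normalised DTW distortion between two feature sequences."""
    n, m = len(test), len(template)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of test[:i] with template[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(test[i - 1], template[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # skip a test frame
                                 cost[i][j - 1],      # skip a template frame
                                 cost[i - 1][j - 1])  # align both frames
    return cost[n][m] / (n + m)


def select_templates(test_feats, test_pitch, templates, k=1, pitch_weight=0.0):
    """Return the k best (score, label) pairs, lowest score first.

    templates: list of (label, features, mean_pitch_hz) tuples.
    ASSUMPTION: the prior is a log-ratio pitch penalty scaled by
    pitch_weight; the source does not specify the actual formulation.
    """
    scored = []
    for label, feats, pitch in templates:
        acoustic = dtw_distortion(test_feats, feats)
        penalty = abs(math.log(test_pitch / pitch))
        scored.append((acoustic + pitch_weight * penalty, label))
    scored.sort()
    return scored[:k]


# Toy data (hypothetical): one-dimensional features, mean pitch in Hz.
templates = [
    ("low_pitch_template", [[0.0], [1.0], [2.0]], 110.0),
    ("high_pitch_template", [[0.0], [1.1], [2.0]], 200.0),
]
test_feats, test_pitch = [[0.0], [1.0], [2.0]], 210.0

acoustic_only = select_templates(test_feats, test_pitch, templates)
with_prior = select_templates(test_feats, test_pitch, templates,
                              pitch_weight=1.0)
print(acoustic_only[0][1], with_prior[0][1])
```

In this toy setup the acoustic-only ranking picks the template with the closest features, while the pitch-weighted ranking prefers the template whose mean pitch is closer to that of the test utterance. This illustrates how similarity can depend on more than the acoustic distortion.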
Pages: 445-448
Number of pages: 4