Turbo Processing for Speech Recognition

被引:2
|
作者
Moon, Todd K. [1 ,2 ]
Gunther, Jacob H. [1 ,2 ]
Broadus, Cortnie [3 ]
Hou, Wendy [4 ]
Nelson, Nils [3 ]
机构
[1] Utah State Univ, Informat Dynam Lab, Logan, UT 84322 USA
[2] Utah State Univ, Dept Elect & Comp Engn, Logan, UT 84322 USA
[3] Utah State Univ, Dept Math, Logan, UT 84322 USA
[4] Yale Univ, Dept Math, New Haven, CT 06511 USA
关键词
Human-machine interface; speech processing; turbo processing; HIDDEN MARKOV-MODELS; MAXIMIZATION;
D O I
10.1109/TCYB.2013.2247593
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech recognition is a classic example of a human/machine interface, typifying many of the difficulties and opportunities of human/machine interaction. In this paper, speech recognition is used as an example of applying turbo processing principles to the general problem of human/machine interface. Speech recognizers frequently involve a model representing phonemic information at a local level, followed by a language model representing information at a nonlocal level. This structure is analogous to the local (e. g., equalizer) and nonlocal (e. g., error correction decoding) elements common in digital communications. Drawing from the analogy of turbo processing for digital communications, turbo speech processing iteratively feeds back the output of the language model to be used as prior probabilities for the phonemic model. This analogy is developed here, and the performance of this turbo model is characterized by using an artificial language model. Using turbo processing, the relative error rate improves significantly, especially in high-noise settings.
引用
收藏
页码:83 / 91
页数:9
相关论文
共 50 条
  • [1] Turbo Automatic Speech Recognition
    Receveur, Simon
    Weiss, Robin
    Fingscheidt, Tim
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 846 - 862
  • [2] A COMPACT FORMULATION OF TURBO AUDIO-VISUAL SPEECH RECOGNITION
    Receveur, Simon
    Meyer, Patrick
    Fingscheidt, Tim
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Makhraj Recognition Using Speech Processing
    Wahidah, A. N.
    Suriazalmi, M. S.
    Niza, M. L.
    Rosyati, H.
    Faradila, N.
    Hasan, A.
    Rohana, A. K.
    Farizan, Z. N.
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 689 - 693
  • [4] Speech Recognition Based on Open Source Speech Processing Software
    Klosowski, Piotr
    Dustor, Adam
    Izydorczyk, Jacek
    Kotas, Jan
    Slimok, Jacek
    COMPUTER NETWORKS, CN 2014, 2014, 431 : 308 - 317
  • [5] Speech activity detection and automatic prosodic processing unit segmentation for emotion recognition
    Sztaho, David
    Vicsi, Klara
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2014, 8 (04): : 315 - 324
  • [6] Speech recognition methods applied to biomedical signals processing
    Novák, D
    Cuesta-Frau, D
    Ani, TAI
    Aboy, M
    Mico, P
    Lhotská, L
    PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 118 - 121
  • [7] Speech recognition in a dialog system: from conventional to deep processing
    Becerra, Aldonso
    Ismael de la Rosa, J.
    Gonzalez, Efren
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (12) : 15875 - 15911
  • [8] A Study on Speech Processing
    Rekha, J. Ujwala
    Chatrapati, K. Shahu
    Babu, A. Vinaya
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 3, INDIA 2016, 2016, 435 : 209 - 226
  • [9] Realization of Speech Processing and Recognition System Based on Digital Signal Processor
    Wei Yanna
    Jin Yongtao
    Qian Wenguang
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1087 - 1090
  • [10] TURBO ITERATIVE SIGNAL PROCESSING
    Sun, Hong
    Maitre, Henri
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 495 - +