Speech Recognition System for Embedded Real-time Applications

Cited by: 7
Authors
Cheng, Octavian [1]
Abdulla, Waleed [1]
Salcic, Zoran [1]
Affiliations
[1] Univ Auckland, Dept Elect & Comp Engn, Auckland 1, New Zealand
Source
2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009) | 2009
Keywords
Speech recognition; Embedded application; Real-time system; Softcore processor; Field Programmable Gate Arrays;
DOI
10.1109/ISSPIT.2009.5407487
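Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract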
In this paper, a hardware/software co-processing speech recognizer for embedded applications is proposed. The system mainly consists of a softcore processor and a hardware accelerator. The accelerator is responsible for GMM emission probability calculation, which is the major computational bottleneck. To alleviate the memory bandwidth issue, the hardware accelerator uses double-buffering, which allows data retrieval and GMM computation to run in parallel. The proposed accelerator is synthesized on an Altera Stratix II FPGA device together with a Nios II softcore processor running at 100 MHz. The proposed system is compared with a pure software-based system using test utterances from the Resource Management (RM1) corpus. For a speech utterance of length 2.49 s, the decoding time is reduced from 6.64 s to 2.48 s, and the real-time factor improves from 2.67 to 1.00. The word accuracy rate of the proposed system on the RM corpus is 93.42%.
Pages: 118-122
Number of pages: 5
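
Illustrative sketch (not from the paper): the real-time factors quoted in the abstract follow from dividing decoding time by utterance length, and the quantity the accelerator computes is a GMM emission probability. The Python below reproduces that arithmetic and shows one common form of the GMM computation. The function names are hypothetical, and the diagonal-covariance form is an assumption, since the abstract does not state which covariance structure the recognizer uses.

```python
import math

def real_time_factor(decode_seconds, audio_seconds):
    """Decoding time divided by utterance duration; about 1.0 or below keeps pace with the input."""
    return decode_seconds / audio_seconds

def diag_gmm_log_likelihood(x, weights, means, variances):
    """Log emission probability of feature vector x under a GMM.

    Assumption: diagonal covariances, so each mixture component factorizes
    over the feature dimensions.
    """
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        # Log of the Gaussian normalization constant for this component.
        log_norm = -0.5 * sum(math.log(2.0 * math.pi * v) for v in var)
        # Scaled squared distance between x and the component mean.
        log_exp = -0.5 * sum((xi - m) ** 2 / v for xi, m, v in zip(x, mu, var))
        log_terms.append(math.log(w) + log_norm + log_exp)
    # Log-sum-exp over components, shifted by the maximum for numerical stability.
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))

# Figures taken from the abstract: 2.49 s utterance, 6.64 s vs 2.48 s decoding time.
rtf_software = real_time_factor(6.64, 2.49)
rtf_accelerated = real_time_factor(2.48, 2.49)
print(f"{rtf_software:.2f} {rtf_accelerated:.2f}")  # prints "2.67 1.00"
```

The per-component loop is the part that scales with the number of mixtures and feature dimensions, which is consistent with the abstract's description of this calculation as the main computational bottleneck.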