Speech Recognition System for Embedded Real-time Applications

被引：7

作者：

Cheng, Octavian ^{[1
]}

Abdulla, Waleed ^{[1
]}

Salcic, Zoran ^{[1
]}

机构：

[1] Univ Auckland, Dept Elect & Comp Engn, Auckland 1, New Zealand

来源：

2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009) | 2009年

关键词：

Speech recognition; Embedded application; Real-time system; Softcore processor; Field Programmable Gate Arrays;

D O I：

10.1109/ISSPIT.2009.5407487

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper a hardware/software co-processing speech recognizer for embedded applications is proposed. The system mainly consists of a softcore processor and a hardware accelerator The accelerator is responsible for GMM emission probability calculation, which is the major computational bottleneck. To alleviate the memory bandwidth issue, the hardware accelerator uses double-buffering, which allows parallel operation of data retrieval and GMM computation. The proposed accelerator is synthesized on an Altera Stratix II FPGA device together with a Nios II softcore processor running at 100MHz. The proposed system is compared with a pure software-based system using test utterances from the Resource Management (RM1) corpus. For a speech utterance length of 2.49s, the decoding time reduces from 6.64s to 2.48s. The real-time factor improves from 2.67 to 1.00. The word accuracy rate of the proposed system on the RM corpus is 93.42%.

引用

页码：118 / 122

页数：5

共 50 条

[1] Hardware-Software Codesign of Automatic Speech Recognition System for Embedded Real-Time Applications
Cheng, Octavian
Abdulla, Waleed
Salcic, Zoran
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2011, 58 (03) : 850 - 859
[2] Design and Evaluation of a Real-Time Speech Recognition System
Shruthi, S.
Yashaswi, G.
Shruti, V
Manikandan, J.
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 425 - 430
[3] NOVEL CI-BACKOFF SCHEME FOR REAL-TIME EMBEDDED SPEECH RECOGNITION
Ma, Tao
Deisher, Michael
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1614 - 1617
[4] Design of a Real-Time Speech Recognition System using CNN for Consumer Electronics
Pavan, G. S.
Kumar, Nikhil
Karthik, Krishna N.
Manikandan, J.
2020 ZOOMING INNOVATION IN CONSUMER TECHNOLOGIES CONFERENCE (ZINC), 2020, : 5 - 10
[5] REAL-TIME SPEECH RECOGNITION CAPTIONING OF EVENTS AND MEETINGS
Boulianne, Gilles
Boisvert, Maryse
Osterrath, Frederic
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 197 - 200
[6] Analysis of Embedded Real-Time System Security
Ma Jingjing
ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT II, 2011, 215 : 429 - 433
[7] Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation
Lyu, Ke-Ming
Lyu, Ren-yuan
Chang, Hsien-Tsung
PEERJ COMPUTER SCIENCE, 2024, 10
[8] Real-time speech synthesis system driven by visual speech
Li, G
Xie, GM
Lin, L
PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 2, 2004, : 397 - 402
[9] MULTI-USER REAL-TIME SPEECH RECOGNITION WITH A GPU
Kim, Jungsuk
Sung, Wonyong
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1617 - 1620
[10] Real-Time Photonic Deep Reservoir Computing for Speech Recognition
Picco, Enrico
Massar, Serge
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,

← 1 2 3 4 5 →