Novel Applications of Complexity Inspired RDT Transform for Low Complexity Embedded Speech Recognition in Automotive Environments

被引:4
作者
Bucurica, Mihai [1 ]
Dogaru, Ioana [2 ]
Dogaru, Radu [2 ]
机构
[1] Univ Politehn Bucuresti, Doctoral Sch Elect Telecommun & Informat Technol, Bucharest, Romania
[2] Univ Politehn Bucuresti, Nat Comp Lab, Dept Appl Elect & Informat Engn, Bucharest, Romania
来源
2017 21ST INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS) | 2017年
关键词
speech processing; nonlinear signal processing; radial basis neural networks; complex nonlinear networks; support vector classifiers; CLASSIFICATION;
D O I
10.1109/CSCS.2017.58
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Embedded dictation, i.e. recognizing vocal commands in noisy environments, with good accuracy and using low complexity implementations is a desirable task with many applications. Such applications include automotive infotainment solutions particularly when no connectivity is available, personal assistants including embedded dictation solutions for disabled people, and so on. This paper reports our novel results in applying a nonlinear transform (RD-transform) introduced in a previous work and inspired form complexity measurements of signals generated in cellular automaton. Such a transform is compact and has a low computational complexity yet it previously proved quite efficient in terms of accuracy for a standard task of recognizing user independent dictation of digits. Herein, we report results on employing RD-transform on a specially designed sound database containing commands for the non-critical automotive equipment in a realistic, noisy environment. In addition to specific nonlinear transforms, low complexity FSVC classifiers were employed proving that good accuracies can be achieved using a very convenient implementation solution.
引用
收藏
页码:375 / 378
页数:4
相关论文
共 12 条
  • [1] Bucurica M., 2015, 2015 7 INT C EL COMP
  • [2] Robust Mandarin speech recognition in car environments for embedded navigation system
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 584 - 590
  • [3] Dogaru R., 1996, Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems. MicroNeuro'96, P265, DOI 10.1109/MNNFS.1996.493801
  • [4] Dogaru R., 2008, SYSTEMATIC DESIGN EM
  • [5] Fast and efficient speech signal classification with a novel nonlinear transform
    Dogaru, Radu
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGY CONVERGENCE, PROCEEDINGS, 2007, : 43 - 47
  • [6] Dogaru R, 2016, INT CONF COMM, P373, DOI 10.1109/ICComm.2016.7528302
  • [7] Enache M., 2015, 2015 E HLTH BIOENG C, P1
  • [8] Evangelopoulos G. N., 2016, THESIS
  • [9] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    Graves, A
    Schmidhuber, J
    [J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
  • [10] Hawley M., 2003, ASSIST TECHNOL, P882