Thinking about the present and future of the complex speech recognition

Cited by: 0
Author
Vicsi, Klara [1]
Affiliation
[1] Budapest Univ Technol & Econ, Dept Telecommun & Mediainformat, Lab Speech Acoust, Budapest, Hungary
Source
3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012) | 2012
Keywords
component; speech recognition; speech to text transformation system; multi-modal speech processing; multi-stream modelling; FEATURES;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A critical point of most cognitive info-communication systems is the state of development of speech recognition technology. The paper gives a short introduction to the principles of current speech recognition technology. It highlights the fact that the systems on the market are only speech-to-text transformers that output only a word chain; speech prosody, speech emotion, speech style, and other information are not taken into account. Many uncertainties exist in these operational systems. Some up-to-date research tendencies, mainly parallel processing, are introduced that aim to increase the efficiency of recognition. Finally, the research agenda of META NET for Multilingual Europe in 2020 is briefly introduced.
Pages: 371 - 376
Page count: 6
Related papers
50 in total
  • [21] English speech emotion recognition method based on speech recognition
    Liu, Man
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (2) : 391 - 398
  • [22] Speech Recognition and Correction of a Stuttered Speech
    Dash, Ankit
    Subramani, Nikhil
    Manjunath, Tejas
    Yaragarala, Vishruti
    Tripathi, Shikha
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1757 - 1760
  • [23] SPEECH RECOGNITION WITH AUGMENTED SYNTHESIZED SPEECH
    Rosenberg, Andrew
    Zhang, Yu
    Ramabhadran, Bhuvana
    Jia, Ye
    Moreno, Pedro
    Wu, Yonghui
    Wu, Zelin
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 996 - 1002
  • [24] Research on Emergency Parking Instruction Recognition Based on Speech Recognition and Speech Emotion Recognition
    Tian Kexin
    Huang Yongming
    Zhang Guobao
    Zhang Lin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2933 - 2937
  • [25] Learning Speech Rate in Speech Recognition
    Zeng, Xiangyu
    Yin, Shi
    Wang, Dong
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 528 - 532
  • [26] On the Adaptation of Foreign Language Speech Recognition Engines for Lithuanian Speech Recognition
    Rudzionis, Vytautas
    Maskeliunas, Rytis
    Rudzionis, Algimantas
    Ratkevicius, Kastytis
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, 2009, 37 : 113 - +
  • [27] A Comprehensive Review of Speech Emotion Recognition Systems
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Kartiwi, Mira
    Ambikairajah, Eliathamby
    IEEE ACCESS, 2021, 9 : 47795 - 47814
  • [28] A Survey of Multilingual Models for Automatic Speech Recognition
    Yadav, Hemant
    Sitaram, Sunayana
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5071 - 5079
  • [29] Automatic Speech Recognition: Systematic Literature Review
    Alharbi, Sadeen
    Alrazgan, Muna
    Alrashed, Alanoud
    Alnomasi, Turkiayh
    Almojel, Raghad
    Alharbi, Rimah
    Alharbi, Saja
    Alturki, Sahar
    Alshehri, Fatimah
    Almojil, Maha
    IEEE ACCESS, 2021, 9 : 131858 - 131876
  • [30] English Speech Recognition Based on Artificial Intelligence
    Bai, Tana
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03) : 2259 - 2263