Multimodal speech recognition for unmanned aerial vehicles

被引:0
|
作者
Oneață, Dan [1 ]
Cucu, Horia [1 ]
机构
[1] University POLITEHNICA of Bucharest, Romania
来源
Computers and Electrical Engineering | 2021年 / 90卷
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
106943
中图分类号
学科分类号
摘要
Speech processing - Recurrent neural networks - Visual communication - Speech recognition - Signal to noise ratio - Antennas - Visual languages
引用
收藏
相关论文
共 50 条
  • [41] Applying Generative Adversarial Networks and Vision Transformers in Speech Emotion Recognition
    Heracleous, Panikos
    Fukayama, Satoru
    Ogata, Jun
    Mohammad, Yasser
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13519 LNCS : 67 - 75
  • [42] Research on Tibetan Speech Recognition Based on CNN-DFSMN-CTC
    Northwest Normal University, Engineering Research Center of Gansu Province for Intelligent Information Technology and Application, College of Physics and Electronic Engineering, LanZhou, China
    Proc. - Asia Conf. Electr. Power Comput. Eng., EPCE, (215-219): : 215 - 219
  • [43] Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition
    Violeta, Lester Phillip
    Huang, Wen-Chin
    Toda, Tomoki
    arXiv, 2022,
  • [44] Implementation of a Pitch Enhancement Technique: Punjabi Automatic Speech Recognition (PASR)
    Sharma, Rishabh
    Kumar, Deepak
    Kukreja, Vinay
    Sachdeva, Rohit
    2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions), ICRITO 2022, 2022,
  • [45] Application of Data Mining in the Design of English TRAnslator's Speech Recognition System
    Gui, Fenglan
    Yang, Zhengmao
    Lecture Notes on Data Engineering and Communications Technologies, 2023, 169 : 418 - 425
  • [46] An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing
    Violeta, Lester Phillip
    Toda, Tomoki
    2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023, 2023, : 1862 - 1867
  • [47] Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency
    Karita, Shigeki
    Sproat, Richard
    Ishikawa, Haruko
    arXiv, 2023,
  • [48] Towards an Open-Source Dutch Speech Recognition System for the Healthcare Domain
    Tejedor-García, Cristian
    van der Molen, Berrie
    van den Heuvel, Henk
    van Hessen, Arjan
    Pieters, Toine
    2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 1032 - 1039
  • [49] Towards end-to-end training of automatic speech recognition for nigerian pidgin
    Ajisafe, Daniel
    Adegboro, Oluwabukola
    Oduntan, Esther
    Arulogun, Tayo
    arXiv, 2020,
  • [50] Multimodal Emotion Recognition among Couples from Lab Settings to Daily Life using Smartwatches
    Boateng, George Gyarteh
    arXiv, 2022,