A review on speech processing using machine learning paradigm

被引:23
|
作者
Bhangale, Kishor Barasu [1 ]
Mohanaprasad, K. [1 ]
机构
[1] VIT Univ, Sch Elect Engn SENSE, Chennai 600127, Tamil Nadu, India
关键词
Speech processing; Speech recognition; Machine learning; Speech feature extraction; Speech classification; Speech emotion recognition; INDEPENDENT COMPONENT ANALYSIS; SUPPORT VECTOR MACHINES; SPEAKER RECOGNITION; CLASSIFICATION; HMM; FEATURES; SHIMMER; MODELS; JITTER; ADAPTATION;
D O I
10.1007/s10772-021-09808-0
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech processing plays a crucial role in many signal processing applications, while the last decade has bought gigantic evolution based on machine learning prototype. Speech processing has a close relationship with computer linguistics, human-machine interaction, natural language processing, and psycholinguistics. This review article majorly discusses the feature extraction techniques and machine learning classifiers employed in speech processing and recognition activities. The performance of several machine learning techniques is validated for speech emotion recognition application on Berlin EmoDB database. Further, it gives the broad application areas and challenges in machine learning for speech processing.
引用
收藏
页码:367 / 388
页数:22
相关论文
共 50 条
  • [1] A review on speech processing using machine learning paradigm
    Kishor Barasu Bhangale
    K. Mohanaprasad
    International Journal of Speech Technology, 2021, 24 : 367 - 388
  • [2] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [3] Speech Technology Progress Based on New Machine Learning Paradigm
    Delic, Vlado
    Peric, Zoran
    Secujski, Milan
    Jakovuevic, Niksa
    Nikolic, Jelena
    Miskovic, Dragisa
    Simic, Nikola
    Suzic, Sinisa
    Delic, Tijana
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [4] A Review on Emotion Based Harmful Speech Detection Using Machine Learning
    Tyagi, Suryakant
    Varkonyi, Annamaria R.
    Marta, Takacs
    Szenasi, Sandor
    2022 IEEE 22ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 8TH IEEE INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCE AND ROBOTICS (CINTI-MACRO), 2022, : 17 - 23
  • [5] Speech Recognition, Machine Translation, and Speech Translation-A Unified Discriminative Learning Paradigm
    He, Xiaodong
    Deng, Li
    IEEE SIGNAL PROCESSING MAGAZINE, 2011, 28 (05) : 126 - 133
  • [6] Semantic speech analysis using machine learning and deep learning techniques: a comprehensive review
    Tyagi, Suryakant
    Szenasi, Sandor
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (29) : 73427 - 73456
  • [7] Guest Editorial: Advances in Machine Learning for Speech Processing
    Minghui Dong
    Jianhua Tao
    Man Wai Mak
    Journal of Signal Processing Systems, 2016, 82 : 137 - 140
  • [8] Guest Editorial: Advances in Machine Learning for Speech Processing
    Dong, Minghui
    Tao, Jianhua
    Mak, Man Wai
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (02): : 137 - 140
  • [9] Speech recognition using machine learning
    Vashisht, Vineet
    Pandey, Aditya Kumar
    Yadav, Satya Prakash
    IEIE Transactions on Smart Processing and Computing, 2021, 10 (03): : 233 - 239
  • [10] A review of deep learning techniques for speech processing
    Mehrish, Ambuj
    Majumder, Navonil
    Bharadwaj, Rishabh
    Mihalcea, Rada
    Poria, Soujanya
    INFORMATION FUSION, 2023, 99