A Comprehensive Review on Audio based Musical Instrument Recognition: Human-Machine Interaction towards Industry 4.0

被引:1
作者
Dash, Sukanta Kumar [1 ]
Solanki, S. S. [1 ]
Chakraborty, Soubhik [2 ]
机构
[1] Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Jharkhand, India
[2] Birla Inst Technol, Dept Math, Ranchi 835215, Jharkhand, India
来源
JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH | 2023年 / 82卷 / 01期
关键词
Classifier learning; Feature descriptors; Instrument recognition; Multimodal communication; Music information retrieval; NEURAL-NETWORK; CLASSIFICATION; IDENTIFICATION; SOUNDS;
D O I
10.56042/jsir.v82i1.70251
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Over the last two decades, the application of machine technology has shifted from industrial to residential use. Further, advances in hardware and software sectors have led machine technology to its utmost application, the human-machine interaction, a multimodal communication. Multimodal communication refers to the integration of various modalities of information like speech, image, music, gesture, and facial expressions. Music is the non-verbal type of communication that humans often use to express their minds. Thus, Music Information Retrieval (MIR) has become a booming field of research and has gained a lot of interest from the academic community, music industry, and vast multimedia users. The problem in MIR is accessing and retrieving a specific type of music as demanded from the extensive music data. The most inherent problem in MIR is music classification. The essential MIR tasks are artist identification, genre classification, mood classification, music annotation, and instrument recognition. Among these, instrument recognition is a vital sub-task in MIR for various reasons, including retrieval of music information, sound source separation, and automatic music transcription. In recent past years, many researchers have reported different machine learning techniques for musical instrument recognition and proved some of them to be good ones. This article provides a systematic, comprehensive review of the advanced machine learning techniques used for musical instrument recognition. We have stressed on different audio feature descriptors of common choices of classifier learning used for musical instrument recognition. This review article emphasizes on the recent developments in music classification techniques and discusses a few associated future research problems.
引用
收藏
页码:26 / 37
页数:12
相关论文
共 25 条
  • [21] Review of constraints on vision-based gesture recognition for human-computer interaction
    Chakraborty, Biplab Ketan
    Sarma, Debajit
    Bhuyan, M. K.
    MacDorman, Karl F.
    IET COMPUTER VISION, 2018, 12 (01) : 3 - 15
  • [22] A Comprehensive Review on Handcrafted and Learning-Based Action Representation Approaches for Human Activity Recognition
    Sargano, Allah Bux
    Angelov, Plamen
    Habib, Zulfiqar
    APPLIED SCIENCES-BASEL, 2017, 7 (01):
  • [23] Dual-Discriminability-Analysis Type-2 Fuzzy-Neural-Network Based Speech Classification for Human-Machine Interaction
    Wu, Gin-Der
    Zhu, Zhen-Wei
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (04) : 831 - 847
  • [24] Machine learning based approaches for clinical and non-clinical depression recognition and depression relapse prediction using audiovisual and EEG modalities: A comprehensive review
    Yasin, Sana
    Othmani, Alice
    Raza, Imran
    Hussain, Syed Asad
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 159
  • [25] Design and Implementation of Human-Computer Interaction Systems Based on Transfer Support Vector Machine and EEG Signal for Depression Patients' Emotion Recognition
    Chen, Xiang
    Xu, Lijun
    Cao, Ming
    Zhang, Tinghua
    Shang, Zhongan
    Zhang, Linghao
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2021, 11 (03) : 948 - 954