Audio Songs Classification Based on Music Patterns

Cited by: 4
Authors
Sharma, Rahul [1]
Murthy, Y. V. Srinivasa [1]
Koolagudi, Shashidhar G. [1]
Affiliations
[1] Natl Inst Technol Karnataka, Surathkal 575025, Karnataka, India
Source
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 3 | 2016 / Vol. 381
Keywords
Music classification; Music indexing and retrieval; Mel-frequency cepstral coefficients; Artificial neural networks; Pattern recognition; Statistical properties; Vibrato; RECOGNITION; RETRIEVAL;
DOI
10.1007/978-81-322-2526-3_17
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work classifies audio songs by their music patterns, so that music clips can be retrieved according to a listener's taste. The task also supports indexing and accessing music clips according to the listener's state. Seven main categories are considered: devotional, energetic, folk, happy, pleasant, sad, and sleepy. Forty music clips per category are used for training and fifteen per category for testing. Vibrato-related features such as jitter and shimmer are extracted along with the mel-frequency cepstral coefficients (MFCCs). Statistical values of pitch (minimum, maximum, mean, and standard deviation) are computed and added to the MFCCs, jitter, and shimmer, resulting in a 19-dimensional feature vector. A feedforward backpropagation neural network (BPNN) is used as the classifier because of its efficiency in mapping nonlinear relations. An average accuracy of 82% is achieved on the 105 testing clips.
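The feature vector described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation. Several details are assumptions not stated in the record: the count of 13 MFCCs (inferred from 19 minus 2 vibrato features minus 4 pitch statistics), averaging MFCCs over frames, and the "local" definitions of jitter and shimmer (mean absolute difference between consecutive periods or amplitudes, divided by their mean). The function names and the synthetic inputs are hypothetical.

```python
import random
import statistics

# Assumption: 19 total - 2 (jitter, shimmer) - 4 (pitch statistics) = 13 MFCCs.
N_MFCC = 13


def local_perturbation(values):
    """Mean absolute difference of consecutive values, relative to their mean.

    Applied to pitch periods this is one common definition of jitter;
    applied to peak amplitudes, of shimmer (assumed here, not from the source).
    """
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return statistics.fmean(diffs) / statistics.fmean(values)


def pitch_statistics(f0):
    """Min, max, mean, and (population) standard deviation of the pitch track."""
    return [min(f0), max(f0), statistics.fmean(f0), statistics.pstdev(f0)]


def build_feature_vector(mfcc_frames, periods, amplitudes, f0):
    """Concatenate frame-averaged MFCCs, jitter, shimmer, and pitch statistics."""
    mfcc_mean = [statistics.fmean(column) for column in zip(*mfcc_frames)]
    jitter = local_perturbation(periods)
    shimmer = local_perturbation(amplitudes)
    return mfcc_mean + [jitter, shimmer] + pitch_statistics(f0)


# Synthetic stand-ins for values that a real pipeline would extract from audio.
rng = random.Random(0)
mfcc_frames = [[rng.gauss(0.0, 1.0) for _ in range(N_MFCC)] for _ in range(200)]
f0 = [rng.uniform(100.0, 400.0) for _ in range(200)]  # pitch track in Hz
periods = [1.0 / f for f in f0]                        # pitch periods in s
amplitudes = [rng.uniform(0.5, 1.0) for _ in range(200)]

features = build_feature_vector(mfcc_frames, periods, amplitudes, f0)
print(len(features))
```

With seven categories, the split described in the abstract gives 7 × 40 = 280 training clips and 7 × 15 = 105 testing clips, each of which would be represented by one such vector before being passed to the classifier.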
Pages: 157-166
Number of pages: 10
Related papers
50 in total
  • [41] Audio-visual stimulation based emotion classification by correlated EEG channels
    Ahirwal, Mitul Kumar
    Kose, Mangesh Ramaji
    HEALTH AND TECHNOLOGY, 2020, 10 (01) : 7 - 23
  • [42] Dictionary learning based sparse coefficients for audio classification with max and average pooling
    Zubair, Syed
    Yan, Fei
    Wang, Wenwu
    DIGITAL SIGNAL PROCESSING, 2013, 23 (03) : 960 - 970
  • [43] Music video emotion classification using slow-fast audio-video network and unsupervised feature representation
    Pandeya, Yagya Raj
    Bhattarai, Bhuwan
    Lee, Joonwhoan
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [44] Research on Music Classification Based on MFCC and BP Neural Network
    Liu, Yongchun
    Hong, Song
    Jing, Yang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONICS AND COMPUTER, 2014, 59 : 129 - 132
  • [45] Content-Based Music Classification Using Ensemble of Classifiers
    Anisetty, Manikanta Durga Srinivas
    Shetty, Gagan K.
    Hiriyannaiah, Srinidhi
    Matt, Siddesh Gaddadevara
    Srinivasa, K. G.
    Kanavalli, Anita
    INTELLIGENT HUMAN COMPUTER INTERACTION, 2018, 11278 : 285 - 292
  • [46] Classification of Optical Music Symbols based on Combined Neural Network
    Wen, Cuihong
    Rebelo, Ana
    Zhang, Jing
    Cardoso, Jaime
    2014 INTERNATIONAL CONFERENCE ON MECHATRONICS AND CONTROL (ICMC), 2014, : 419 - 423
  • [47] Polish dance music classification based on mel spectrogram decomposition
    Chwaleba, Kinga
    Wach, Weronika
    ADVANCES IN SCIENCE AND TECHNOLOGY-RESEARCH JOURNAL, 2025, 19 (02) : 95 - 113
  • [48] Hybrid Approach for Emotion Classification of Audio Conversation Based on Text and Speech Mining
    Bhaskar, Jasmine
    Sruthi, K.
    Nedungadi, Prema
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 635 - 643
  • [49] A Classification Method for Environmental Audio Data
    Li, Ying
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 2, 2010, : 355 - 361
  • [50] Audio Scanning Network: Bridging Time and Frequency Domains for Audio Classification
    Chen, Liangwei
    Zhou, Xiren
    Chen, Huanhuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11355 - +