TIMBRAL MODELING FOR MUSIC ARTIST RECOGNITION USING I-VECTORS

被引:0
作者
Eghbal-zadeh, Hamid [1 ]
Schedl, Markus [1 ]
Widmer, Gerhard [1 ]
机构
[1] Johannes Kepler Univ Linz, Dept Computat Percept, A-4040 Linz, Austria
来源
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2015年
基金
奥地利科学基金会;
关键词
music artist recognition; timbral modeling; song-level features; i-vectors; mfcc;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Music artist (i.e., singer) recognition is a challenging task in Music Information Retrieval (MIR). The presence of different musical instruments, the diversity of music genres and singing techniques make the retrieval of artist-relevant information from a song difficult. Many authors tried to address this problem by using complex features or hybrid systems. In this paper, we propose new song-level timbre-related features that are built from frame-level IVIFCCs via so-called i-vectors. We report artist recognition results with multiple classifiers such as K-nearest neighbor, Discriminant Analysis and Naive Bayer using these new features. Our approach yields considerable improvements and outperforms existing methods. We could achieve an 84.31% accuracy using MFCC features on a 20-classes artist recognition task.
引用
收藏
页码:1286 / 1290
页数:5
相关论文
共 42 条
  • [31] Robust online i-vectors for unsupervised adaptation of DNN acoustic models: A study in the context of digital voice assistants
    Arsikere, Harish
    Garimella, Sri
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2401 - 2405
  • [32] Speaker Recognition Using e-Vectors
    Cumani, Sandro
    Laface, Pietro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 736 - 748
  • [33] COMPARISON OF USER MODELS BASED ON GMM-UBM AND I-VECTORS FOR SPEECH, HANDWRITING, AND GAIT ASSESSMENT OF PARKINSON'S DISEASE PATIENTS
    Vasquez-Correa, J. C.
    Bocklet, T.
    Orozco-Arroyave, J. R.
    Noeth, E.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6544 - 6548
  • [34] A Design for Wireless Music Control System using Speech Recognition
    Nilakhe, Aishwarya
    Shelke, Sushama
    [J]. 2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 337 - 339
  • [35] A Comparative Study of Text-Independent Speaker Recognition Systems Using Gaussian Mixture Modeling and i-vector Methods
    Paulose, Suma
    Mathew, Dominic
    Thomas, Abraham
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 444 - 448
  • [36] Continuous Speech Recognition of Kannada Language using Triphone Modeling
    Sajjan, Sharada C.
    Vijaya, C.
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 451 - 455
  • [37] Phoneme Modeling for Speech Recognition in Kannada Using Hidden Markov Model
    Kannadaguli, Prashanth
    Thalengala, Ananthakrishna
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [38] SVM and HMM Modeling Techniques for Speech Recognition Using LPCC and MFCC Features
    Ananthi, S.
    Dhanalakshmi, P.
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 519 - 526
  • [39] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music Using Discrete Wavelet Transform
    Dash, Sukanta Kumar
    Solanki, S. S.
    Chakraborty, Soubhik
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4239 - 4271
  • [40] Raga Recognition of Indian Classical Music using Meerkat Optimization Based MFCC and Fine Tuned BILSTM-XGBOOST
    Jayanthi, J.
    Upendran, V.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,