TIMBRAL MODELING FOR MUSIC ARTIST RECOGNITION USING I-VECTORS

被引:0
|
作者
Eghbal-zadeh, Hamid [1 ]
Schedl, Markus [1 ]
Widmer, Gerhard [1 ]
机构
[1] Johannes Kepler Univ Linz, Dept Computat Percept, A-4040 Linz, Austria
来源
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2015年
基金
奥地利科学基金会;
关键词
music artist recognition; timbral modeling; song-level features; i-vectors; mfcc;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Music artist (i.e., singer) recognition is a challenging task in Music Information Retrieval (MIR). The presence of different musical instruments, the diversity of music genres and singing techniques make the retrieval of artist-relevant information from a song difficult. Many authors tried to address this problem by using complex features or hybrid systems. In this paper, we propose new song-level timbre-related features that are built from frame-level IVIFCCs via so-called i-vectors. We report artist recognition results with multiple classifiers such as K-nearest neighbor, Discriminant Analysis and Naive Bayer using these new features. Our approach yields considerable improvements and outperforms existing methods. We could achieve an 84.31% accuracy using MFCC features on a 20-classes artist recognition task.
引用
收藏
页码:1286 / 1290
页数:5
相关论文
共 42 条
  • [1] Robust Speaker Recognition Using MAP Estimation of Additive Noise in i-vectors Space
    Ben Kheder, Waad
    Matrouf, Driss
    Bousquet, Pierre-Michel
    Bonastre, Jean-Francois
    Ajili, Moez
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 97 - 107
  • [2] I-vectors for image classification
    Smith, David C.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVII, 2014, 9217
  • [3] Multi-dialect acoustic modeling using phone mapping and online i-vectors
    Arsikere, Harish
    Sapru, Ashtosh
    Garimella, Sri
    INTERSPEECH 2019, 2019, : 2125 - 2129
  • [4] Multitaper MFCC and PLP features for speaker verification using i-vectors
    Alam, Md Jahangir
    Kinnunen, Tomi
    Kenny, Patrick
    Ouellet, Pierre
    O'Shaughnessy, Douglas
    SPEECH COMMUNICATION, 2013, 55 (02) : 237 - 251
  • [5] Intersession compensation and scoring methods in the i-vectors space for speaker recognition
    Bousquet, Pierre-Michel
    Matrouf, Driss
    Bonastre, Jean-Francois
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 492 - 495
  • [6] Audio-Visual Speech Separation Using I-Vectors
    Luo, Yiyu
    Wang, Jing
    Wang, Xinyao
    Wen, Liang
    Wang, Lizhong
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 276 - 280
  • [7] E-VECTORS: JFA AND I-VECTORS REVISITED
    Cumani, Sandro
    Laface, Pietro
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5435 - 5439
  • [8] Regional Accents Recognition based on i-vectors approach: The Case of the Algerian linguistic environment
    Djellab, Mourad
    Amrouche, Abderrahmane
    Mehallegue, Noureddine
    Bouridane, Ahmed
    2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 166 - U400
  • [9] An Investigation on the Use of i-vectors for Robust ASR
    Dimitriadis, Dimitrios
    Thomas, Samuel
    Ganapathy, Sriram
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3828 - 3832
  • [10] Senone I-Vectors for Robust Speaker Verification
    Tan, Zhili
    Zhu, Yingke
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,