TIMBRAL MODELING FOR MUSIC ARTIST RECOGNITION USING I-VECTORS

被引：0

作者：

Eghbal-zadeh, Hamid ^{[1
]}

Schedl, Markus ^{[1
]}

Widmer, Gerhard ^{[1
]}

机构：

[1] Johannes Kepler Univ Linz, Dept Computat Percept, A-4040 Linz, Austria

来源：

2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2015年

基金：

奥地利科学基金会;

关键词：

music artist recognition; timbral modeling; song-level features; i-vectors; mfcc;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Music artist (i.e., singer) recognition is a challenging task in Music Information Retrieval (MIR). The presence of different musical instruments, the diversity of music genres and singing techniques make the retrieval of artist-relevant information from a song difficult. Many authors tried to address this problem by using complex features or hybrid systems. In this paper, we propose new song-level timbre-related features that are built from frame-level IVIFCCs via so-called i-vectors. We report artist recognition results with multiple classifiers such as K-nearest neighbor, Discriminant Analysis and Naive Bayer using these new features. Our approach yields considerable improvements and outperforms existing methods. We could achieve an 84.31% accuracy using MFCC features on a 20-classes artist recognition task.

引用

页码：1286 / 1290

页数：5

共 42 条

[1] Robust Speaker Recognition Using MAP Estimation of Additive Noise in i-vectors Space
Ben Kheder, Waad
Matrouf, Driss
Bousquet, Pierre-Michel
Bonastre, Jean-Francois
Ajili, Moez
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 97 - 107
[2] I-vectors for image classification
Smith, David C.
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVII, 2014, 9217
[3] Multi-dialect acoustic modeling using phone mapping and online i-vectors
Arsikere, Harish
Sapru, Ashtosh
Garimella, Sri
INTERSPEECH 2019, 2019, : 2125 - 2129
[4] Multitaper MFCC and PLP features for speaker verification using i-vectors
Alam, Md Jahangir
Kinnunen, Tomi
Kenny, Patrick
Ouellet, Pierre
O'Shaughnessy, Douglas
SPEECH COMMUNICATION, 2013, 55 (02) : 237 - 251
[5] Intersession compensation and scoring methods in the i-vectors space for speaker recognition
Bousquet, Pierre-Michel
Matrouf, Driss
Bonastre, Jean-Francois
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 492 - 495
[6] Audio-Visual Speech Separation Using I-Vectors
Luo, Yiyu
Wang, Jing
Wang, Xinyao
Wen, Liang
Wang, Lizhong
2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 276 - 280
[7] E-VECTORS: JFA AND I-VECTORS REVISITED
Cumani, Sandro
Laface, Pietro
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5435 - 5439
[8] Regional Accents Recognition based on i-vectors approach: The Case of the Algerian linguistic environment
Djellab, Mourad
Amrouche, Abderrahmane
Mehallegue, Noureddine
Bouridane, Ahmed
2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 166 - U400
[9] An Investigation on the Use of i-vectors for Robust ASR
Dimitriadis, Dimitrios
Thomas, Samuel
Ganapathy, Sriram
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3828 - 3832
[10] Senone I-Vectors for Robust Speaker Verification
Tan, Zhili
Zhu, Yingke
Mak, Man-Wai
Mak, Brian Kan-Wing
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,

← 1 2 3 4 5 →