TIMBRAL MODELING FOR MUSIC ARTIST RECOGNITION USING I-VECTORS

被引：0

作者：

Eghbal-zadeh, Hamid ^{[1
]}

Schedl, Markus ^{[1
]}

Widmer, Gerhard ^{[1
]}

机构：

[1] Johannes Kepler Univ Linz, Dept Computat Percept, A-4040 Linz, Austria

来源：

2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2015年

基金：

奥地利科学基金会;

关键词：

music artist recognition; timbral modeling; song-level features; i-vectors; mfcc;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Music artist (i.e., singer) recognition is a challenging task in Music Information Retrieval (MIR). The presence of different musical instruments, the diversity of music genres and singing techniques make the retrieval of artist-relevant information from a song difficult. Many authors tried to address this problem by using complex features or hybrid systems. In this paper, we propose new song-level timbre-related features that are built from frame-level IVIFCCs via so-called i-vectors. We report artist recognition results with multiple classifiers such as K-nearest neighbor, Discriminant Analysis and Naive Bayer using these new features. Our approach yields considerable improvements and outperforms existing methods. We could achieve an 84.31% accuracy using MFCC features on a 20-classes artist recognition task.

引用

页码：1286 / 1290

页数：5

共 42 条

[31] Robust online i-vectors for unsupervised adaptation of DNN acoustic models: A study in the context of digital voice assistants
Arsikere, Harish
Garimella, Sri
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2401 - 2405
[32] Speaker Recognition Using e-Vectors
Cumani, Sandro
Laface, Pietro
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 736 - 748
[33] COMPARISON OF USER MODELS BASED ON GMM-UBM AND I-VECTORS FOR SPEECH, HANDWRITING, AND GAIT ASSESSMENT OF PARKINSON'S DISEASE PATIENTS
Vasquez-Correa, J. C.
Bocklet, T.
Orozco-Arroyave, J. R.
Noeth, E.
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6544 - 6548
[34] A Design for Wireless Music Control System using Speech Recognition
Nilakhe, Aishwarya
Shelke, Sushama
[J]. 2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 337 - 339
[35] A Comparative Study of Text-Independent Speaker Recognition Systems Using Gaussian Mixture Modeling and i-vector Methods
Paulose, Suma
Mathew, Dominic
Thomas, Abraham
[J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 444 - 448
[36] Continuous Speech Recognition of Kannada Language using Triphone Modeling
Sajjan, Sharada C.
Vijaya, C.
[J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 451 - 455
[37] Phoneme Modeling for Speech Recognition in Kannada Using Hidden Markov Model
Kannadaguli, Prashanth
Thalengala, Ananthakrishna
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
[38] SVM and HMM Modeling Techniques for Speech Recognition Using LPCC and MFCC Features
Ananthi, S.
Dhanalakshmi, P.
[J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 519 - 526
[39] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music Using Discrete Wavelet Transform
Dash, Sukanta Kumar
Solanki, S. S.
Chakraborty, Soubhik
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4239 - 4271
[40] Raga Recognition of Indian Classical Music using Meerkat Optimization Based MFCC and Fine Tuned BILSTM-XGBOOST
Jayanthi, J.
Upendran, V.
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,

← 1 2 3 4 5 →