Gender and Affect Recognition Based on GMM and GMM-UBM modeling with relevance MAP estimation

被引:0
作者
Gajsek, Rok [1 ]
Zibert, Janez [2 ]
Justin, Tadej [1 ]
Struc, Vitomir [1 ]
Vesnicer, Bostjan [3 ]
Mihelic, France [1 ]
机构
[1] Univ Ljubljana, Fac Elect Engn, Ljubljana 61000, Slovenia
[2] Univ Primorska, Dept Informat Sci & Technol, Primorska, Slovenia
[3] Alpineon Res & Dev, Ljubljana, Slovenia
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
emotion recognition; affect recognition; gender recognition; GMM-UBM; MAP;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper presents our efforts in the Gender Sub-Challenge and the Affect Sub-Challenge of the INTERSPEECH 2010 Paralinguistic Challenge. The system for the Gender Sub-Challenge is based on modeling the Mel-Frequency Cepstrum Coefficients using Gaussian mixture models, building a separate model for each of the gender categories. For the Affect Sub-Challenge we propose a modeling schema where a universal background model is first trained an all the training data and then, employing the maximum a posteriori estimation criteria, a new feature vector of means is produced for each particular sample. The feature set used is comprised of low level descriptors from the baseline system, which in our case are split into four sub-sets, and modeled by its own model. Predictions from all sub-systems are fused using the sum rule fusion. Aside from the baseline regression procedure, we also evaluated the Support Vector Regression and compared the performance. Both systems achieve higher recognition results on the development set compared to baseline, but in the Affect Sub-Challenge our system's cross correlation is lower than that of the baseline system, although the mean linear error is slightly superior. In the Gender Sub-Challenge the unweighted average recall on the test set is 82.84%, and for the Affect Sub-Challenge the cross-correlation on the test set is 0.39 with mean linear error of 0.143.
引用
收藏
页码:2814 / +
页数:2
相关论文
共 50 条
  • [1] Recognition of Aircraft Engine Sound Based on GMM-UBM Model
    Yuan, Shuai
    Sun, Chengli
    Yang, Haoge
    2017 INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING (EITCE 2017), 2017, 128
  • [2] Recognition of Biometric Unlock Pattern by GMM-UBM
    Ozbek, M. Erdal
    Haytom, Mohamed Amine
    Cherrier, Estelle
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [3] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
    Asbai, Nassim
    Amrouche, Abderrahmane
    Debyeche, Mohamed
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
  • [4] Comparison of LPCC and MFCC features and GMM and GMM-UBM modeling for Limited Data Speaker Verification
    Kumari, Jayanthi T. R.
    Jayanna, H. S.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 818 - 823
  • [5] Gammachirp Filter Banks Applied in Roust Speaker Recognition Based on GMM-UBM Classifier
    Deng, Lei
    Gao, Yong
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (02) : 170 - 177
  • [6] Speaker Dependent Activation Keyword Detector Based on GMM-UBM
    Kurniawati, Evelyn
    George, Sapna
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 671 - 674
  • [7] Language Identification of Minority Language based on GMM-UBM Model
    Zuo, Libo
    Yang, Jian
    He, Linyu
    2011 AASRI CONFERENCE ON APPLIED INFORMATION TECHNOLOGY (AASRI-AIT 2011), VOL 1, 2011, : 395 - 398
  • [8] A Discriminative Performance Metric for GMM-UBM Speaker Identification
    Dehzangi, Omid
    Ma, Bin
    Chng, Eng Siong
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2114 - +
  • [9] SPEAKER DEPENDENT THRESHOLD'S ESTIMATION FOR GMM-UBM SPEAKER VERIFICATION SYSTEMS
    Bachir, Mehemmel
    Samir, Akrouf
    Abbas, Mehemmel
    2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 1, 2012, : 537 - 542
  • [10] Combination of GMM-UBM and DTW for Voice Command Authentication System
    Kurniawati, Evelyn
    Somarajan, Sasiraj
    TENTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2019, 2019, 11071