Combining Evidence from Spectral and Source-like Features for Person Recognition from Humming

被引:0
|
作者
Patil, Hemant A. [1 ]
Madhavi, Maulik C. [1 ]
Parhi, Keshab K. [2 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhiangar, India
[2] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN USA
来源
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年
关键词
Humming; VTEO; VTMFCC; fusion of Source-System features; polynomial classifier; SPEAKER RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, hum of a person is used in voice biometric system. In addition, recently proposed feature set, i.e., Variable length Teager Energy Based Mel Frequency Cepstral Coefficients (VTMFCC), is found to capture perceptually meaningful source-like information from hum signal. For person recognition, MFCC gives EER of 13.14% and %ID of 64.96%. A reduction in equal error rate (EER) by 0.2% and improvement in identification rate by 7.3 % is achieved when a score-level fusion system is employed by combining evidence from MFCC (system) and VTMFCC (source-like features) than MFCC alone. Results are reported for various feature dimensions and population sizes.
引用
收藏
页码:376 / +
页数:2
相关论文
共 50 条
  • [1] Static and dynamic information derived from source and system features for person recognition from humming
    Patil, Hemant
    Madhavi, Maulik
    Parhi, Keshab
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (03) : 393 - 406
  • [2] Combining evidences from magnitude and phase information using VTEO for person recognition using humming
    Patil, Hemant A.
    Madhavi, Maulik C.
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 225 - 256
  • [3] Exploiting Variable Length Teager Energy Operator in Melcepstral Features for Person Recognition from Humming
    Madhavi, Maulik C.
    Patil, Hemant A.
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 624 - 628
  • [4] Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
    Yegnanarayana, B
    Prasanna, SRM
    Zachariah, JM
    Gupta, CS
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 575 - 582
  • [5] Significance of Phase-based Features for Person Recognition Using Humming
    Sailor, Hardik B.
    Madhavi, Maulik C.
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 99 - 103
  • [6] Combining evidence from residual phase and MFCC features for speaker recognition
    Murty, KR
    Yegnanarayana, B
    IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (01) : 52 - 55
  • [7] Emotion Recognition from Speech Signals using Excitation Source and Spectral Features
    Choudhury, Akash Roy
    Ghosh, Anik
    Pandey, Rahul
    Barman, Subhas
    PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON), 2018, : 257 - 261
  • [8] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [9] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [10] AT-A-DISTANCE PERSON RECOGNITION VIA COMBINING OCULAR FEATURES
    Verma, Shalini
    Mittal, Paritosh
    Vatsa, Mayank
    Singh, Richa
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3131 - 3135