Static and dynamic information derived from source and system features for person recognition from humming

被引:2
|
作者
Patil, Hemant [1 ]
Madhavi, Maulik [1 ]
Parhi, Keshab [2 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar, India
[2] Univ Minnesota, Dept Elect & Comp Engn, Twin Cities Campus, Minneapolis, MN 55455 USA
关键词
Humming; Delta and shifted delta features; VTMFCC; Score-level fusion; Polynomial classifier;
D O I
10.1007/s10772-012-9161-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, hum of a person (instead of normal speech) is used to design a voice biometric system for person recognition. In addition, a recently proposed static feature set, viz., Variable length Teager energy based Mel Frequency Cepstral Coefficients (VTMFCC), is found to capture source-like information of a hum signal. Effectiveness of VTMFCC over linear prediction (LP) residual to capture the complementary information than MFCC is demonstrated in a hum signal. Person recognition performance is found to be better when a score-level fusion is used by combining evidences from static and dynamic features forMFCC (system) and VTMFCC (source-like) features than MFCC alone. Experiments are validated on two types of dynamic features, viz., delta cepstrum and shifted delta cepstrum. In addition, for score-level fusion using static and dynamic features % identification rate and % Equal Error Rate are observed to outperform by 7.9 % and 0.27 %, respectively than MFCC alone. Furthermore, we have observed that person recognition system gives better performance for larger frame duration 69.6 ms as opposed to traditional 10-30 ms frame duration.
引用
收藏
页码:393 / 406
页数:14
相关论文
共 50 条
  • [1] Combining Evidence from Spectral and Source-like Features for Person Recognition from Humming
    Patil, Hemant A.
    Madhavi, Maulik C.
    Parhi, Keshab K.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 376 - +
  • [2] Exploiting Variable Length Teager Energy Operator in Melcepstral Features for Person Recognition from Humming
    Madhavi, Maulik C.
    Patil, Hemant A.
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 624 - 628
  • [3] Combining evidences from magnitude and phase information using VTEO for person recognition using humming
    Patil, Hemant A.
    Madhavi, Maulik C.
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 225 - 256
  • [4] Significance of Phase-based Features for Person Recognition Using Humming
    Sailor, Hardik B.
    Madhavi, Maulik C.
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 99 - 103
  • [5] Static and Dynamic Features Analysis from Human Skeletons for Gait Recognition
    Li, Ziqiong
    Yu, Shiqi
    Reyes, Edel B. Garcia
    Shan, Caifeng
    Li, Yan-ran
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [6] Emotion recognition from speech using source, system, and prosodic features
    Koolagudi, Shashidhar G.
    Rao, K. Sreenivasa
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 265 - 289
  • [7] RECOVERING DYNAMIC INFORMATION FROM STATIC HANDWRITING
    BOCCIGNONE, G
    CHIANESE, A
    CORDELLA, LP
    MARCELLI, A
    PATTERN RECOGNITION, 1993, 26 (03) : 409 - 418
  • [8] Person Recognition at a Distance: Improving Face Recognition Through Body Static Information
    Gonzalez-Sosa, Ester
    Vera-Rodriguez, Ruben
    Hernandez-Ortega, Javier
    Fierrez, Julian
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3439 - 3444
  • [9] Dissociating gait from static appearance: A virtual reality study of the role of dynamic identity signatures in person recognition
    Simhi, Noa
    Yovel, Galit
    COGNITION, 2020, 205
  • [10] Social perception from static and dynamic visual information
    Perrett, D. I.
    Xiao, D.
    Jellema, T.
    Barraclough, N.
    Oram, M. W.
    PERCEPTION, 2006, 35 : 120 - 120