Combining Evidence from Spectral and Source-like Features for Person Recognition from Humming

Cited by: 0

Authors:

Patil, Hemant A. [1]

Madhavi, Maulik C. [1]

Parhi, Keshab K. [2]

Affiliations:

[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar, India

[2] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN USA

Source:

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011

Keywords:

Humming; VTEO; VTMFCC; fusion of Source-System features; polynomial classifier; SPEAKER RECOGNITION;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, the hum of a person is used in a voice biometric system. In addition, a recently proposed feature set, Variable length Teager Energy Based Mel Frequency Cepstral Coefficients (VTMFCC), is found to capture perceptually meaningful source-like information from the hum signal. For person recognition, MFCC alone gives an equal error rate (EER) of 13.14% and an identification rate (%ID) of 64.96%. Compared with MFCC alone, a score-level fusion system that combines evidence from MFCC (system features) and VTMFCC (source-like features) reduces the EER by 0.2% and improves the identification rate by 7.3%. Results are reported for various feature dimensions and population sizes.
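The abstract does not state how the two score streams are combined or how the EER is computed, so the following is only a minimal sketch of one common approach: min-max normalisation followed by a weighted sum, with the EER read off at the threshold where false acceptance and false rejection rates are closest. The function names, the fusion weight, and the synthetic scores are all assumptions made for illustration; they are not taken from the paper, which reports using a polynomial classifier.

```python
import numpy as np


def fuse_scores(system_scores, source_scores, weight=0.5):
    """Weighted-sum score-level fusion (an assumed rule, not the paper's).

    Each score stream is min-max normalised to [0, 1] before mixing, so the
    two classifiers contribute on a comparable scale.
    """
    def minmax(scores):
        scores = np.asarray(scores, dtype=float)
        span = scores.max() - scores.min()
        if span == 0:
            return np.zeros_like(scores)
        return (scores - scores.min()) / span

    return weight * minmax(system_scores) + (1.0 - weight) * minmax(source_scores)


def equal_error_rate(genuine, impostor):
    """Approximate the EER from genuine and impostor trial scores.

    Higher scores are assumed to mean "more likely the claimed person".
    """
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)
    thresholds = np.sort(np.concatenate([genuine, impostor]))
    # False acceptance: impostor trials scoring at or above the threshold.
    far = np.array([(impostor >= t).mean() for t in thresholds])
    # False rejection: genuine trials scoring below the threshold.
    frr = np.array([(genuine < t).mean() for t in thresholds])
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0


if __name__ == "__main__":
    # Synthetic scores standing in for MFCC (system) and VTMFCC (source-like)
    # classifier outputs; these are NOT the paper's data.
    rng = np.random.default_rng(0)
    labels = np.array([1] * 200 + [0] * 200)  # 1 = genuine, 0 = impostor
    mfcc = rng.normal(loc=labels * 1.0, scale=1.0)
    vtmfcc = rng.normal(loc=labels * 0.8, scale=1.0)
    fused = fuse_scores(mfcc, vtmfcc, weight=0.6)

    for name, scores in [("MFCC", mfcc), ("VTMFCC", vtmfcc), ("fused", fused)]:
        eer = equal_error_rate(scores[labels == 1], scores[labels == 0])
        print(f"{name}: EER = {100 * eer:.2f}%")
```

In practice the fusion weight would be tuned on held-out data; the value 0.6 above is arbitrary.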

Citation

Pages: 376 / +

Page count: 2

