iVector Fusion of Prosodic and Cepstral Features for Speaker Verification

被引:0
|
作者
Kockmann, Marcel [1 ]
Ferrer, Luciana
Burget, Lukas [1 ]
Cernocky, Jan Honza [1 ]
机构
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
来源
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年
关键词
speaker verification; prosody; JFA; iVector; SMM; fusion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we apply the promising iVector extraction technique followed by PLDA modeling to simple prosodic contour features. With this procedure we achieve results comparable to a system that models much more complex prosodic features using our recently proposed SMM-based iVector modeling technique. We then propose a combination of both prosodic iVectors by joint PLDA modeling that leads to significant improvements over individual systems with an EER of 5.4% on NEST SRE 2008 telephone data. Finally, we can combine these two prosodic iVector front ends with a baseline cepstral iVector system to achieve up to 21% relative reduction in new DCF.
引用
收藏
页码:272 / 275
页数:4
相关论文
共 50 条
  • [1] Evaluation of Lineal Relation between Shifted Delta Cepstral Features and Prosodic Features in Speaker Verification
    Calvo, Jose R.
    Ribas, Dayana
    Fernandez, Rafael
    Hernandez, Gabriel
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 112 - 119
  • [2] Prosodic Features for Speaker Verification
    Mary, Leena
    Yegnanarayana, B.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
  • [3] Effect of VoIP on Prosodic Features for Speaker Verification
    Cherian, Athira Jess
    Antony, Anil P.
    Mary, Leena
    2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 487 - 490
  • [4] Multi-System Fusion of Extended Context Prosodic and Cepstral Features for Paralinguistic Speaker Trait Classification
    Sanchez, Michelle Hewlett
    Lawson, Aaron
    Vergyri, Dimitra
    Bratt, Harry
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 514 - 517
  • [5] Application of Shifted Delta Cepstral Features in Speaker Verification
    Calvo, Jose R.
    Fernandez, Rafael
    Hernandez, Gabriel
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 29 - 32
  • [6] Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
    Sarkar, Achintya K.
    Cong-Thanh Do
    Le, Viet-Bac
    Barras, Claude
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1040 - 1044
  • [7] Fusion of Acoustic and Prosodic Features for Speaker Clustering
    Zibert, Janez
    Mihelic, France
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 210 - +
  • [8] Speaker verification using boosted cepstral features with Gaussian distributions
    Salman, Ahmad
    Muhammad, Ejaz
    Khurshid, Khawar
    INMIC 2007: PROCEEDINGS OF THE 11TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE, 2007, : 14 - 18
  • [9] Novel Phase Encoded Mel Cepstral Features for Speaker Verification
    Naik, Apeksha J.
    Tak, Rishabh
    Patil, Hemant A.
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 572 - 581
  • [10] Fusion of auditory inspired amplitude modulation spectrum and cepstral features for whispered and normal speech speaker verification
    Sarria-Paja, Milton
    Falk, Tiago H.
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 437 - 456