Bimodal speaker identification using dynamic Bayesian network

Cited by: 0

Authors:

Li, DD [1]

Sang, LF [1]

Yang, YC [1]

Wu, ZH [1]

Affiliation:

[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Peoples R China

Source:

ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS | 2004 / Vol. 3338

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Authenticating a person requires consistently high recognition accuracy, which is difficult to attain with a single recognition modality. This paper assesses the fusion of voiceprint and face features for bimodal speaker identification using a dynamic Bayesian network (DBN). Our contribution is a general feature-level fusion framework for bimodal speaker identification. Within the framework, the voice and face features are combined in a single DBN to obtain better performance than either single-modality system alone. The tests were conducted on a multimodal database of 54 users who provided voiceprint and face data of different speech types and content. We compare our approach with mono-modal systems and classic decision-level methods and show that feature-level fusion using a dynamic Bayesian network improves performance by about 4-5%, much better than the other methods.

Pages: 577-585

Number of pages: 9
