A Bimodal Biometric Verification System Based on Deep Learning

Times cited: 1

Authors:

Song, Baolin [1]

Jiang, Hao [1]

Zhao, Li [1]

Huang, Chengwei [2]

Affiliations:

[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China

[2] Fandou Informat Technol Co Ltd, Dept Res & Dev, Nanjing, Jiangsu, Peoples R China

Source:

PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2017) | 2017

Funding:

National Natural Science Foundation of China;

Keywords:

identity authentication; multi-modal biometrics; feature fusion; deep learning; convolutional neural networks;

DOI:

10.1145/3177404.3177410

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronics and telecommunications technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

To overcome the limitations of single-mode biometric identification, this paper proposes a bimodal biometric verification system based on deep learning. A modified CNN architecture is used to generate better facial features for bimodal fusion. The facial features are fused at the feature level with acoustic features extracted by an acoustic feature extraction model. The resulting fusion features are used to train a neural network that identifies the target person to whom the features belong. The system is tested on a bimodal database composed of data drawn from TED-LIUM and CASIA-WebFace, and the experimental results show that bimodal biometrics outperform single-mode biometrics for identity authentication. Compared with using facial or acoustic features alone, the fusion features obtained by this method yield clearly higher classification accuracy.
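As a rough illustration of what feature-level fusion can look like, the sketch below normalizes one facial and one acoustic feature vector and joins them into a single fused vector. The abstract does not state which fusion operator, normalization, or feature dimensions the authors used, so concatenation, L2 normalization, the function names, and the toy vectors are all assumptions made for this example.

```python
import math
from typing import List, Sequence


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return [float(x) for x in vec]
    return [x / norm for x in vec]


def fuse_features(face: Sequence[float], voice: Sequence[float]) -> List[float]:
    """Feature-level fusion by concatenation.

    Assumption: the abstract does not name the fusion operator, so plain
    concatenation of per-modality L2-normalized vectors is used here.
    """
    return l2_normalize(face) + l2_normalize(voice)


# Hypothetical toy embeddings; real facial and acoustic features would be
# much longer and would come from the respective extraction models.
face_embedding = [3.0, 4.0]
voice_embedding = [0.0, 5.0, 0.0]

fused = fuse_features(face_embedding, voice_embedding)
print(len(fused))
```

Normalizing each modality before concatenation keeps one feature type from dominating the fused vector purely because of its scale; the fused vector would then be the input to whatever classifier is trained for verification.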


Pages: 89-93

Page count: 5

References (12 in total):

[1] [Anonymous], 2016, P IEEE WINTER C APPL

[2] Cumani S., 2016, US Patent No. 9,373,330

[3] Design and implementation of an online corpus of presentation transcripts of TED Talks [J]. Hasebe, Yoichiro. CURRENT WORK IN CORPUS LINGUISTICS: WORKING WITH TRADITIONALLY-CONCEIVED CORPORA AND BEYOND (CILC2015), 2015, 198: 174-182

[4] Emotional speech feature normalization and recognition based on speaker-sensitive feature clustering [J]. Huang, Chengwei; Song, Baolin; Zhao, Li. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (04): 805-816

[5] A Fast and Accurate Unconstrained Face Detector [J]. Liao, Shengcai; Jain, Anil K.; Li, Stan Z. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02): 211-223

[6] Distributed incremental fingerprint identification with reduced database penetration rate using a hierarchical classification based on feature fusion and selection [J]. Peralta, Daniel; Triguero, Isaac; Garcia, Salvador; Saeys, Yvan; Benitez, Jose M.; Herrera, Francisco. KNOWLEDGE-BASED SYSTEMS, 2017, 126: 91-103

[7] Rath Subrat Kumar, 2014, International Journal of Modern Education and Computer Science, V6, P34, DOI 10.5815/ijmecs.2014.08.05

[8] Ritter M., 2016, SPEECH COMMUN, P1

[9] Schroff F, 2015, PROC CVPR IEEE, P815, DOI 10.1109/CVPR.2015.7298682

[10] Stolcke A, 2014, AC SPEECH SIGN PROC, P5552
