State-of-the-art in speaker recognition

被引:54
作者
Faundez-Zanuy, M
Monte-Moreno, E
机构
[1] Escola Univ Politecn Mataro, Barcelona 08303, Spain
[2] TALP Res Ctr, Barcelona, Spain
关键词
D O I
10.1109/MAES.2005.1432568
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Recent advances in speech technologies have produced new tools that can be used to improve the performance and flexibility of speaker recognition. While there are few degrees of freedom or alternative methods when using fingerprint or iris identification techniques, speech offers much more flexibility and different levels to perform recognition: the system can force the user to speak in a particular manner, different for each attempt to enter. Also, with voice input, the system has other degrees of freedom, such as the use of knowledge/codes that only the user knows, or dialectical/semantical traits that are difficult to forge. This paper offers an overview of the state-of-the-art in speaker recognition, with special emphasis on the pros and cons, and the current research lines. The current research lines include improved classification systems, and the use of high level information by means of probabilistic grammars. In conclusion, speaker recognition is far away from being a technology where all the possibilities have already been explored.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
[21]   Driver Intention Recognition: State-of-the-Art Review [J].
Vellenga, Koen ;
Steinhauer, H. Joe ;
Karlsson, Alexander ;
Falkman, Goran ;
Rhodin, Asli ;
Koppisetty, Ashok Chaitanya .
IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 3 :602-616
[22]   On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models [J].
Shahreza, Hatef Otroshi ;
Hahn, Vedrana Krivokuca ;
Marcel, Sebastien .
2021 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2021, :50-55
[23]   State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations [J].
Villalba, Jesus ;
Chen, Nanxin ;
Snyder, David ;
Garcia-Romero, Daniel ;
McCree, Alan ;
Sell, Gregory ;
Borgstrom, Jonas ;
Garcia-Perera, Leibny Paola ;
Richardson, Fred ;
Dehak, Reda ;
Torres-Carrasquillo, Pedro A. ;
Dehak, Najim .
COMPUTER SPEECH AND LANGUAGE, 2020, 60
[24]   State-of-the-art Speaker Recognition for Telephone and Video Speech: the JHU-MIT Submission for NIST SRE18 [J].
Villalba, Jesus ;
Chen, Nanxin ;
Snyder, David ;
Garcia-Romero, Daniel ;
McCree, Alan ;
Sell, Gregory ;
Borgstrom, Jonas ;
Richardson, Fred ;
Shon, Suwon ;
Grondin, Francois ;
Dehak, Reda ;
Garcia-Perera, Leibny Paola ;
Povey, Daniel ;
Torres-Carrasquillo, Pedro A. ;
Khudanpur, Sanjeev ;
Dehak, Najim .
INTERSPEECH 2019, 2019, :1488-1492
[25]   State-of-the-art of situation recognition systems for intraoperative procedures [J].
Junger, D. ;
Frommer, S. M. ;
Burgert, O. .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (04) :921-939
[26]   Handwritten digit recognition using state-of-the-art techniques [J].
Liu, CL ;
Nakashima, K ;
Sako, H ;
Fujisawa, H .
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, :320-325
[27]   A survey of state-of-the-art approaches for emotion recognition in text [J].
Alswaidan, Nourah ;
Menai, Mohamed El Bachir .
KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (08) :2937-2987
[28]   Handwritten digit recognition: benchmarking of state-of-the-art techniques [J].
Liu, CL ;
Nakashima, K ;
Sako, H ;
Fujisawa, H .
PATTERN RECOGNITION, 2003, 36 (10) :2271-2285
[29]   State-of-the-art of situation recognition systems for intraoperative procedures [J].
D. Junger ;
S. M. Frommer ;
O. Burgert .
Medical & Biological Engineering & Computing, 2022, 60 :921-939
[30]   Optical music recognition: state-of-the-art and open issues [J].
Rebelo, Ana ;
Fujinaga, Ichiro ;
Paszkiewicz, Filipe ;
Marcal, Andre R. S. ;
Guedes, Carlos ;
Cardoso, Jaime S. .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2012, 1 (03) :173-190