SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION FOR EMBEDDED SYSTEMS

被引：11

作者：

Balian, Julien ^{[1
]}

Tavarone, Raffaele ^{[1
]}

Poumeyrol, Mathieu ^{[1
]}

Coucke, Alice ^{[1
]}

机构：

[1] Sonos Inc, Paris, France

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年

关键词：

speaker verification; neural networks; text independent; small footprint;

D O I：

10.1109/ICASSP39728.2021.9413564

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Deep neural network approaches to speaker verification have proven successful, but typical computational requirements of State-Of-The-Art (SOTA) systems make them unsuited for embedded applications. In this work, we present a two-stage model architecture orders of magnitude smaller than common solutions (237.5K learning parameters, 11.5MFLOPS) reaching a competitive result of 3.31% Equal Error Rate (EER) on the well established VoxCeleb1 verification test set. We demonstrate the possibility of running our solution on small devices typical of IoT systems such as the Raspberry Pi 3B with a latency smaller than 200ms on a 5s long utterance. Additionally, we evaluate our model on the acoustically challenging VOiCES corpus. We report a limited increase in EER of 2.6 percentage points with respect to the best scoring model of the 2019 VOiCES from a Distance Challenge, against a reduction of 25.6 times in the number of learning parameters.

引用

页码：6179 / 6183

页数：5

共 50 条

[21] Quasi Text-Independent Speaker-Verification based on Pattern Matching
Gerber, Michael
Beutler, Rene
Pfister, Beat
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 93 - 96
[22] End-to-End Feature Learning for Text-Independent Speaker Verification
Chen, Fangzhou
Bian, Tengyue
Xu, Li
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3949 - 3954
[23] GENERATIVE X-VECTORS FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Xu, Longting
Das, Rohan Kumar
Yilmaz, Emre
Yang, Jichen
Li, Haizhou
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1014 - 1020
[24] TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES
Liu, Kai
Zhou, Huan
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6569 - 6573
[25] GRAPH ATTENTIVE FEATURE AGGREGATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Shim, Hye-Jin
Heo, Jungwoo
Park, Jae-Han
Lee, Ga-Hui
Yu, Ha-Jin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7972 - 7976
[26] A novel text-independent speaker verification method based on the global speaker model
Zhang, YY
Zhang, D
Zhu, XY
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (05): : 598 - 602
[27] Score Fusion Methods for Text-Independent Speaker Verification Applications
Rastoceanu, Florin
Lazar, Marilena
2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
[28] Bayesian Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Zhu, Yingke
Mak, Brian
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1000 - 1012
[29] Text-independent speaker verification using speaker clustering and support vector machines
Hou, FL
Wang, BX
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 456 - 459
[30] Strategies for End-to-End Text-Independent Speaker Verification
Lin, Weiwei
Mak, Man-Wai
Chien, Jen-Tzung
INTERSPEECH 2020, 2020, : 4308 - 4312

← 1 2 3 4 5 →