Causal reasoning for algorithmic fairness in voice controlled cyber-physical systems

Cited by: 1

Authors:

Fenu, Gianni [1]

Marras, Mirko [1]

Medda, Giacomo [1]

Meloni, Giacomo [1]

Affiliations:

[1] Univ Cagliari, Dept Math & Comp Sci, I-09124 Cagliari, Italy

Source:

PATTERN RECOGNITION LETTERS | 2023 / Vol. 168

Keywords:

Security; Authentication; Voice biometrics; Fairness; Speaker recognition; SPEAKER; TRANSFORMATION; BIAS;

DOI:

10.1016/j.patrec.2023.03.014

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automated speaker recognition is enabling personalized interactions with the voice-based interfaces and assistants that are part of modern cyber-physical-social systems. Unfortunately, prior studies have uncovered disparate impacts across demographic groups in the outcomes of speaker recognition systems and have consequently proposed a range of countermeasures. Understanding why a speaker recognition system may perform disparately for different individuals or groups, going beyond mere data imbalance and black-box countermeasures, is an essential yet under-explored perspective. In this paper, we propose an explanatory framework that aims to provide a better understanding of how speaker recognition models perform as the underlying voice characteristics on which they are tested change. With our framework, we evaluate two state-of-the-art speaker recognition models, comparing their fairness in terms of security through a systematic analysis of the impact of more than twenty voice characteristics. Our findings include important takeaways for making voice controlled cyber-physical-social systems work for everyone. Source code and data are available at https://bit.ly/EA-PRLETTERS . © 2023 Elsevier B.V. All rights reserved.

Pages: 131-137

Number of pages: 7
