Causal reasoning for algorithmic fairness in voice controlled cyber-physical systems

被引:1
作者
Fenu, Gianni [1 ]
Marras, Mirko [1 ]
Medda, Giacomo [1 ]
Meloni, Giacomo [1 ]
机构
[1] Univ Cagliari, Dept Math & Comp Sci, I-09124 Cagliari, Italy
关键词
Security; Authentication; Voice biometrics; Fairness; Speaker recognition; SPEAKER; TRANSFORMATION; BIAS;
D O I
10.1016/j.patrec.2023.03.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated speaker recognition is enabling personalized interactions with the voice-based interfaces and assistants part of the modern cyber-physical-social systems. Prior studies have unfortunately uncovered disparate impacts across demographic groups on the outcomes of speaker recognition systems and consequently proposed a range of countermeasures. Understanding why a speaker recognition system may lead to this disparate performance for different (groups of) individuals, going beyond mere data imbalance reasons and black-box countermeasures, is an essential yet under-explored perspective. In this paper, we propose an explanatory framework that aims to provide a better understanding of how speaker recognition models perform as the underlying voice characteristics on which they are tested change. With our framework, we evaluate two state-of-the-art speaker recognition models, comparing their fairness in terms of security, through a systematic analysis of the impact of more than twenty voice characteristics. Our findings include important takeaways to enable voice controlled cyber-physical-social systems for everyone. Source code and data are available at https://bit.ly/EA-PRLETTERS . (c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:131 / 137
页数:7
相关论文
共 41 条
  • [41] Mitigating bias against non-native accents
    Zhang, Yuanyuan
    Zhang, Yixuan
    Halpern, Bence Mark
    Patel, Tanvina
    Scharenborg, Odette
    [J]. INTERSPEECH 2022, 2022, : 3168 - 3172