ANALYSIS OF THE DNN-BASED SRE SYSTEMS IN MULTI-LANGUAGE CONDITIONS

Cited by: 0
Authors
Novotny, Ondrej [1]
Matejka, Pavel
Glembek, Ondrej
Plchot, Oldrich
Grezl, Frantisek
Burget, Lukas
Cernocky, Jan
Affiliation
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
Keywords
DNN; Multi-Language; Speaker Recognition
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the "Language Pack" of the PRISM set, we evaluate the systems' performance using NIST's standard metrics. We show that the gain from using DNNs vanishes, that using dedicated DNNs for the target conditions does not help, and that the DNN-based systems tend to produce de-calibrated scores under the studied conditions. This work suggests directions for future research rather than any particular solutions to these issues.
Pages: 199-204
Page count: 6