ANALYSIS OF THE DNN-BASED SRE SYSTEMS IN MULTI-LANGUAGE CONDITIONS

被引:0
|
作者
Novotny, Ondrej [1 ]
Matejka, Pavel
Glembek, Ondrej
Plchot, Oldrich
Grezl, Frantisek
Burget, Lukas
Cernocky, Jan
机构
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
关键词
DNN; Multi-Language; Speaker Recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the "Language Pack" of the PRISM set, we evaluate the systems' performance using the NIST's standard metrics. We show that not only the gain from using DNNs vanishes, nor using dedicated DNNs for target conditions helps, but also the DNN-based systems tend to produce de-calibrated scores under the studied conditions. This work gives suggestions for directions of future research rather than any particular solutions to these issues.
引用
收藏
页码:199 / 204
页数:6
相关论文
共 50 条
  • [41] IMPACT OF SINGLE-MICROPHONE DEREVERBERATION ON DNN-BASED MEETING TRANSCRIPTION SYSTEMS
    Yoshioka, Takuya
    Chen, Xie
    Gales, Mark J. F.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [42] Multi-Language Ontology-based Search Engine
    Zhuhadar, Leyla
    Nasraoui, Olfa
    Wyatt, Robert
    Romero, Elizabeth
    THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER-HUMAN INTERACTIONS: ACHI 2010, 2010, : 13 - 18
  • [43] Perspectives to promote modularity, reusability, and consistency in multi-language systems
    Hyacinth Ali
    Gunter Mussbacher
    Jörg Kienzle
    Innovations in Systems and Software Engineering, 2022, 18 : 5 - 37
  • [44] Nexus: A GPU Cluster Engine for Accelerating DNN-Based Video Analysis
    Shen, Haichen
    Chen, Lequn
    Jin, Yuchen
    Zhao, Liangyu
    Kong, Bingyu
    Philipose, Matthai
    Krishnamurthy, Arvind
    Sundaram, Ravi
    PROCEEDINGS OF THE TWENTY-SEVENTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '19), 2019, : 322 - 337
  • [45] A DNN-BASED ACOUSTIC MODELING OF TONAL LANGUAGE AND ITS APPLICATION TO MANDARIN PRONUNCIATION TRAINING
    Hu, Wenping
    Qian, Yao
    Soong, Frank K.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [46] Perspectives to promote modularity, reusability, and consistency in multi-language systems
    Ali, Hyacinth
    Mussbacher, Gunter
    Kienzle, Jorg
    INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2022, 18 (01) : 5 - 37
  • [47] Towards an efficient simulation of multi-language descriptions of heterogeneous systems
    Dubois, Mathieu
    Aboulhamid, El Mostapha
    Rousseau, Frederic
    2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 538 - +
  • [48] Leveraging Synthetic Data for DNN-Based Visual Analysis of Passenger Seats
    Aranjuelo N.
    Apellaniz J.L.
    Unzueta L.
    Garcia J.
    Garcia S.
    Elordi U.
    Otaegui O.
    SN Computer Science, 4 (1)
  • [49] Delta-MelSpectra Features for Noise Robustness to DNN-based ASR systems
    Kumar, Kshitiz
    Liu, Chaojun
    Gong, Yifan
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2445 - 2448
  • [50] Online DNN-based Channel Estimator for Massive MIMO Systems with Nonlinear Distortion
    Zheng, Xuanyu
    Lau, Vincent
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,