SPEAKER VERIFICATION WITH APPLICATION-AWARE BEAMFORMING

被引:0
作者
Mosner, Ladislav [1 ]
Plchot, Oldrich [1 ]
Rohdin, Johan [1 ]
Burget, Lukas [1 ]
Cernocky, Jan [1 ]
机构
[1] Brno Univ Technol, Fac Informat Technol, IT4I Ctr Excellence, Brno, Czech Republic
来源
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019) | 2019年
关键词
Speaker verification; beamforming; x-vector; generalized eigenvalue problem;
D O I
10.1109/asru46091.2019.9003932
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multichannel speech processing applications usually employ beamformers as means of speech enhancement through spatial filtering. Beamformers with learnable parameters require training to minimize a loss function that is not necessarily correlated with the final objective. In this paper, we present a framework employing recent neural network based generalized eigenvalue beamformer and application-specific model that allows for optimization of beamformer w.r.t. target application. In our case, the application is speaker verification which utilizes a speaker embedding (x-vector) extractor that conveniently comes with desired loss. We show that application-specific training of the beamformer brings performance improvements over a system trained in the standard way. We perform our analysis on the recently introduced VOiCES corpus which contains multichannel data and allows us to modify the evaluation trials such that enrollment recordings remain single-channel and test utterances are multichannel.
引用
收藏
页码:411 / 418
页数:8
相关论文
共 23 条
  • [1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [2] Acoustic beamforming for speaker diarization of meetings
    Anguera, Xavier
    Wooters, Chuck
    Hernando, Javier
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2011 - 2022
  • [3] [Anonymous], 2017, INTERSPEECH 2017
  • [4] [Anonymous], INT J COMPUTER APPL
  • [5] [Anonymous], IEEE T AUDIO SPEECH
  • [6] Boeddeker C, 2017, INT CONF ACOUST SPEE, P171, DOI 10.1109/ICASSP.2017.7952140
  • [7] HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS
    CAPON, J
    [J]. PROCEEDINGS OF THE IEEE, 1969, 57 (08) : 1408 - &
  • [8] Front-End Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick J.
    Dehak, Reda
    Dumouchel, Pierre
    Ouellet, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 788 - 798
  • [9] Erdogan H., IMPROVED MVDR BEAMFO, P1981
  • [10] Heymann J., P 4 INT WORKSH SPEEC