Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

被引:1
|
作者
Kataria, Saurabh [1 ,2 ]
Villalba, Jesus [1 ,2 ]
Moro-Velazquez, Laureano [1 ]
Dehak, Najim [1 ,2 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA
来源
INTERSPEECH 2022 | 2022年
关键词
domain adaptation; speech bandwidth extension; time-domain GAN; non-parallel learning; joint learning;
D O I
10.21437/Interspeech.2022-10900
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech systems developed for a particular choice of acoustic domain and sampling frequency do not translate easily to others. The usual practice is to learn domain adaptation and bandwidth extension models independently. Contrary to this, we propose to learn both tasks together. Particularly, we learn to map narrow-band conversational telephone speech to wideband microphone speech. We developed parallel and non-parallel learning solutions which utilize both paired and unpaired data. We first discuss joint and disjoint training of multiple generative models for our tasks. Then, we propose a two-stage learning solution using a pre-trained domain adaptation system for pre-processing in bandwidth extension training. We evaluated our schemes on a Speaker Verification downstream task. We used the JHU-MIT experimental setup for NIST SRE21, which comprises SRE16, SRE-CTS Superset, and SRE21. Our results prove that learning both tasks is better than learning just one. On SRE16, our best system achieves 22% relative improvement in Equal Error Rate w.r.t. a direct learning baseline and 8% w.r.t. a strong bandwidth expansion system.
引用
收藏
页码:615 / 619
页数:5
相关论文
共 44 条
  • [41] Towards soft real-time fault diagnosis for edge devices in industrial IoT using deep domain adaptation training strategy
    Kumar, Dileep
    Ujjan, Sanaullah Mehran
    Dev, Kapal
    Khowaja, Sunder Ali
    Bhatti, Naveed Anwar
    Hussain, Tanweer
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 160 : 90 - 99
  • [42] A deep learning-based technique for firm classification and domain adaptation in land cover classification using time-series aerial images
    Indrajit Kalita
    Shounak Chakraborty
    Talla Giridhara Ganesh Reddy
    Moumita Roy
    Earth Science Informatics, 2024, 17 : 655 - 678
  • [43] A deep learning-based technique for firm classification and domain adaptation in land cover classification using time-series aerial images
    Kalita, Indrajit
    Chakraborty, Shounak
    Reddy, Talla Giridhara Ganesh
    Roy, Moumita
    EARTH SCIENCE INFORMATICS, 2024, 17 (01) : 655 - 678
  • [44] Assessing fusarium oxysporum disease severity in cotton using unmanned aerial system images and a hybrid domain adaptation deep learning time series model
    Abdalla, Alwaseela
    Wheeler, Terry A.
    Dever, Jane
    Lin, Zhe
    Arce, Joel
    Guo, Wenxuan
    BIOSYSTEMS ENGINEERING, 2024, 237 : 220 - 231