Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

被引:11
|
作者
Zhang, Jisi [1 ]
Zorila, Catalin [2 ]
Doddipatla, Rama [2 ]
Barker, Jon [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[2] Toshiba Cambridge Res Lab, Cambridge, England
来源
INTERSPEECH 2021 | 2021年
关键词
semi-supervised learning; speech separation; teacher-student;
D O I
10.21437/Interspeech.2021-1243
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we introduce a novel semi-supervised learning framework for end-to-end speech separation. The proposed method first uses mixtures of unseparated sources and the mixture invariant training (MixIT) criterion to train a teacher model. The teacher model then estimates separated sources that are used to train a student model with standard permutation invariant training (PIT). The student model can be fine-tuned with supervised data, i.e., paired artificial mixtures and clean speech sources, and further improved via model distillation. Experiments with single and multi channel mixtures show that the teacher-student training resolves the over-separation problem observed in the original MixIT method. Further, the semi-supervised performance is comparable to a fully-supervised separation system trained using ten times the amount of supervised data.
引用
收藏
页码:3495 / 3499
页数:5
相关论文
共 50 条
  • [21] Semi-supervised end-to-end ASR via teacher-student learning with conditional posterior distribution
    Zhang, Zi-qiang
    Song, Yan
    Zhang, Jian-shu
    McLoughlin, Ian
    Dai, Li-Rong
    INTERSPEECH 2020, 2020, : 3580 - 3584
  • [22] Unsupervised and semi-supervised adaptation of a hybrid speech recognition system
    Trmal, Jan
    Zelinka, Jan
    Mueller, Ludek
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 527 - 530
  • [23] A Semi-Supervised Method for Grain Boundary Segmentation: Teacher-Student Knowledge Distillation and Pseudo-Label Repair
    Huang, Yuanyou
    Zhang, Xiaoxun
    Ma, Fang
    Li, Jiaming
    Wang, Shuxian
    ELECTRONICS, 2024, 13 (17)
  • [24] Dual Student: Breaking the Limits of the Teacher in Semi-supervised Learning
    Ke, Zhanghan
    Wang, Daoye
    Yan, Qiong
    Ren, Jimmy
    Lau, Rynson W. H.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6727 - 6735
  • [25] A Dual-Headed Teacher-Student Framework with an Uncertainty-Guided Mechanism for Semi-Supervised Skin Lesion Segmentation
    Zou, Changman
    Jeon, Wang-Su
    Ju, Hye-Rim
    Rhee, Sang-Yong
    ELECTRONICS, 2025, 14 (05):
  • [26] Teacher-Student Competition for Unsupervised Domain Adaptation
    Xiao, Ruixin
    Liu, Zhilei
    Wu, Baoyuan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8291 - 8298
  • [27] Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels
    Hailat, Zeyad
    Chen, Xue-Wen
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 907 - 912
  • [28] Semi-supervised and unsupervised discriminative language model training for automatic speech recognition
    Dikici, Erinc
    Saraclar, Murat
    SPEECH COMMUNICATION, 2016, 83 : 54 - 63
  • [29] Semi-supervised pedestrian re-identification via a teacher-student model with similarity-preserving generative adversarial networks
    Zhao, Botong
    Wang, Yanjie
    Su, Keke
    Ren, Hong
    Han, Xiyu
    APPLIED INTELLIGENCE, 2023, 53 (02) : 1605 - 1618
  • [30] Semi-supervised Learning via Improved Teacher-Student Network for Robust 3D Reconstruction of Stereo Endoscopic Image
    Shi, Hongkuan
    Wang, Zhiwei
    Lv, Jinxin
    Wang, Yilang
    Zhang, Peng
    Zhu, Fei
    Li, Qiang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4661 - 4669