Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

被引:11
|
作者
Zhang, Jisi [1 ]
Zorila, Catalin [2 ]
Doddipatla, Rama [2 ]
Barker, Jon [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[2] Toshiba Cambridge Res Lab, Cambridge, England
来源
INTERSPEECH 2021 | 2021年
关键词
semi-supervised learning; speech separation; teacher-student;
D O I
10.21437/Interspeech.2021-1243
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we introduce a novel semi-supervised learning framework for end-to-end speech separation. The proposed method first uses mixtures of unseparated sources and the mixture invariant training (MixIT) criterion to train a teacher model. The teacher model then estimates separated sources that are used to train a student model with standard permutation invariant training (PIT). The student model can be fine-tuned with supervised data, i.e., paired artificial mixtures and clean speech sources, and further improved via model distillation. Experiments with single and multi channel mixtures show that the teacher-student training resolves the over-separation problem observed in the original MixIT method. Further, the semi-supervised performance is comparable to a fully-supervised separation system trained using ten times the amount of supervised data.
引用
收藏
页码:3495 / 3499
页数:5
相关论文
共 50 条
  • [31] Semi-supervised Teacher-Reference-Student Architecture for Action Quality Assessment
    Yun, Wulian
    Qi, Mengshi
    Peng, Fei
    Ma, Huadong
    COMPUTER VISION - ECCV 2024, PT LXXIV, 2025, 15132 : 161 - 178
  • [32] ADVERSARIAL TEACHER-STUDENT LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION
    Meng, Zhong
    Li, Jinyu
    Gong, Yifan
    Juang, Biing-Hwang
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5949 - 5953
  • [33] A semi-supervised learning detection method for vision-based monitoring of construction sites by integrating teacher-student networks and data augmentation
    Xiao, Bo
    Zhang, Yuxuan
    Chen, Yuan
    Yin, Xianfei
    ADVANCED ENGINEERING INFORMATICS, 2021, 50
  • [34] Semi-Supervised and Unsupervised Extreme Learning Machines
    Huang, Gao
    Song, Shiji
    Gupta, Jatinder N. D.
    Wu, Cheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2405 - 2417
  • [35] Semi-Supervised Learning of Speech Sounds
    Jansen, Aren
    Niyogi, Partha
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2264 - 2267
  • [36] Semi-supervised learning of speech recognizers based on variational autoencoder and unsupervised data augmentation
    Ho, Hyeon
    Kang, Byung Ok
    Kwon, Oh-Wook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (06): : 578 - 586
  • [37] Supervised, semi-supervised and unsupervised inference of gene regulatory networks
    Maetschke, Stefan R.
    Madhamshettiwar, Piyush B.
    Davis, Melissa J.
    Ragan, Mark A.
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (02) : 195 - 211
  • [38] A Survey on Supervised, Unsupervised, and Semi-Supervised Approaches in Crowd Counting
    Wang, Jianyong
    Gao, Mingliang
    Li, Qilei
    Kim, Hyunbum
    Jeon, Gwanggil
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 3561 - 3582
  • [39] Sentiment analysis in Turkish: Supervised, semi-supervised, and unsupervised techniques
    Aydin, Cem Rifki
    Gungor, Tunga
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (04) : 455 - 483
  • [40] Ensemble learning with trees and rules: Supervised, semi-supervised, unsupervised
    Akdemir, Deniz
    Jannink, Jean-Luc
    INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 857 - 872