UNSUPERVISED MULTI-CHANNEL SEPARATION AND ADAPTATION

被引:0
作者
Han, Cong [1 ,2 ]
Wilson, Kevin [2 ]
Wisdom, Scott [2 ]
Hershey, John R. [2 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Google, Mountain View, CA 94043 USA
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年
关键词
multi-channel; speech separation;
D O I
10.1109/ICASSP48485.2024.10447422
中图分类号
学科分类号
摘要
A key challenge in machine learning is to generalize from training data to an application domain of interest. This work extends the recently-proposed mixture invariant training (MixIT) algorithm to perform unsupervised learning in the multi-channel setting. We use MixIT to train a model on far-field microphone array recordings of overlapping reverberant and noisy speech from the AMI Corpus. The models are trained on both supervised and unsupervised training data, and are tested on real AMI recordings containing overlapping speech. To objectively evaluate our models, we also use a synthetic multi-channel AMI test set. Holding network architectures constant, we find that semi-supervised fine-tuning of a model pretrained on a large and diverse single-channel dataset yields the largest improvement to SI-SNR and to human listening ratings across synthetic and real datasets, outperforming supervised models trained on well-matched synthetic data. Our results demonstrate that unsupervised learning through MixIT enables model adaptation on both single- and multi-channel real-world speech recordings.
引用
收藏
页码:721 / 725
页数:5
相关论文
共 50 条
  • [31] Multi-channel Authentication for Online Banking
    AlFairuz, Mohamed
    SUSTAINABLE ECONOMIC GROWTH, EDUCATION EXCELLENCE, AND INNOVATION MANAGEMENT THROUGH VISION 2020, VOLS I-VII, 2017, : 1959 - 1968
  • [32] Multi-channel support for DMAC in WSNs
    LI, Jie
    ZHANG, Yong
    LI, Zhi
    XU, Hui-yang
    WANG, Li
    Journal of China Universities of Posts and Telecommunications, 2007, 14 (SUPPL. 1): : 27 - 31
  • [33] Misbehavior in Multi-Channel MAC Protocols
    Zhang, Yan
    Lazos, Loukas
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2020, 17 (04) : 760 - 774
  • [34] MCB - A multi-channel beaconing protocol
    Klingler, Florian
    Dressler, Falko
    Cao, Jiannong
    Sommer, Christoph
    AD HOC NETWORKS, 2016, 36 : 258 - 269
  • [35] Directional Multi-channel MAC for VANETs
    Duc Ngoc Minh Dang
    Quynh Tu Ngo
    Hanh Ngoc Dang
    Phuong Luu Vo
    Choong Seon Hong
    AETA 2016: RECENT ADVANCES IN ELECTRICAL ENGINEERING AND RELATED SCIENCES: THEORY AND APPLICATION, 2017, 415 : 803 - 812
  • [36] Multi-channel quantum parameter estimation
    Bao, Liying
    Qi, Bo
    Wang, Yabo
    Dong, Daoyi
    Wu, Rebing
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [37] Deterministic multi-channel information exchange
    Holzer, Stephan
    Locher, Thomas
    Pignolet, Yvonne Anne
    Wattenhofer, Roger
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2017, 87 : 84 - 103
  • [38] Multi-channel quantum parameter estimation
    Liying Bao
    Bo Qi
    Yabo Wang
    Daoyi Dong
    Rebing Wu
    Science China Information Sciences, 2022, 65
  • [39] A Multi-channel Adaptive Equalization Method
    Xiao, Shanghui
    Zhang, Mengyao
    Liu, Jian
    Xu, Qiang
    Pan, Wensheng
    Ma, Wanzhi
    Liu, Ying
    Shao, Shihai
    2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,
  • [40] Channel Allocation Evaluation for a multi-channel MAC protocol
    Diab, Rana
    Chalhoub, Gerard
    Misson, Michel
    2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 1857 - 1862