Distilling knowledge from Gaussian process teacher to neural network student

被引:1
作者
Wong, Jeremy H. M. [1 ]
Zhang, Huayun [1 ]
Chen, Nancy F. [1 ]
机构
[1] ASTAR, Inst Infocomm Res I2R, Singapore, Singapore
来源
INTERSPEECH 2023 | 2023年
关键词
Gaussian process; neural network; knowledge distillation; ensemble combination; spoken language assessment; MISPRONUNCIATION DETECTION;
D O I
10.21437/Interspeech.2023-190
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Neural Networks (NN) and Gaussian Processes (GP) are different modelling approaches. The former stores characteristics of the training data in its many parameters, and then performs inference by parsing inputs through these parameters. The latter instead performs inference by computing a similarity between the test and training inputs, and then predicts test outputs that are correlated with the reference training outputs of similar inputs. These models may be combined to leverage upon their diversity. However, both combination and the matrix computations for GP inference are expensive. This paper investigates whether a NN student is able to effectively learn from the information distilled from a GP or ensemble teacher. It is computationally cheaper to infer using this student. Experiments on the speechocean762 spoken language assessment dataset suggest that learning is effective.
引用
收藏
页码:426 / 430
页数:5
相关论文
共 38 条
  • [1] [Anonymous], 2014, P INTERSPEECH
  • [2] Bauer M, 2016, ADV NEUR IN, V29
  • [3] Bourlard Herve A., 1994, Connectionist Speech Recognition: A Hybrid Approach
  • [4] Chan W, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P3264
  • [5] Dean J., 2015, ARXIV PREPRINT ARXIV
  • [6] CORRELATION COEFFICIENTS MEASURED ON SAME INDIVIDUALS
    DUNN, OJ
    CLARK, V
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1969, 64 (325) : 366 - &
  • [7] Ellis DPW, 2001, INT CONF ACOUST SPEE, P517, DOI 10.1109/ICASSP.2001.940881
  • [8] Garcia-Romero D, 2017, INT CONF ACOUST SPEE, P4930, DOI 10.1109/ICASSP.2017.7953094
  • [9] Ghahremani Pegah, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P2494, DOI 10.1109/ICASSP.2014.6854049
  • [10] Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition
    Gong, Xun
    Zhou, Zhikai
    Qian, Yanmin
    [J]. INTERSPEECH 2022, 2022, : 2618 - 2622