Distilling knowledge from Gaussian process teacher to neural network student

Cited by: 1

Authors:

Wong, Jeremy H. M. [1]

Zhang, Huayun [1]

Chen, Nancy F. [1]

Affiliations:

[1] A*STAR, Institute for Infocomm Research (I2R), Singapore

Source:

INTERSPEECH 2023 | 2023

Keywords:

Gaussian process; neural network; knowledge distillation; ensemble combination; spoken language assessment; mispronunciation detection

DOI:

10.21437/Interspeech.2023-190

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Neural networks (NNs) and Gaussian processes (GPs) are different modelling approaches. An NN stores characteristics of the training data in its many parameters, and performs inference by passing inputs through these parameters. A GP instead performs inference by computing a similarity between the test and training inputs, and then predicts test outputs that are correlated with the reference training outputs of similar inputs. The two models may be combined to leverage their diversity. However, both ensemble combination and the matrix computations for GP inference are expensive. This paper investigates whether an NN student can learn effectively from the information distilled from a GP or ensemble teacher. Inference with this student is computationally cheaper. Experiments on the speechocean762 spoken language assessment dataset suggest that learning is effective.
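The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes a 1-D regression toy problem, an RBF kernel, and a student that simply regresses onto the teacher's posterior mean with a mean-squared-error loss. The abstract does not say which distillation criterion the paper uses. All names here (`rbf_kernel`, `gp_teacher_mean`, `train_student`) are hypothetical.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential similarity between two sets of 1-D inputs."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)

def gp_teacher_mean(x_train, y_train, x_query, noise=0.1):
    """GP posterior mean: query outputs follow training outputs of similar inputs."""
    k_train = rbf_kernel(x_train, x_train) + noise**2 * np.eye(len(x_train))
    k_query = rbf_kernel(x_query, x_train)
    # Solving this N x N system is the costly part for large training sets.
    return k_query @ np.linalg.solve(k_train, y_train)

def train_student(x, targets, hidden=8, steps=3000, lr=0.02, seed=0):
    """Fit a one-hidden-layer tanh network to teacher outputs (assumed MSE loss)."""
    rng = np.random.default_rng(seed)
    w1 = rng.normal(size=(1, hidden))
    b1 = np.zeros(hidden)
    w2 = 0.1 * rng.normal(size=(hidden, 1))
    b2 = np.zeros(1)
    x_col, t_col = x[:, None], targets[:, None]
    losses = []
    for _ in range(steps):
        h = np.tanh(x_col @ w1 + b1)
        err = h @ w2 + b2 - t_col
        losses.append(float(np.mean(err**2)))
        # Backpropagate the mean-squared error through both layers.
        g_pred = 2.0 * err / len(x)
        g_w2, g_b2 = h.T @ g_pred, g_pred.sum(axis=0)
        g_h = (g_pred @ w2.T) * (1.0 - h**2)
        g_w1, g_b1 = x_col.T @ g_h, g_h.sum(axis=0)
        w1 -= lr * g_w1
        b1 -= lr * g_b1
        w2 -= lr * g_w2
        b2 -= lr * g_b2

    def predict(x_query):
        # Student inference: one pass through the parameters, no training data needed.
        return (np.tanh(x_query[:, None] @ w1 + b1) @ w2 + b2)[:, 0]

    return predict, losses

# Toy data (an assumption for illustration): noisy samples of a sine curve.
rng = np.random.default_rng(0)
x_train = np.linspace(-3.0, 3.0, 30)
y_train = np.sin(x_train) + 0.1 * rng.normal(size=30)

# The teacher labels extra inputs; the student learns from those soft targets.
x_distill = np.linspace(-3.0, 3.0, 200)
teacher_targets = gp_teacher_mean(x_train, y_train, x_distill)
student, losses = train_student(x_distill, teacher_targets)
student_pred = student(x_distill)
```

After training, `student(x)` needs only a forward pass through a fixed set of weights, whereas `gp_teacher_mean` must revisit the full training set for every query. This is the cost difference that the abstract attributes to the student.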


Pages: 426-430

Page count: 5

