21 entries in total
- [1] Chiu CC, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P4774, DOI 10.1109/ICASSP.2018.8462105
- [2] Chu W, 2009, INT CONF ACOUST SPEE, P3969, DOI 10.1109/ICASSP.2009.4960497
- [4] Gibiansky A., 2017, P ANN C NEUR INF PRO, P2962
- [5] Good Michael, 2001, VIRTUAL SCORE REPRES, P113
- [6] Ito Keith, 2017, LJ SPEECH DATASET
- [7] Kingma DP., 2017, A method for stochastic optimization, DOI 10.48550/ARXIV.1412.6980
- [8] Lee J., 2019, ARXIV190801919
- [9] Jasper: An End-to-End Convolutional Neural Acoustic Model, INTERSPEECH 2019, 2019, P71-75
- [10] Montreal Forced Aligner: trainable text-speech alignment using Kaldi, 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, P498-502