8 entries in total
[1]
Bain M, 2023, arXiv, DOI 10.48550/arXiv.2303.00747
[2]
Goto Kenta, 2016, IPSJ AN10096193, No. 2016-CE-133, V11, P1
[3]
huggingface, Wav2vec2(asr-base-960h)
[4]
Kurzinger Ludwig, 2020, Speech and Computer. 22nd International Conference, SPECOM 2020. Proceedings. Lecture Notes in Artificial Intelligence Subseries of Lecture Notes in Computer Science (LNAI 12335), P267, DOI 10.1007/978-3-030-60276-5_27
[5]
Montreal Forced Aligner: trainable text-speech alignment using Kaldi [J]. 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), Vols 1-6: Situated Interaction, 2017: 498-502
[6]
Radford A., 2022, arXiv, DOI 10.48550/arXiv.2212.04356
[7]
Teng Haikun, 2019, Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System (ICDMML 2019