共 31 条
[1]
[Anonymous], 2019, P ICML
[2]
Direct Acoustics-to-Word Models for English Conversational Speech Recognition
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:959-963
[3]
Bahdanau Dzmitry, 2015, P INT C AC SPEECH SI, P4945
[4]
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
[J].
INTERSPEECH 2019,
2019,
:3790-3794
[5]
Bengio S, 2015, ADV NEUR IN, V28
[6]
Chan W, 2016, INT CONF ACOUST SPEE, P4960, DOI 10.1109/ICASSP.2016.7472621
[7]
Towards better decoding and language model integration in sequence to sequence models
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:523-527
[8]
Dong LH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5884, DOI 10.1109/ICASSP.2018.8462506
[9]
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:949-953
[10]
Hori T, 2019, INT CONF ACOUST SPEE, P6271, DOI [10.1109/icassp.2019.8683307, 10.1109/ICASSP.2019.8683307]