共 56 条
[1]
Baevski A., 2020, Advances in neural information processing systems, V33, P12449, DOI 10.5555/3495724.3496768
[2]
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
[J].
INTERSPEECH 2019,
2019,
:3790-3794
[3]
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
[J].
INTERSPEECH 2019,
2019,
:4115-4119
[4]
Effectively Building Tera Scale MaxEnt Language Models Incorporating Non-Linguistic Signals
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:2710-2714
[5]
Carletta J, 2005, LECT NOTES COMPUT SC, V3869, P28
[6]
Chan William, 2021, ARXIV PREPRINT ARXIV
[7]
Changhao Shan, 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Proceedings, P5631, DOI 10.1109/ICASSP.2019.8682490
[8]
Chen Zhehuai, 2020, INTERSPEECH
[9]
Chen Zhehuai, 2021, INTERSPEECH
[10]
Chung YA, 2020, INT CONF ACOUST SPEE, P3497, DOI [10.1109/icassp40776.2020.9054438, 10.1109/ICASSP40776.2020.9054438]