共 31 条
[1]
NTCD-TIMIT: A New Database and Baseline for Noise-robust Audio-visual Speech Recognition
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:3752-3756
[2]
[Anonymous], 2013, SYNTHESIS LECT SPEEC
[3]
[Anonymous], 2017, NEURIPS
[5]
Bando Y., 2020, ISCA INTERSPEECH, P2437
[6]
Bando Y, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P716, DOI 10.1109/ICASSP.2018.8461530
[8]
GUIDED VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A SUPERVISED CLASSIFIER
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:681-685
[9]
Chen RTQ, 2018, 32 C NEURAL INFORM P, V31
[10]
Creswell A., 2018, ARXIV171105175CS