23 entries in total
[1]
Akuzawa K, 2018, INTERSPEECH, P3067
[2]
Dong LH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5884, DOI 10.1109/ICASSP.2018.8462506
[3]
Graves A., 2006, P 23 INT C MACH LEAR, P369, DOI 10.1145/1143844.1143891
[4]
Graves A, 2014, PR MACH LEARN RES, V32, P1764
[5]
SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02): 236-243
[6]
Two-Stage Data Augmentation for Low-Resourced Speech Recognition [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 2378-2382
[7]
Hsu WN, 2019, INT CONF ACOUST SPEE, P5901, DOI 10.1109/ICASSP.2019.8683561
[8]
Ito Keith, 2017, LJ SPEECH DATASET
[9]
Karita Shigeki, 2019, CoRR
[10]
Kingma D.P., 2014, Auto-encoding variational bayes