共 49 条
[1]
Building audio-visual phonetically annotated Arabic corpus for expressive text to speech
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:3767-3771
[2]
Time-domain envelope modulating the noise component of excitation in a continuous residual-based vocoder for statistical parametric speech synthesis
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:434-438
[3]
Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder
[J].
SPEECH AND COMPUTER, SPECOM 2017,
2017, 10458
:282-291
[4]
[Anonymous], PREDICTION PERCEIVED
[5]
[Anonymous], P INT 2011
[6]
[Anonymous], AUDITORY PROCESSING
[7]
[Anonymous], P INT COMP MUS C GLA
[8]
[Anonymous], 2003, COMPUT SYST
[9]
[Anonymous], P ADV NONL SPEECH PR
[10]
[Anonymous], 2010, P INT SPEECH COMM AS