50 entries in total
[31] IMPROVING MANDARIN END-TO-END SPEECH SYNTHESIS BY SELF-ATTENTION AND LEARNABLE GAUSSIAN BIAS [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019: 208-213
[32] End-to-end text-to-speech synthesis with unaligned multiple language units based on attention [J]. INTERSPEECH 2020, 2020: 4009-4013
[33] ESPnet: End-to-End Speech Processing Toolkit [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2207-2211
[35] Neurorecognition visualization in multitask end-to-end speech [J]. OPTICAL FIBERS AND THEIR APPLICATIONS 2023, 2024, 12985
[37] End-to-End Localization and Ranking for Relative Attributes [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910: 753-769
[39] USING SPEECH SYNTHESIS TO TRAIN END-TO-END SPOKEN LANGUAGE UNDERSTANDING MODELS [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 8499-8503
[40] LEARNING LATENT REPRESENTATIONS FOR STYLE CONTROL AND TRANSFER IN END-TO-END SPEECH SYNTHESIS [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019: 6945-6949