[30] Phoneme-to-audio alignment with recurrent neural networks for speaking and singing voice [J]. INTERSPEECH 2021, 2021: 61-65.