共 60 条
[1]
Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
[J].
ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION,
2019,
:220-225
[4]
Afouras T, 2018, Arxiv, DOI arXiv:1809.02108
[5]
Jalalifar SA, 2018, Arxiv, DOI arXiv:1803.07461
[6]
Expressive Visual Text-To-Speech Using Active Appearance Models
[J].
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2013,
:3382-3389
[7]
[Anonymous], 2011, ACM transactions on graphics (TOG), DOI [DOI 10.1145/1964921.1964972, 10.1145/2010324.1964972]
[8]
Arevalo J, 2017, Arxiv, DOI [arXiv:1702.01992, DOI 10.48550/ARXIV.1702.01992]
[9]
Arslan L., 1998, INT C AUD VIS SPEECH, P175
[10]
High-Quality Passive Facial Performance Capture using Anchor Frames
[J].
ACM TRANSACTIONS ON GRAPHICS,
2011, 30 (04)