33 entries in total
[1]
Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
[J].
ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION,
2019,
:220-225
[2]
Afouras T, 2018, arXiv, DOI arXiv:1809.00496
[3]
Audio-Visual Face Reenactment
[J].
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV),
2023,
:5167-5176
[4]
BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:4945-4954
[5]
Hierarchical Cross-Modal Talking Face Generation with Dynamic Pixel-Wise Loss
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:7824-7833
[6]
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
[J].
PROCEEDINGS SIGGRAPH ASIA 2022,
2022,
[7]
Chi L., 2020, Adv Neural Inf Process Syst, V33, P4479, DOI 10.5555/3495724.3496100
[8]
Out of Time: Automated Lip Sync in the Wild
[J].
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II,
2017, 10117
:251-263
[10]
Capture, Learning, and Synthesis of 3D Speaking Styles
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:10093-10103