共 44 条
[1]
Balaji Y., 2022, arXiv
[2]
Text2LIVE: Text-Driven Layered Image and Video Editing
[J].
COMPUTER VISION - ECCV 2022, PT XV,
2022, 13675
:707-723
[3]
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:22563-22575
[4]
Brooks Tim, 2024, Video generation models as world simulators, V1, P2
[5]
Cao MD, 2023, Arxiv, DOI arXiv:2304.08465
[6]
Pix2Video: Video Editing using Image Diffusion
[J].
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023),
2023,
:23149-23160
[7]
Chai Wenhao, 2023, P IEEE CVF INT C COM, P23040
[8]
Chen WF, 2024, Arxiv, DOI arXiv:2305.13840
[9]
Cong YR, 2024, Arxiv, DOI arXiv:2310.05922
[10]
Structure and Content-Guided Video Synthesis with Diffusion Models
[J].
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV,
2023,
:7312-7322