共 50 条
[41]
Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening
[J].
Visual Intelligence,
2025, 3 (1)
[43]
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
[J].
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7,
2024,
:6639-6647
[44]
Multi-Conditional Generative Adversarial Network for Text-to-Video Synthesis
[J].
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics,
2022, 34 (10)
:1567-1579
[46]
Human Motion Aware Text-to-Video Generation with Explicit Camera Control
[J].
2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024,
2024,
:5069-5078
[47]
HOW TEXT-TO-VIDEO TOOL SORA COULD SHAPE SCIENCE - AND SOCIETY
[J].
NATURE,
2024, 627 (8004)
:475-476
[48]
T2VBench: Benchmarking Temporal Dynamics for Text-to-Video Generation
[J].
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW,
2024,
:5325-5335
[49]
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
[J].
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2024,
:9212-9221
[50]
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
[J].
COMPUTER VISION - ECCV 2024, PT LXXXIX,
2025, 15147
:332-349