共 50 条
[11]
Rewind and Render: Towards Factually Accurate Text-to-Video Generation with Distilled Knowledge Retrieval
[J].
THIRTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI-25, VOL 39 NO 28,
2025,
:29652-29654
[12]
LONG TERM MEMORY-ENHANCED VIA CAUSAL REASONING FOR TEXT-TO-VIDEO RETRIEVAL
[J].
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024),
2024,
:8160-8164
[14]
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
[J].
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6,
2024,
:5207-5214
[15]
Grid Diffusion Models for Text-to-Video Generation
[J].
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024,
2024,
:8734-8743
[16]
WAVE: Warping DDIM Inversion Features for Zero-Shot Text-to-Video Editing
[J].
COMPUTER VISION - ECCV 2024, PT LXXVI,
2025, 15134
:38-55
[17]
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
[J].
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024,
2024,
:7038-7048
[18]
MEVG: Multi-event Video Generation with Text-to-Video Models
[J].
COMPUTER VISION-ECCV 2024, PT XLIII,
2025, 15101
:401-418
[19]
ImproveYourVideos: Architectural Improvements for Text-to-Video Generation Pipeline
[J].
IEEE ACCESS,
2025, 13
:1986-2003