共 32 条
[1]
Barratt S, 2018, Arxiv, DOI arXiv:1801.01973
[2]
Beltagy I, 2020, Arxiv, DOI arXiv:2004.05150
[3]
Berthelot D, 2017, Arxiv, DOI arXiv:1703.10717
[4]
Dhariwal P, 2021, ADV NEUR IN, V34
[5]
Taming Transformers for High-Resolution Image Synthesis
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:12868-12878
[6]
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[7]
Vector Quantized Diffusion Model for Text-to-Image Synthesis
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2022,
:10686-10696
[8]
Heusel M, 2017, ADV NEUR IN, V30
[9]
Ho Jonathan., 2020, P 34 INT C NEURAL IN, P6840
[10]
Khanna S, 2024, Arxiv, DOI [arXiv:2312.03606, 10.48550/arXiv.2312.03606]