共 324 条
[11]
Bian N, 2024, Arxiv, DOI [arXiv:2303.16421, DOI 10.48550/ARXIV.2303.16421]
[12]
Bisong E., 2019, Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, P485, DOI [DOI 10.1007/978-1-4842-4470-8_38, 10.1007/978-1-4842-4470-838, DOI 10.1007/978-1-4842-4470-838]
[13]
LaTr: Layout-Aware Transformer for Scene-Text VQA
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:16527-16537
[14]
Brock Andrew, 2021, P MACHINE LEARNING R, V139
[15]
Brown TB, 2020, ADV NEUR IN, V33
[16]
Byeon Minwoo, 2022, COYO-700M: Image -Text Pair Dataset
[18]
Cai RZ, 2023, Arxiv, DOI arXiv:2312.02896
[19]
Cao YH, 2023, Arxiv, DOI [arXiv:2303.04226, DOI 10.48550/ARXIV.2303.04226, 10.48550/arXiv.2303.04226]
[20]
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:3557-3567