共 71 条
[1]
[Anonymous], 2009, CVPR
[2]
Ba J. L., 2016, arXiv, DOI 10.48550/arXiv:1607.06450
[3]
Bahng H, 2022, Arxiv, DOI arXiv:2203.17274
[4]
Ben-Zaken E, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, P1
[5]
Bommasani R., 2022, On the opportunities and risks of foundation models, DOI [10.48550/arXiv.2108.07258, DOI 10.48550/ARXIV.2108.07258]
[6]
Brown TB, 2020, ADV NEUR IN, V33
[7]
Cai H, 2020, ADV NEUR IN, V33
[8]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[9]
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
[J].
COMPUTER VISION - ECCV 2018, PT VII,
2018, 11211
:833-851
[10]
An Empirical Study of Training Self-Supervised Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:9620-9629