38 entries in total
[2] Brown TB, 2020, ADV NEUR IN, V33
[3] End-to-End Object Detection with Transformers [J]. COMPUTER VISION - ECCV 2020, PT I, 2020, 12346: 213-229
[4] Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 5152-5161
[5] Multi-modal Masked Autoencoders for Medical Vision-and-Language Pre-training [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435: 679-689
[6] DWT-CV: Dense weight transfer-based cross validation strategy for model selection in biomedical data analysis [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135: 20-29
[7] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[8] Ding M, 2022, Arxiv, DOI [arXiv:2204.14217, 10.48550/arXiv.2204.14217]
[9] An Empirical Study of Training End-to-End Vision-and-Language Transformers [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 18145-18155