共 61 条
[1]
Anderson P, 2018, PROC CVPR IEEE, P6077, DOI [10.1109/CVPR.2018.00636, 10.1002/ett.70087]
[2]
SPICE: Semantic Propositional Image Caption Evaluation
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:382-398
[3]
Aneja J., 2018, P IEEE CVF C COMP VI
[4]
Anil R., 2018, ICLR
[5]
Banerjee S., 2005, P ACL WORKSHOP INTRI, P65, DOI DOI 10.3115/1626355.1626389
[6]
Bruno P., 2022, P INT C IM AN PROC
[7]
Cagrandi M., 2021, P ACM INT C MULT RET
[8]
Emerging Properties in Self-Supervised Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:9630-9640
[9]
Cornia M., 2021, AI Communications, P1
[10]
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
[J].
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA),
2020,
:1128-1134