68 entries in total
[1] VQA: Visual Question Answering [J]. 2015 IEEE International Conference on Computer Vision (ICCV), 2015: 2425-2433
[2] Ba JL, 2016, arXiv
[3] End-to-End Object Detection with Transformers [J]. Computer Vision - ECCV 2020, Pt I, 2020, 12346: 213-229
[4] Learning to Detect Human-Object Interactions [J]. 2018 IEEE Winter Conference on Applications of Computer Vision (WACV 2018), 2018: 381-389
[5] HICO: A Benchmark for Recognizing Human-Object Interactions in Images [J]. 2015 IEEE International Conference on Computer Vision (ICCV), 2015: 1017-1025
[6] Reformulating HOI Detection as Adaptive Set Prediction [J]. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021, 2021: 9000-9009
[7] Chen Z., 2020, International Conference on Learning Representations
[8] Cheng Y., 2022, Neurocomputing
[10] Dosovitskiy A, 2021, arXiv, DOI arXiv:2010.11929