共 40 条
[1]
BAI HT, 2022, ECCV, V3669, P612, DOI DOI 10.1007/978-3-031-20077-9_36
[2]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[3]
E2Net: Excitative-Expansile Learning forWeakly Supervised Object Localization
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:573-581
[4]
Chen ZW, 2022, AAAI CONF ARTIF INTE, P410
[6]
Attention-based Dropout Layer for Weakly Supervised Object Localization
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:2214-2223
[7]
Dosovitskiy A., 2021, INT C LEARN REPRESEN
[8]
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:2866-2875
[9]
Guo Guangyu, 2021, IEEE CVPR
[10]
ViTOL: Vision Transformer for Weakly Supervised Object Localization
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022,
2022,
:4100-4109