共 50 条
- [1] Geometrically-Aware Dual Transformer Encoding Visual and Textual Features for Image Captioning ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT V, PAKDD 2024, 2024, 14649 : 15 - 27
- [3] Image Captioning Based on Visual Relevance and Context Dual Attention Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
- [4] Relational Attention with Textual Enhanced Transformer for Image Captioning PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 151 - 163
- [5] GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 167 - 184
- [7] Context-assisted Transformer for Image Captioning Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1889 - 1903