共 50 条
- [1] Enhanced-Memory Transformer for Coherent Paragraph Video Captioning 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 836 - 840
- [2] Exploring adaptive attention in memory transformer applied to coherent video paragraph captioning 2022 IEEE EIGHTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2022), 2022, : 37 - 44
- [3] MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2603 - 2614
- [5] STVGBert: A Visual-linguistic Transformer based Framework for Spatio-temporal Video Grounding 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1513 - 1522
- [6] Descriptive and Coherent Paragraph Generation for Image Paragraph Captioning Using Vision Transformer and Post-processing ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2023, 2023, 14124 : 40 - 52
- [7] Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning International Journal of Computer Vision, 2023, 131 : 82 - 100