50 entries in total
- [41] Attention-based Visual-Audio Fusion for Video Caption Generation [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019 : 839 - 844
- [42] Image Caption Generation Using Contextual Information Fusion With Bi-LSTM-s [J]. IEEE ACCESS, 2023, 11 : 134 - 143
- [43] Mind's Eye: A Recurrent Visual Representation for Image Caption Generation [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015 : 2422 - 2431
- [45] Hierarchical Attention-Based Fusion for Image Caption With Multi-Grained Rewards [J]. IEEE ACCESS, 2020, 8 (08) : 57943 - 57951
- [47] Image Caption with Endogenous–Exogenous Attention [J]. Neural Processing Letters, 2019, 50 : 431 - 443
- [48] CNN image caption generation [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (02) : 152 - 157