共 22 条
[1]
Discriminative Latent Semantic Graph for Video Captioning
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:3556-3564
[2]
Motion Guided Region Message Passing for Video Captioning
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1523-1532
[3]
Semantic Compositional Networks for Visual Captioning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1141-1150
[4]
Hemalatha M, 2020, IEEE WINT CONF APPL, P1576, DOI [10.1109/WACV45572.2020.9093344, 10.1109/wacv45572.2020.9093344]
[5]
Dense-Captioning Events in Videos
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:706-715
[6]
REVNET: BRING REVIEWING INTO VIDEO CAPTIONING FOR A BETTER DESCRIPTION
[J].
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME),
2019,
:1312-1317
[7]
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10867-10876
[8]
Jointly Modeling Embedding and Translation to Bridge Video and Language
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:4594-4602
[9]
Memory-Attended Recurrent Network for Video Captioning
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:8339-8348
[10]
Ryu H, 2021, AAAI CONF ARTIF INTE, V35, P2514