69 entries in total
[1]
Aafaq Nayyer, 2022, IEEE T MULTIMEDIA TM, V14, P1
[2]
Unsupervised Learning from Narrated Instruction Videos
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:4575-4583
[3]
[Anonymous], 2017, ICML P MACHINE LEARN
[4]
[Anonymous], 2015, P 2015 ANN C N AM CH
[5]
[Anonymous], 1997, P NAT C ART INT 9 C
[6]
Ba JL, 2016, arXiv
[7]
Discriminative Latent Semantic Graph for Video Captioning
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:3556-3564
[8]
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3185-3194
[9]
Critic-based Attention Network for Event-based Video Captioning
[J].
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19),
2019,
:811-817
[10]
Bidirectional Long-Short Term Memory for Video Description
[J].
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE,
2016,
:436-440