55 entries in total
[1]
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 12479-12488
[2]
[Anonymous], 2017, ACM MM, DOI 10.1145/3123266.3123448
[3]
Bachman P, 2019, ADV NEUR IN, V32
[4]
Banerjee Satanjeev, 2005, P ACL WORKSH INTR EX, P65, DOI 10.3115/1626355.1626389
[5]
Cai Q., 2020, NEURIPS
[6]
Chen David L., 2011, P 49 ANN M ASS COMP
[7]
Chen JW, 2019, AAAI CONF ARTIF INTE, P8167
[8]
Chen T, 2020, PR MACH LEARN RES, V119
[9]
Chen Yen-Chun, 2020, ECCV
[10]
Multi-fiber Networks for Video Recognition [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205: 364-380