共 45 条
- [1] Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12479 - 12488
- [2] [Anonymous], 2014, ARXIV14124729
- [3] Exposing Computer Generated Images by Eye's Region Classification via Transfer Learning of VGG19 CNN [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 866 - 870
- [4] Chen D., 2011, P 49 ANN M ASS COMP, P190
- [5] Chen K., ARXIV190607155, V2019
- [6] Chen SX, 2019, AAAI CONF ARTIF INTE, P8191
- [7] Chen Y., 2020, THESIS U ELECT SCI T, DOI [10.27005/d.cnki.gdzku.2020.002623, DOI 10.27005/D.CNKI.GDZKU.2020.002623]
- [8] Denkowski M., 2014, Proceedings of the ninth workshop on statistical machine translation, P376, DOI DOI 10.3115/V1/W14-3348
- [9] Fused GRU with semantic-temporal attention for video captioning [J]. NEUROCOMPUTING, 2020, 395 : 222 - 228
- [10] Joint Syntax Representation Learning and Visual Cue Translation for Video Captioning [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8917 - 8926