共 77 条
[1]
Alemi AA, 2018, PR MACH LEARN RES, V80
[2]
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:3674-3683
[3]
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4260-4269
[4]
[Anonymous], 2017, NEURAL INFORM PROCES
[5]
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[6]
Bhattacharyya Apratim, 2018, IEEE C COMP VIS PATT
[7]
Bowman Samuel R., 2016, 20 SIGNLL C COMP NAT
[8]
Carbonetto P, 2004, LECT NOTES COMPUT SC, V3021, P350
[9]
Chen GZ, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, ROBOTICS AND AUTOMATION (ICMRA), P188, DOI 10.1109/ICMRA.2018.8490580
[10]
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4042-4050