共 27 条
[1]
Andreas J., 2016, P 2016 C N AM CHAPT, P1545
[2]
Andreas J, 2017, PR MACH LEARN RES, V70
[3]
Neural Module Networks
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:39-48
[4]
Bellver M., 2020, ARXIV PREPRINT ARXIV
[5]
Video Search Engine Optimization Using Keyword and Feature Analysis
[J].
SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15),
2015, 58
:691-697
[6]
Actor and Action Video Segmentation from a Sentence
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:5958-5966
[7]
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[8]
Learning to Reason: End-to-End Module Networks for Visual Question Answering
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:804-813
[9]
Modeling Relationships in Referential Expressions with Compositional Modular Networks
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4418-4427
[10]
Inferring and Executing Programs for Visual Reasoning
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:3008-3017