共 11 条
- [1] Dong J(2018)Predicting visual features from text for image and video caption retrieval IEEE Trans Multimedia 20 3377-3388
- [2] Li X(2017)Movie description Int J Comput Vis 123 94-120
- [3] Snoek CG(undefined)undefined undefined undefined undefined-undefined
- [4] Rohrbach A(undefined)undefined undefined undefined undefined-undefined
- [5] Torabi A(undefined)undefined undefined undefined undefined-undefined
- [6] Rohrbach M(undefined)undefined undefined undefined undefined-undefined
- [7] Tandon N(undefined)undefined undefined undefined undefined-undefined
- [8] Pal C(undefined)undefined undefined undefined undefined-undefined
- [9] Larochelle H(undefined)undefined undefined undefined undefined-undefined
- [10] Courville A(undefined)undefined undefined undefined undefined-undefined