共 14 条
- [1] Karpathy A., Joulin A., Fei-Fei L., Deep fragment embeddings for bidirectional image sentence mapping, Proc of Conference on Advances in Neural Information Processing Systems, pp. 1889-1897, (2014)
- [2] Guadarrama S., Rodner E., Saenko K., Et al., Open-vocabulary object retrieval, Proc of Conference on Robotics: Science and Systems, 2, 5, pp. 6-14, (2014)
- [3] Hu R., Xu H., Rohrbach M., Et al., Natural language object retrieval, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4555-4564, (2016)
- [4] Mao J., Huang J., Toshev A., Et al., Generation and comp-rehension of unambiguous object descriptions, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 11-20, (2016)
- [5] Ren S., He K., Girshick R., Et al., Faster R-CNN: towards real-time object detection with region proposal networks, IEEE Transactions on Pattern Analysis & Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
- [6] Girshick R., Fast-rcnn, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1440-1448, (2015)
- [7] Li Y., He K., Sun J., R-fcn: Object detection via region-based fully convolutional networks, Proc of Conference on Advances in Neural Information Processing Systems, pp. 379-387, (2016)
- [8] Hochreiter S., Schmidhuber J., Long short-term meomory, Neural Computation, 9, 8, pp. 1735-1780, (1997)
- [9] Schroff F., Kalenichenko D., Philbin J., Facenet: a unfied embedding for face recognition, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815-823, (2015)
- [10] Kazemzadeh S., Ordonez V., Mattern M., Et al., ReferIt game: referring to object in photographs of natural scenes, Proc of Conference on Empirical. Methods in Natural Language Processing, pp. 787-798, (2014)