共 36 条
- [1] Barnard K(2005)Word sense disambiguation with pictures Artif Intell 167 13-30
- [2] Jordan MI(2003)Automatic linguistic indexing of pictures by a statistical modeling approach IEEE PAMI 25 1075-1088
- [3] Li J(2013)Mlrank: multi-correlation learning to rank for image annotation Pattern Recogn 46 2700-2710
- [4] Wang J(2007)Using large-scale web data to facilitate textual query based retrieval of consumer photos ACM MM 163 1277-1283
- [5] Li Z(2016)Optimized graph learning using partial tags and multiple features for image and video annotation IEEE Trans Image Process 25 4999-5011
- [6] Liu J(2018)Self-supervised video hashing with hierarchical binary auto-encoder IEEE Trans Image Process 27 3210-3221
- [7] Xu C(2017)Beyond frame-level cnn: saliency-aware 3-d cnn with lstm for video action recognition IEEE Signal Process Lett 24 510-514
- [8] Lu H(2018)Two-stream 3d convnet fusion for action recognition in videos with arbitrary size and length IEEE Trans Multimed 20 634-644
- [9] Liu Y(2018)Training visual-semantic embedding network for boosting automatic image annotation Neural Process Lett 3 1-17
- [10] Xu D(undefined)undefined undefined undefined undefined-undefined