共 25 条
[1]
YIN Qiyue, HUANG Yan, ZHANG Junge, Et al., Survey on deep learning based cross-modal retrieval, Journal of Image and Graphics, 26, 6, pp. 1368-1388, (2021)
[2]
LIU Ying, GUO Yingying, FANG Jie, Et al., Survey of research on deep learning image-text cross-modal retrieval, Journal of Frontiers of Computer Science and Technology, 16, 3, pp. 489-511, (2022)
[3]
HU Di, NIE Feiping, LI Xuelong, Deep binary reconstruction for cross-modal hashing [J], IEEE Transactions on Multimedia, 21, 4, pp. 973-985, (2019)
[4]
LI Huiqiong, WANG Yongxin, CHEN Zhenduo, Et al., Ranking-based supervised discrete cross-modal hashing, Chinese Journal of Computers, 44, 8, pp. 1620-1635, (2021)
[5]
WANG Liwei, LI Yin, HUANG Jing, Et al., Learning two-branch neural networks for image-text matching tasks [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 41, 2, pp. 394-407, (2019)
[6]
WANG Hongbin, ZHANG Zhiliang, LI Huafeng, Image-text cross-modal matching method based on stacked cross attention, Journal of Signal Processing, 38, 2, pp. 285-299, (2022)
[7]
CHEN Hui, DING Guiguang, LIN Zijia, Et al., Cross-modal image-text retrieval with semantic consistency, Proceedings of the 27th ACM International Conference on Multimedia, pp. 1749-1757, (2019)
[8]
REED S, AKATA Z, LEE H, Et al., Learning deep representations of fine-grained visual descriptions, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 49-58, (2016)
[9]
WANG Wei, ZHENG V W, YU Han, Et al., A survey of zero-shot learning: settings, methods, and applications [J], ACM Transactions on Intelligent Systems and Technology, 10, 2, (2019)
[10]
ZHANG Guimei, LONG Bangyao, ZENG Jiexian, Et al., Zero-shot attribute recognition based on deredundancy features and semantic relationship constraint, Pattern Recognition and Artificial Intelligence, 34, 9, pp. 809-823, (2021)