共 62 条
[1]
Bi Y., Jiang H., Hu Y., Sun Y., Yin B., See and learn more: Dense caption-aware representation for visual question answering, IEEE Trans. Circuits Syst. Video Technol., 34, 2, pp. 1135-1146, (2024)
[2]
Tang J., Liu D., Jin X., Peng Y., Zhao Q., Ding Y., Kong W., Bafn: Bi-direction attention based fusion network for multimodal sentiment analysis, IEEE Trans. Circuits Syst. Video Technol., 33, 4, pp. 1966-1978, (2022)
[3]
Zhu W., Wang X., Li H., Multi-modal deep analysis for multimedia, IEEE Trans. Circuits Syst. Video Technol., 30, 10, pp. 3740-3764, (2019)
[4]
Wang X., Huang Q., Celikyilmaz A., Gao J., Shen D., Wang Y.-F., Wang W.Y., Zhang L., Reinforced cross-modal matching and self-supervised imitation learning for vision-language navigation,, pp. 6629-6638, (2019)
[5]
Pathak D., Mahmoudieh P., Luo G., Agrawal P., Chen D., Shentu Y., Shelhamer E., Malik J., Efros A.A., Darrell T., Zero-shot visual imitation,, pp. 2050-2053, (2018)
[6]
Chen J., Shen Y., Gao J., Liu J., Liu X., Language-based image editing with recurrent attentive models,, pp. 8721-8729, (2018)
[7]
Chen H., Li C., Wang G., Li X., Mamunur Rahaman M., Sun H., Hu W., Li Y., Liu W., Sun C., Ai S., Grzegorzek M., GasHis-transformer: A multi-scale visual transformer approach for gastric histopathological image detection, Pattern Recognit., 130, (2022)
[8]
Zhang J., Li C., Kosov S., Grzegorzek M., Shirahama K., Jiang T., Sun C., Li Z., Li H., LCU-net: A novel low-cost U-net for environmental microorganism image segmentation, Pattern Recognit., 115, (2021)
[9]
Yu X., Lu X., Domain adaptation of anchor-free object detection for urban traffic, Neurocomputing, 582, (2024)
[10]
Sun S., Mo B., Xu J., Li D., Zhao J., Han S., Multi-YOLOv8: An infrared moving small object detection model based on YOLOv8 for air vehicle, Neurocomputing, 588, (2024)