共 35 条
- [1] Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 904 - 915
- [2] S2R-DepthNet: Learning a Generalizable Depth-specific Structural Representation [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3033 - 3042
- [5] Doddington G, 2004, P 4 INT C LANG RES E
- [6] Dosovitskiy A., 2020, PREPRINT
- [7] An Empirical Study of Training End-to-End Vision-and-Language Transformers [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18145 - 18155
- [8] IMGpedia: A Linked Dataset with Content-Based Analysis of Wikimedia Images [J]. SEMANTIC WEB - ISWC 2017, PT II, 2017, 10588 : 84 - 93
- [9] Ferrada Sebastian, 2017, P ISWC 2017 DEM IND, V1963
- [10] Li Lei, 2022, ABS221107504 CORR, DOI [10.48550/arXiv.2211.07504, DOI 10.48550/ARXIV.2211.07504]