共 139 条
- [1] Hudson DA, 2019, Arxiv, DOI [arXiv:1902.09506, DOI 10.48550/ARXIV.1902.09506]
- [3] Classifying Imbalanced Multi-modal Sensor Data for Human Activity Recognition in a Smart Home using Deep Learning [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
- [4] [Anonymous], 2014, Advances in neural information processing systems
- [5] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [6] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
- [7] Ben Abacha A., 2019, CLEF2019 WORKING NOT, P1
- [8] LaTr: Layout-Aware Transformer for Scene-Text VQA [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16527 - 16537
- [9] Blum H, 2018, IEEE INT C INT ROBOT, P3670, DOI 10.1109/IROS.2018.8593786