40 entries in total
[1]
Abdelaziz AH, 2020, PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2020, P378, DOI 10.1145/3382507.3418840
[2]
Akbari H, 2021, arXiv, DOI [arXiv:2104.11178, DOI 10.48550/ARXIV.2104.11178]
[3]
Alayrac J.B., 2020, Adv. Neural Inf. Process. Syst., P1
[5]
Look, Listen and Learn [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 609-617
[6]
Arevalo J., 2019, arXiv
[7]
Arnab A., 2021, arXiv
[8]
Ba J. L., 2016, arXiv, DOI 10.48550/arXiv.1607.06450
[9]
Bertasius G, 2021, arXiv, DOI [arXiv:2102.05095, DOI 10.48550/ARXIV.2102.05095]
[10]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 4724-4733