共 60 条
[1]
Abu-El-Haija Sami., 2016, Youtube-8m: A large-scale video classification benchmark
[2]
Alayrac JB, 2020, ADV NEUR IN, V33
[3]
Alpert J., 1990, PSYCHOL MARKET, V7, P109, DOI DOI 10.1002/MAR.4220070204
[4]
Alpert J.I., 1989, ACR North American Advances
[5]
Andrew G., 2013, ICML
[6]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[9]
Chao J., 2011, P 10 INT SEMANTIC WE
[10]
Deep Cross-Modal Audio-Visual Generation
[J].
PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17),
2017,
:349-357