共 64 条
[1]
Adavanne S., 2019, P 4 WORKSH DET CLASS, P10, DOI 10.33682/1xwd-5v76
[3]
Afouras T, 2018, Arxiv, DOI arXiv:1809.00496
[4]
Afouras T, 2018, INTERSPEECH, P3244
[5]
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
[J].
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18),
2018,
:292-301
[6]
Alcazar J. L, 2020, IEEECVF C COMPUTER V
[7]
End-to-End Active Speaker Detection
[J].
COMPUTER VISION, ECCV 2022, PT XXXVII,
2022, 13697
:126-143
[8]
[Anonymous], About us
[10]
Look, Listen and Learn
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:609-617