共 24 条
[11]
Huang X, 2014, P 16 INT C MULT INT, P514, DOI [DOI 10.1145/2663204.2666278, 10.1145/2663204.2666278]
[12]
DFEW: A Large-Scale Database for Recognizing Dynamic Facial Expressions in the Wild
[J].
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA,
2020,
:2881-2889
[13]
Recurrent Neural Networks for Emotion Recognition in Video
[J].
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION,
2015,
:467-474
[14]
Li H., 2022, ARXIV
[15]
Self-supervised Video Hashing via Bidirectional Transformers
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:13544-13553
[16]
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[17]
A Closer Look at Spatiotemporal Convolutions for Action Recognition
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6450-6459
[18]
Vaswani A, 2017, ADV NEUR IN, V30
[20]
Wang Linhuang, 2023, CHIN C PATT REC COMP, P371