共 39 条
[1]
Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution
[J].
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION,
2016,
:279-283
[2]
Cha J, 2021, ADV NEUR IN
[3]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[4]
Darwin C., 1872, P374
[5]
Dosovitskiy Alexey, 2021, INT C LEARN REPR
[6]
Facial Expression Recognition in the Wild via Deep Attentive Center Loss
[J].
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021,
2021,
:2401-2410
[7]
Feng X., 2005, Pattern Recognition and Image Analysis, V15, P546
[8]
Masked Autoencoders Are Scalable Vision Learners
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:15979-15988
[9]
Rethinking Spatial Dimensions of Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:11916-11925
[10]
Heredia J., 2022, 18 INT C INT ENV IE2, P46