21 entries in total
- [1] Baevski A, 2020, ADV NEUR IN, V33
- [2] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 4724-4733
- [3] de Kok I., 2009, P 2009 INT C MULTIMO, P91
- [4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
- [6] Ekstedt E., 2020, ARXIV201010874
- [7] X3D: Expanding Architectures for Efficient Video Recognition [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020: 200-210
- [8] GRACCO VL, 1994, J NEUROSCI, V14, P6585
- [9] Hara K., 2018, LISTENER, V162, P364
- [10] Multimodal and Multitask Approach to Listener's Backchannel Prediction: Can Prediction of Turn-changing and Turn-management Willingness Improve Backchannel Modeling? [J]. PROCEEDINGS OF THE 21ST ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS (IVA), 2021: 131-138