74 entries in total
[11]
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning [J]. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022: 18009-18019
[12]
Chen L.-W., 2023, IEEE INT C AC SPEECH, P1
[13]
Chen PH, 2021, AAAI CONF ARTIF INTE, V35, P1045
[15]
Chen T, 2020, PR MACH LEARN RES, V119
[16]
Chen W., 2023, IEEE INT C AC SPEECH, P1
[17]
SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech [J]. INTERSPEECH 2022, 2022: 346-350
[19]
Key-Sparse Transformer for Multimodal Speech Emotion Recognition [J]. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022: 6897-6901
[20]
Chetia Phukan O., 2023, INTERSPEECH, P1903