共 50 条
[11]
An End-to-End Transformer Model for Crowd Localization
[J].
COMPUTER VISION - ECCV 2022, PT I,
2022, 13661
:38-54
[12]
SFTT: A Spatial-Frequency-Temporal-Based End-to-End Transformer for Heart Rate Estimation
[J].
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE,
2025,
[13]
Spatial-Temporal Transformer Network for Continuous Action Recognition in Industrial Assembly
[J].
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024,
2024, 14871
:114-130
[14]
An Investigation of Positional Encoding in Transformer-based End-to-end Speech Recognition
[J].
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP),
2021,
[16]
Semantic Mask for Transformer based End-to-End Speech Recognition
[J].
INTERSPEECH 2020,
2020,
:971-975
[17]
Transformer-based end-to-end scene text recognition
[J].
PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021),
2021,
:1691-1695
[18]
END-TO-END MULTI-SPEAKER SPEECH RECOGNITION WITH TRANSFORMER
[J].
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING,
2020,
:6134-6138
[19]
END-TO-END PART-LEVEL ACTION PARSING WITH TRANSFORMER
[J].
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME,
2023,
:756-761