共 50 条
[41]
Transformer Self-Attention Change Detection Network with Frozen Parameters
[J].
APPLIED SCIENCES-BASEL,
2025, 15 (06)
[43]
MVSTER: Epipolar Transformer for Efficient Multi-view Stereo
[J].
COMPUTER VISION, ECCV 2022, PT XXXI,
2022, 13691
:573-591
[44]
MULTI-VIEW SPEAKER EMBEDDING LEARNING FOR ENHANCED STABILITY AND DISCRIMINABILITY
[J].
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024),
2024,
:10081-10085
[46]
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory
[J].
INTERSPEECH 2020,
2020,
:2132-2136
[47]
BiTMulV: Bidirectional-Decoding Based Transformer with Multi-view Visual Representation
[J].
PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022,
2022, 13534
:735-748
[50]
ON THE USEFULNESS OF SELF-ATTENTION FOR AUTOMATIC SPEECH RECOGNITION WITH TRANSFORMERS
[J].
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT),
2021,
:89-96