共 50 条
[31]
Improving End-to-End Speech-to-Text Translation With Document-Level Context
[J].
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
2025, 33
:2098-2109
[34]
Novel Defense Method against Audio Adversarial Example for Speech-to-Text Transcription Neural Networks
[J].
2019 IEEE 11TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (IWCIA 2019),
2019,
:115-120
[35]
A Friendly Speech User Interface based on Google Cloud Platform to Access a Tourism Semantic Website
[J].
2017 CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (CHILECON),
2017,
[37]
Multimodal Error Correction for Speech-to-Text in a Mobile Office Automated Vehicle: Results From a Remote Study
[J].
IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES,
2022,
:496-505
[39]
MLLP-VRAIN Spanish ASR Systems for the Albayzin-RTVE 2020 Speech-to-Text Challenge: Extension
[J].
APPLIED SCIENCES-BASEL,
2022, 12 (02)
[40]
GenEn-MNER: Enhancing Nested Chinese NER With Multimodal Fusion and Alignment via Speech-to-Text Generation
[J].
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
2025, 33
:1628-1640