Optimization and Performance Analysis of Real-time Speech Translation Systems Based on Mobile Technology

被引：0

作者：

Yang, Ning ^{[1
]}

机构：

[1] School of Education, Xi’an Fanyi University, Xi’an

来源：

International Journal of Interactive Mobile Technologies | 2024年 / 18卷 / 23期

关键词：

connectionist temporal classification (CTC); criterion; mobile technology; model compression; parameter compression; performance optimization; real-time speech translation;

D O I：

10.3991/ijim.v18i23.52879

中图分类号：

学科分类号：

摘要：

As globalization deepens and mobile technology rapidly advances, the demand for cross-linguistic communication has been steadily increasing, making real-time speech translation systems a research focus. However, given the limited computational capacity and storage space of mobile devices, optimizing system performance while maintaining translation quality has become a critical challenge. Current optimization approaches for real-time speech translation systems primarily focus on improvements to model architectures and hardware acceleration, often neglecting a systematic study of model compression. This is particularly evident when handling real-time data, where achieving both high efficiency and translation accuracy remains difficult. To address these challenges, a model compression method based on the connectionist temporal classification (CTC) criterion was proposed, along with an in-depth study of parameter compression tailored for mobile applications. The research focuses on two key areas: first, model compression techniques based on the CTC criterion were explored to enhance the efficiency of real-time speech translation; second, parameter compression methods were investigated to significantly reduce resource consumption in mobile applications while preserving translation quality. The aim of this study is to improve the performance and user experience of real-time speech translation systems on mobile devices. © 2024, International Federation of Engineering Education Societies (IFEES). All rights reserved.

引用

页码：57 / 71

页数：14

共 21 条

[1]

Kabat M., Possible translation problems, their causes, and solutions in agile localization of software, International Journal of Interactive Mobile Technologies (iJIM), 17, 1, pp. 129-140, (2023)

[2]

Lei X., Real-time translation of English speech through speech feature extraction, Artificial Life and Robotics, 29, pp. 410-415, (2024)

[3]

Rababah L. M., Al-Khawaldeh N., Rababah M. A., Mobile-assisted listening instructions with Jordanian audio materials: A pathway to EFL proficiency, International Journal of Interactive Mobile Technologies (iJIM), 17, 21, pp. 129-144, (2023)

[4]

Ke X., English synchronous real-time translation method based on reinforcement learning, Wireless Networks, 30, pp. 4167-4179, (2024)

[5]

Kozhirbayev Z., Enhancing neural machine translation with fine-tuned mBART50 pre-trained model: An examination with low-resource translation pairs, Ingénierie des Systèmes d’Information, 29, 3, pp. 831-838, (2024)

[6]

Chen H. K., Cognitive perspectives on English learning methods: Efficiency and achievements under task-based instruction, Education Science and Management, 1, 2, pp. 86-100, (2023)

[7]

Novitasari S., Sakti S., Nakamura S., Neural incremental speech recognition toward real-time machine speech translation, IEICE Transactions on Information and Systems, E104.D, 12, pp. 2195-2208, (2021)

[8]

Tripathi V. R., Jha H. K., Popli M., Shah P., Desai G., Clinic, community, and in-between: The influence of space on real-time translation of medical expertise by frontline healthcare professionals in marginal tribal communities, Journal of Professions and Organization, 8, 3, pp. 273-294, (2021)

[9]

Kumar A., Pratap A., Singh A. K., Generative adversarial neural machine translation for phonetic languages via reinforcement learning, IEEE Transactions on Emerging Topics in Computational Intelligence, 7, 1, pp. 190-199, (2022)

[10]

Bilas A., Psycholinguistic aspect of phonetic and orthographic means of French colloquial speech in Ukrainian translation, Psycholinguistics, 27, 2, pp. 71-89, (2020)

← 1 2 3 →