Image-Text Multimodal Translation Based on AIGC Human-Machine Interaction

被引：0

作者：

Yang, Lixue ^{[1
]}

机构：

[1] Tianjin Univ Technol & Educ, Tianjin, Peoples R China

来源：

2024 4TH INTERNATIONAL CONFERENCE ON HUMAN-MACHINE INTERACTION, ICHMI 2024 | 2024年

关键词：

Machine translation; AIGC; Image-text multimodal; Human-machine interaction; Artificial intelligence;

D O I：

10.1145/3678429.3678436

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The multimodal process of machine translation is studied by focusing on the development of artificial intelligence in language processing and the transformation of vocabulary into numerical representations through vectorization. The neural networks such as CBOW and Skip-gram models are applied to analyze the word vectorization. It also explores the Transformer model with self-attention mechanism, emphasizing the importance of Layer Normalization for training stability and convergence speed. The emergence of ChatGPT as a state-of-the-art conversational AI model, highlights its role in assisting translators with language understanding and generation tasks. The application of generative artificial intelligence is discussed in translation practice, where human-machine interaction maximizes human intelligence while utilizing AI capabilities. DALL.E2 is capable of generating images from text, and the integration of image with translated text plays an important role in constructing the being of the intersemiotic translated work as they maintain the existential emotions effectively through the text-image multimodal interaction.

引用

页码：44 / 51

页数：8

共 7 条

[1] Jakobson R., 1959, TRANSLATION, P232, DOI DOI 10.4159/HARVARD.9780674731615.C18
[2] Lamb Charles, 1885, Essays of Elia
[3] Mikolov T., 2013, ARXIV
[4] Parcalabescu Letitia, 2021, P 1 WORKSH MULT SEM, P1, DOI [10.48550/arXiv.2103.06304, DOI 10.48550/ARXIV.2103.06304]
[5] Ramesh A., 2022, arXiv
[6] Vaswani A, 2017, ADV NEUR IN, V30
[7] Progress in Machine Translation
Wang, Haifeng
Wu, Hua
He, Zhongjun
Huang, Liang
Church, Kenneth Ward
[J]. ENGINEERING, 2022, 18 : 143 - 153

← 1 →