Image-Text Multimodal Translation Based on AIGC Human-Machine Interaction

被引:0
作者
Yang, Lixue [1 ]
机构
[1] Tianjin Univ Technol & Educ, Tianjin, Peoples R China
来源
2024 4TH INTERNATIONAL CONFERENCE ON HUMAN-MACHINE INTERACTION, ICHMI 2024 | 2024年
关键词
Machine translation; AIGC; Image-text multimodal; Human-machine interaction; Artificial intelligence;
D O I
10.1145/3678429.3678436
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multimodal process of machine translation is studied by focusing on the development of artificial intelligence in language processing and the transformation of vocabulary into numerical representations through vectorization. The neural networks such as CBOW and Skip-gram models are applied to analyze the word vectorization. It also explores the Transformer model with self-attention mechanism, emphasizing the importance of Layer Normalization for training stability and convergence speed. The emergence of ChatGPT as a state-of-the-art conversational AI model, highlights its role in assisting translators with language understanding and generation tasks. The application of generative artificial intelligence is discussed in translation practice, where human-machine interaction maximizes human intelligence while utilizing AI capabilities. DALL.E2 is capable of generating images from text, and the integration of image with translated text plays an important role in constructing the being of the intersemiotic translated work as they maintain the existential emotions effectively through the text-image multimodal interaction.
引用
收藏
页码:44 / 51
页数:8
相关论文
共 7 条
  • [1] Jakobson R., 1959, TRANSLATION, P232, DOI DOI 10.4159/HARVARD.9780674731615.C18
  • [2] Lamb Charles, 1885, Essays of Elia
  • [3] Mikolov T., 2013, ARXIV
  • [4] Parcalabescu Letitia, 2021, P 1 WORKSH MULT SEM, P1, DOI [10.48550/arXiv.2103.06304, DOI 10.48550/ARXIV.2103.06304]
  • [5] Ramesh A., 2022, arXiv
  • [6] Vaswani A, 2017, ADV NEUR IN, V30
  • [7] Progress in Machine Translation
    Wang, Haifeng
    Wu, Hua
    He, Zhongjun
    Huang, Liang
    Church, Kenneth Ward
    [J]. ENGINEERING, 2022, 18 : 143 - 153