Cross-lingual learning for text processing: A survey

被引:33
|
作者
Pikuliak, Matus [1 ]
Simko, Marian [1 ]
Bielikova, Maria [1 ]
机构
[1] Slovak Univ Technol Bratislava, Fac Informat & Informat Technol, Ilkovicova 2, Bratislava 84216, Slovakia
关键词
Cross-lingual learning; Multilingual learning; Transfer learning; Deep learning; Machine learning; Text processing; Natural language processing;
D O I
10.1016/j.eswa.2020.113765
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many intelligent systems in business, government or academy process natural language as an input during inference or they might even communicate with users in natural language. The natural language processing is currently often done with machine learning models. However, machine learning needs training data and such data are often scarce for low-resource languages. The lack of data and resulting poor performance of natural language processing can be solved with cross-lingual learning. Cross-lingual learning is a paradigm for transferring knowledge from one natural language to another. The transfer of knowledge can help us overcome the lack of data in the target languages and create intelligent systems and machine learning models for languages, where it was not possible previously. Despite its increasing popularity and potential, no comprehensive survey on cross-lingual learning was conducted so far. We survey 173 text processing cross-lingual learning papers and examine tasks, data sets and languages that were used. The most important contribution of our work is that we identify and analyze four types of cross-lingual transfer based on "what" is being transferred. Such insight might help other NLP researchers and practitioners to understand how to use cross-lingual learning for wide range of problems. In addition, we identify what we consider to be the most important research directions that might help the community to focus their future work in cross-lingual learning. We present a comprehensive table of all the surveyed papers with various data related to the cross-lingual learning techniques they use. The table can be used to find relevant papers and compare the approaches to cross-lingual learning. To the best of our knowledge, no survey of cross-lingual text processing techniques was done in this scope before. (C) 2020 Published by Elsevier Ltd.
引用
收藏
页数:26
相关论文
共 50 条
  • [11] Cross-Lingual Transfert Learning for Speech Emotion Recognition
    Baklouti, Imen
    Ben Ahmed, Olfa
    Baklouti, Raoudha
    Fernandez, Christine
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES, SIGNAL AND IMAGE PROCESSING, ATSIP 2024, 2024, : 559 - 563
  • [12] CROSS-LINGUAL TRANSFER LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING
    Quynh Ngoc Thi Do
    Gaspers, Judith
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5956 - 5960
  • [13] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170
  • [14] Cross-lingual Text Classification via Model Translation with Limited Dictionaries
    Xu, Ruochen
    Yang, Yiming
    Liu, Hanxiao
    Hsi, Andrew
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 95 - 104
  • [15] Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and Its Application to Cross-Lingual Text Classification
    Esuli, Andrea
    Moreo, Alejandro
    Sebastiani, Fabrizio
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2019, 37 (03)
  • [16] Reliability of electric vehicle charging infrastructure: A cross-lingual deep learning approach
    Liu, Yifan
    Francis, Azell
    Hollauer, Catharina
    Lawson, M. Cade
    Shaikh, Omar
    Cotsman, Ashley
    Bhardwaj, Khushi
    Banboukian, Aline
    Li, Mimi
    Webb, Anne
    Asensio, Omar Isaac
    COMMUNICATIONS IN TRANSPORTATION RESEARCH, 2023, 3
  • [17] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
    Gjoreski, Kristijan
    Gjoreski, Aleksandar
    Kraljevski, Ivan
    Hirschfeld, Diane
    INTERSPEECH 2019, 2019, : 1916 - 1920
  • [18] A method for generating rules for cross-lingual transliteration
    V. K. Logacheva
    Automatic Documentation and Mathematical Linguistics, 2011, 45 (5) : 239 - 248
  • [19] Cross-lingual timeline summarization
    Cagliero, Luca
    La Quatra, Moreno
    Garza, Paolo
    Baralis, Elena
    2021 IEEE FOURTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2021), 2021, : 45 - 53
  • [20] A Machine Learning Approach to Multilingual and Cross-Lingual Ontology Matching
    Spohr, Dennis
    Hollink, Laura
    Cimiano, Philipp
    SEMANTIC WEB - ISWC 2011, PT I, 2011, 7031 : 665 - +