Cross-lingual distillation for domain knowledge transfer with sentence transformers

Cited: 0
Authors
Piperno, Ruben [1 ,2 ]
Bacco, Luca [1 ,3 ]
Dell'Orletta, Felice [1 ]
Merone, Mario [2 ]
Pecchia, Leandro [2 ,4 ]
Affiliations
[1] Inst Computat Linguist Antonio Zampolli, Natl Res Council, ItaliaNLP Lab, Via Giuseppe Moruzzi, 1, I-56124 Pisa, Italy
[2] Univ Campus Biomed Roma, Dept Engn, Res Unit Intelligent Technol Hlth & Wellbeing, Via Alvaro Portillo 21, I-00128 Rome, Italy
[3] Univ Campus Biomed Roma, Dept Engn, Res Unit Comp Syst & Bioinformat, Via Alvaro Portillo 21, I-00128 Rome, Italy
[4] Fdn Policlin Univ Campus Biomed Roma, Via Alvaro del Portillo 200, I-00128 Rome, Italy
Keywords
Cross-lingual learning; Knowledge distillation; Sentence transformers; Biomedical domain; Domain adaptation
DOI
10.1016/j.knosys.2025.113079
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent advancements in Natural Language Processing (NLP) have substantially enhanced language understanding. However, non-English languages, especially in specialized and low-resource domains like biomedicine, remain largely underrepresented. Bridging this gap is essential for promoting inclusivity and expanding the global applicability of NLP technologies. This study presents a cross-lingual knowledge distillation framework that utilizes sentence transformers to improve domain-specific NLP capabilities in non-English languages, focusing on biomedical text classification tasks. By aligning sentence embeddings between a teacher model trained on English biomedical corpora and a multilingual student model, the proposed method effectively transfers both domain-specific and task-specific knowledge. This alignment allows the student model to efficiently process and adapt to biomedical texts in Spanish, French, and German, particularly in low-resource settings with limited tuning data. Extensive experiments with domain-adapted models such as BioBERT and multilingual BERT, trained on machine-translated text pairs, demonstrate substantial performance improvements in downstream biomedical NLP tasks. The framework proves especially effective in scenarios with limited training data. The results highlight the scalability and effectiveness of the approach, facilitating the development of robust multilingual models tailored to the biomedical domain and advancing the global accessibility and impact of biomedical NLP applications.
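The teacher-student alignment step described in the abstract lends itself to a short illustration. The sketch below is a hypothetical reconstruction, assuming the standard sentence-transformers multilingual distillation recipe (minimizing the mean-squared error between student embeddings and teacher embeddings over English/machine-translated sentence pairs); the teacher checkpoint path, data file name, and hyperparameters are placeholders, not the paper's exact configuration.

# Hypothetical sketch of the cross-lingual embedding distillation step.
# Assumes the standard sentence-transformers recipe: the student is pulled
# toward the teacher's embedding of the English sentence for BOTH the
# English sentence and its machine translation. Paths are placeholders.
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, losses
from sentence_transformers.datasets import ParallelSentencesDataset

# Teacher: an English biomedical sentence encoder (e.g., BioBERT-based).
teacher = SentenceTransformer("path/to/english-biomedical-teacher")
# Student: a multilingual encoder to be aligned with the teacher's space.
student = SentenceTransformer("bert-base-multilingual-cased")

# Tab-separated parallel file: each line holds an English sentence followed
# by its machine translation(s) into Spanish/French/German.
train_data = ParallelSentencesDataset(student_model=student, teacher_model=teacher)
train_data.load_data("biomedical_parallel_pairs.tsv")

train_dataloader = DataLoader(train_data, shuffle=True, batch_size=32)
# MSE between student embeddings and the cached teacher embeddings.
train_loss = losses.MSELoss(model=student)

student.fit(
    train_objectives=[(train_dataloader, train_loss)],
    epochs=1,
    warmup_steps=100,
)
student.save("multilingual-biomedical-student")

After fitting, the student maps Spanish, French, and German biomedical sentences into roughly the same vector space as the English teacher, so a classifier built on teacher embeddings can be reused across languages, which is the property the abstract attributes to the framework in low-resource settings.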
Pages: 10
Related papers (50 in total)
  • [1] Hou J.-Y.; Qi Y.-Y.; Wu X.-X.; Jia Y.-D. Cross-Lingual Knowledge Distillation for Chinese Video Captioning. Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44(09): 1907-1921.
  • [2] Fukuda, Takashi; Thomas, Samuel. Knowledge Distillation Based Training of Universal ASR Source Models for Cross-lingual Transfer. INTERSPEECH 2021, 2021: 3450-3454.
  • [3] Wang, Shaofei; Dang, Depeng. Robust cross-lingual knowledge base question answering via knowledge distillation. Data Technologies and Applications, 2021, 55(05): 661-681.
  • [4] Liang, Shining; Gong, Ming; Pei, Jian; Shou, Linjun; Zuo, Wanli; Zuo, Xianglin; Jiang, Daxin. Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition. KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2021: 3231-3239.
  • [5] Wang, Yanli; Ji, Yatu; Sun, Baolei; Ren, Qing-Dao-Er-Ji; Wu, Nier; Liu, Na; Lu, Min; Zhao, Chen; Jia, Yepai. Mongolian-Chinese Cross-lingual Topic Detection Based on Knowledge Distillation. 2024 International Conference on Asian Language Processing (IALP 2024), 2024: 383-388.
  • [6] Sun, Weiwei; Meng, Chuan; Meng, Qi; Ren, Zhaochun; Ren, Pengjie; Chen, Zhumin; de Rijke, Maarten. Conversations Powered by Cross-Lingual Knowledge. SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021: 1442-1451.
  • [7] Mao, Tianjun; Zhang, Chenghong. DiffSLU: Knowledge Distillation Based Diffusion Model for Cross-Lingual Spoken Language Understanding. INTERSPEECH 2023, 2023: 715-719.
  • [8] Gao S.; He Z.; Yu Z.; Zhu E.; Wu S. Chinese-Vietnamese cross-lingual event retrieval method based on knowledge distillation. Journal of Intelligent and Fuzzy Systems, 2024, 46(04): 8461-8475.
  • [9] Wang, Deheng; Yang, Aimin; Zhou, Yongmei; Xie, Fenfang; Ouyang, Zhouhao; Peng, Sancheng. Distillation Language Adversarial Network for Cross-lingual Sentiment Analysis. 2022 International Conference on Asian Language Processing (IALP 2022), 2022: 45-50.
  • [10] Yang, Xiwei; Yun, Jing; Zheng, Bofei; Liu, Limin; Ban, Qi. Oversea Cross-Lingual Summarization Service in Multilanguage Pre-Trained Model through Knowledge Distillation. Electronics, 2023, 12(24).