Cross-lingual spoken language understanding (cross-lingual SLU), a key component of task-oriented dialogue systems, is widely used in industrial and real-world scenarios such as multilingual customer support systems, cross-border communication platforms, and international language learning tools. However, obtaining large-scale, high-quality SLU datasets is challenging due to the high cost of dialogue collection and manual annotation, particularly for low-resource languages. As a result, there is growing interest in leveraging high-resource language data for cross-lingual transfer learning. Existing approaches to zero-shot cross-lingual SLU primarily focus on the relationship between the source-language sentence and a single generated cross-lingual sentence, disregarding the information shared among multiple languages. This limitation weakens the robustness of multilingual word embedding representations and hampers the scalability of the model. In this paper, we propose a multilingual mixture attention interaction framework with adversarial training to alleviate these problems. Specifically, we generate multiple multilingual hybrid sentences from each source-language sentence, so that during encoding each word can adaptively capture unambiguous representations from its aligned multilingual counterparts, and we introduce adversarial training to enhance the scalability of the model. We then incorporate a symmetric kernel self-attention module with positional embeddings to capture contextual information within a sentence, and employ multi-relation graph convolutional networks to model information at different granularities between the two highly correlated tasks of intent detection and slot filling. Experimental results on the public MultiATIS++ dataset demonstrate that our proposed model achieves state-of-the-art performance, and comprehensive analysis validates the effectiveness of each component.
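To make the hybrid-sentence generation step concrete, the sketch below illustrates how multiple code-switched variants of a source-language sentence could be produced by substituting words with aligned translations. It is a minimal illustration only: the dictionary MULTILINGUAL_DICT, the function generate_hybrid_sentences, and the sampling parameters are hypothetical assumptions, not the implementation used in this paper.

```python
import random

# Hypothetical aligned multilingual dictionary (illustrative only):
# each source-language word maps to translations in several target languages.
MULTILINGUAL_DICT = {
    "flights": {"de": "Flüge", "es": "vuelos", "fr": "vols"},
    "morning": {"de": "Morgen", "es": "mañana", "fr": "matin"},
    "show":    {"de": "zeige",  "es": "muestra", "fr": "montre"},
}

def generate_hybrid_sentences(tokens, num_variants=3, switch_prob=0.5, seed=0):
    """Generate multiple multilingual hybrid (code-switched) variants of a
    source-language sentence by replacing words with aligned translations."""
    rng = random.Random(seed)
    variants = []
    for _ in range(num_variants):
        hybrid = []
        for tok in tokens:
            translations = MULTILINGUAL_DICT.get(tok.lower())
            if translations and rng.random() < switch_prob:
                lang = rng.choice(sorted(translations))  # pick a target language
                hybrid.append(translations[lang])        # substitute aligned word
            else:
                hybrid.append(tok)                       # keep the source word
        variants.append(hybrid)
    return variants

if __name__ == "__main__":
    sentence = "show me morning flights to boston".split()
    for variant in generate_hybrid_sentences(sentence):
        print(" ".join(variant))
```

In the proposed framework, such hybrid sentences would then be fed to the encoder so that each word can attend to its aligned multilingual counterparts; the encoder and attention components themselves are not shown here.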