Steering Large Language Models for Cross-lingual Information Retrieval

被引:1
作者
Guo, Ping [1 ,2 ]
Ren, Yubing [1 ,2 ]
Hu, Yue [1 ,2 ]
Cao, Yanan [1 ,2 ]
Li, Yunpeng [1 ,2 ]
Huang, Heyan [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci Haidian, Sch Cyber Secur, Beijing, Peoples R China
[3] Beijing Inst Technol, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024 | 2024年
基金
中国国家自然科学基金;
关键词
Cross-lingual Information Retrieval; Activation Steering; Large Language Models;
D O I
10.1145/3626772.3657819
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In today's digital age, accessing information across language barriers poses a significant challenge, with conventional search systems often struggling to interpret and retrieve multilingual content accurately. Addressing this issue, our study introduces a novel integration of applying Large Language Models (LLMs) as Crosslingual Readers in information retrieval systems, specifically targeting the complexities of cross-lingual information retrieval (CLIR). We present an innovative approach: Activation Steered Multilingual Retrieval (ASMR) that employs "steering activations"-a method to adjust and direct the LLM's focus-enhancing its ability to understand user queries and generate accurate, language-coherent responses. ASMR adeptly combines a Multilingual Dense Passage Retrieval (mDPR) system with an LLM, overcoming the limitations of traditional search engines in handling diverse linguistic inputs. This approach is particularly effective in managing the nuances and intricacies inherent in various languages. Rigorous testing on established benchmarks such as XOR-TyDi QA, and MKQA demonstrates that ASMR not only meets but surpasses existing standards in CLIR, achieving state-of-the-art performance. The results of our research hold significant implications for understanding the inherent features of how LLMs understand and generate natural languages, offering an attempt towards more inclusive, effective, and linguistically diverse information access on a global scale.
引用
收藏
页码:585 / 596
页数:12
相关论文
共 83 条
[1]  
Alain G., 2017, Understanding intermediate layers using linear classifier probes
[2]  
Albalak Alon, 2023, 17 C EUROPEAN CHAPTE, P1
[3]  
Asai A, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P547
[4]  
Asai Akari, 2022, P WORKSHOP MULTILING, P108
[5]  
Asai Akari, 2021, arXiv
[6]  
Askari A, 2023, Arxiv, DOI arXiv:2305.02320
[7]  
Bai Y., 2022, arXiv
[8]   Probing Classifiers: Promises, Shortcomings, and Advances [J].
Belinkov, Yonatan .
COMPUTATIONAL LINGUISTICS, 2022, 48 (01) :207-219
[9]  
Brown Davis, 2023, ICLR 2023 WORKSH MAT
[10]  
Burns C, 2024, Arxiv, DOI [arXiv:2212.03827, 10.48550/arXiv.2212.03827]