Cross-Lingual Phrase Retrieval

被引:0
作者
Zheng, Heqi [1 ,2 ]
Zhang, Xiao [1 ]
Chi, Zewen [1 ]
Huang, Heyan [1 ,2 ]
Yan, Tan [1 ]
Lan, Tian [1 ]
Wei, Wei [3 ]
Mao, Xian-Ling [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Beijing Engn Res Ctr High Volume Language Informa, Beijing, Peoples R China
[3] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
来源
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual retrieval aims to retrieve relevant text across languages. Current methods typically achieve cross-lingual retrieval by learning language-agnostic text representations in word or sentence level. However, how to learn phrase representations for cross-lingual phrase retrieval is still an open problem. In this paper, we propose XPR, a cross-lingual phrase retriever that extracts phrase representations from unlabeled example sentences. Moreover, we create a large-scale cross-lingual phrase retrieval dataset, which contains 65K bilingual phrase pairs and 4.2M example sentences in 8 English-centric language pairs. Experimental results show that XPR outperforms state-of-the-art baselines which utilize word-level or sentence-level representations. XPR also shows impressive zero-shot transferability that enables the model to perform retrieval in an unseen language pair during training. Our dataset, code, and trained models are publicly available at github.com/cwszz/XPR/.
引用
收藏
页码:4193 / 4204
页数:12
相关论文
共 50 条
  • [31] Cross-Lingual Product Retrieval in E-Commerce Search
    Zhu, Wenya
    Lv, Xiaoyu
    Yang, Baosong
    Zhang, Yinghua
    Yong, Xu
    Xu, Linlong
    Feng, Yinfu
    Zhang, Haibo
    Da, Qing
    Zeng, Anxiang
    Chen, Ronghua
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 458 - 471
  • [32] Improved Cross-Lingual Question Retrieval for Community Question Answering
    Ruckle, Andreas
    Swarnkar, Krishnkant
    Gurevych, Iryna
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3179 - 3186
  • [33] MuSeCLIR: A Multiple Senses and Cross-lingual Information Retrieval dataset
    Li, Wing Yan
    Weeds, Julie
    Weir, David
    Proceedings - International Conference on Computational Linguistics, COLING, 2022, 29 (01): : 1128 - 1135
  • [34] English-Malayalam Cross-Lingual Information Retrieval - an experience
    Nikesh, P. L.
    Sumam, Mary Idicula
    David, Peter S.
    2008 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2008, : 271 - 275
  • [35] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [36] Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
    Wang, Yabing
    Dong, Jianfeng
    Liang, Tianxiang
    Zhang, Minsong
    Cai, Rui
    Wang, Xun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [37] Pivot-based Candidate Retrieval for Cross-lingual Entity Linking
    Liu, Qian
    Geng, Xiubo
    Lu, Jie
    Jiang, Daxin
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1076 - 1085
  • [38] CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task
    Sun, Shuo
    Sia, Suzanna
    Duh, Kevin
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 134 - 141
  • [39] Expressive Machine Dubbing Through Phrase-level Cross-lingual Prosody Transfer
    Swiatkowski, Jakub
    Wang, Duo
    Babianski, Mikolaj
    Coccia, Giuseppe
    Tobing, Patrick Lumban
    Vipperla, Ravichander
    Klimkov, Viacheslav
    Pollet, Vincent
    INTERSPEECH 2023, 2023, : 5546 - 5550
  • [40] Mind the Gap: Cross-Lingual Information Retrieval with Hierarchical Knowledge Enhancement
    Zhang, Fuwei
    Zhang, Zhao
    Ao, Xiang
    Gao, Dehong
    Zhuang, Fuzhen
    Wei, Yi
    He, Qing
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4345 - 4353