Patapasco: A Python']Python Framework for Cross-Language Information Retrieval Experiments

被引:2
作者
Costello, Cash [1 ]
Yang, Eugene [1 ]
Lawrie, Dawn [1 ]
Mayfield, James [1 ]
机构
[1] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21211 USA
来源
ADVANCES IN INFORMATION RETRIEVAL, PT II | 2022年 / 13186卷
关键词
Cross-language information retrieval; CLIR; Experimental framework; Reproducible experiments;
D O I
10.1007/978-3-030-99739-7_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many language pairs, to be scalable to large document collections, and to support reproducible experiments driven by a configuration file. We include Patapsco results on standard CLIR collections using multiple settings.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 8 条
[1]  
Darwish K., 2003, P 26 ANN INT ACM SIG, P338
[2]  
Honnibal M., 2020, Zenodo, DOI [DOI 10.5281/ZENODO.1212303, DOI 10.5281/ZENODO.14494472]
[3]   Pyserini: A Python']Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations [J].
Lin, Jimmy ;
Ma, Xueguang ;
Lin, Sheng-Chieh ;
Yang, Jheng-Hong ;
Pradeep, Ronak ;
Nogueira, Rodrigo .
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, :2356-2362
[4]   OpenNIR: A Complete Neural Ad-Hoc Ranking Pipeline [J].
MacAvaney, Sean .
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, :845-848
[5]  
Macdonald C., 2020, P ICTIR 2020
[6]   Cross-Language System Evaluation: The CLEF campaigns [J].
Peters, C ;
Braschler, M .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (12) :1067-1072
[7]  
Qi P, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, P101
[8]   Pytrec_eval: An Extremely Fast Python']Python Interface to trec_eval [J].
Van Gysel, Christophe ;
de Rijke, Maarten .
ACM/SIGIR PROCEEDINGS 2018, 2018, :873-876