TARexp: A Python']Python Framework for Technology-Assisted Review Experiments

被引:6
作者
Yang, Eugene [1 ]
Lewis, David D. [2 ]
机构
[1] Johns Hopkins Univ, HLTCOE, Baltimore, MD 21218 USA
[2] Redgrave Data, Chantilly, VA USA
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
关键词
reproducible experiments; technology-assisted review; eDiscovery; systematic review; opensource;
D O I
10.1145/3477495.3531663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Technology-assisted review (TAR) is an important industrial application of information retrieval (IR) and machine learning (ML). While a small TAR research community exists, the complexity of TAR software and workflows is a major barrier to entry. Drawing on past open source TAR efforts, as well as design patterns from the IR and ML open source software, we present an open source Python framework for conducting experiments on TAR algorithms. Key characteristics of this framework are declarative representations of workflows and experiment plans, the ability for components to play variable numbers of workflow roles, and state maintenance and restart capabilities. Users can draw on reference implementations of standard TAR algorithms while incorporating novel components to explore their research interests. The framework is available at https://github.com/eugene- yang/tarexp.
引用
收藏
页码:3256 / 3261
页数:6
相关论文
共 47 条
[1]   A System for Efficient High-Recall Retrieval [J].
Abualsaud, Mustafa ;
Ghelani, Nimesh ;
Zhang, Haotian ;
Smucker, Mark D. ;
Cormack, Gordon V. ;
Grossman, Maura R. .
ACM/SIGIR PROCEEDINGS 2018, 2018, :1317-1320
[2]  
Amiri Saba, 2019, 2019 15th International Conference on eScience (eScience). Proceedings, P650, DOI 10.1109/eScience.2019.00102
[3]  
[Anonymous], 2018, ARXIV180304818
[4]  
[Anonymous], 2006, TREC
[5]  
[Anonymous], P 2020 15 S PIEZ AC
[6]  
Baron J., 2016, Perspectives on Predictive Coding: And Other Advanced Search Methods for the Legal Practitioner
[7]  
Cartright Marc-Allen, 2012, SIGIR 2012 WORKSH OP, P25
[8]   The neighbourhood physical environment and active travel in older adults: a systematic review and meta-analysis [J].
Cerin, Ester ;
Nathan, Andrea ;
van Cauwenberg, Jelle ;
Barnett, David W. ;
Barnett, Anthony .
INTERNATIONAL JOURNAL OF BEHAVIORAL NUTRITION AND PHYSICAL ACTIVITY, 2017, 14
[9]  
Cormack G. V., 2010, TREC
[10]   Engineering Quality and Reliability in Technology-Assisted Review [J].
Cormack, Gordon V. ;
Grossman, Maura R. .
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, :75-84