The Legislative Influence Detector: Finding Text Reuse in State Legislation

被引:9
作者
Burgess, Matthew [1 ]
Giraudy, Eugenia [2 ]
Katz-Samuels, Julian [1 ]
Walsh, Joe [3 ]
Willis, Derek [4 ]
Haynes, Lauren [3 ]
Ghani, Rayid [3 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Univ Chicago, Chicago, IL 60637 USA
[4] ProPublica, New York, NY USA
来源
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2016年
关键词
Social Good; Government Transparency; Text Reuse; Machine Learning;
D O I
10.1145/2939672.2939697
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State legislatures introduce at least 45,000 bills each year. However, we lack a clear understanding of who is actually writing those bills. As legislators often lack the time and staff to draft each bill, they frequently copy text written by other states or interest groups. However, existing approaches to detect text reuse are slow, biased, and incomplete. Journalists or researchers who want to know where a particular bill originated must perform a largely manual search. Watchdog organizations even hire armies of volunteers to monitor legislation for matches. Given the time-consuming nature of the analysis, journalists and researchers tend to limit their analysis to a subset of topics (e.g. abortion or gun control) or a few interest groups. This paper presents the Legislative Influence Detector (LID). LID uses the Smith-Waterman local alignment algorithm to detect sequences of text that occur in model legislation and state bills. As it is computationally too expensive to run this algorithm on a large corpus of data, we use a search engine built using Elasticsearch to limit the number of comparisons. We show how LID has found 45,405 instances of bill-to-bill text reuse and 14,137 instances of model-legislation-to-bill text reuse. LID reduces the time it takes to manually find text reuse from days to seconds.
引用
收藏
页码:57 / 65
页数:9
相关论文
共 26 条
[1]   THE WELFARE-STATE AS TRANSNATIONAL EVENT - EVIDENCE FROM SEQUENCES OF POLICY ADOPTION [J].
ABBOTT, A ;
DEVINEY, S .
SOCIAL SCIENCE HISTORY, 1992, 16 (02) :245-274
[2]  
[Anonymous], FULL PART TIM LEG
[3]  
[Anonymous], 2015, F NEWS
[4]  
[Anonymous], 2015, N R LIFE 12 STATES H
[5]  
Bollman J., 2015, COMMUNICATION 0812
[6]  
Brandeis Louis Dembitz., 1932, New State Ice Co. v. Liebmann, P285
[7]  
Drummond Chris, 2009, Replicability is not reproducibility: nor is it good science
[8]  
Elsayed T., 2008, P 46 ANN M ASS COMPU, P265, DOI DOI 10.3115/1557690.1557767
[9]  
Enda J., 2014, AM SHIFTING STATEHOU
[10]  
Fording RichardC., 2003, Race and the politics of welfare reform, P298