Levelling up quantitative legislative studies on Central-Eastern Europe: Introducing the ParlText CEE Database of Speeches, Bills, and Laws

被引:0
|
作者
Sebok, Miklos [1 ]
Molnar, Csaba [1 ]
Takacs, Anna [1 ]
机构
[1] HUN REN Ctr Social Sci, Budapest, Hungary
来源
INTERSECTIONS-EAST EUROPEAN JOURNAL OF SOCIETY AND POLITICS | 2024年 / 10卷 / 04期
关键词
Central-Eastern Europe; legislative studies; legislative database; parliamentary speeches; bills and laws; BIG DATA; TRANSPARENCY;
D O I
10.17356/ieejsp.v10i4.1327
中图分类号
R47 [护理学];
学科分类号
1011 ;
摘要
The availability of ready-made textual corpora for research is crucial for social scien- tists, especially in the current era of rapid advancements in natural language process- ing (NLP) and artificial intelligence (AI) methods. Despite various useful contributions that address issues of accessibility and standardisation when it comes to such corpora, in many cases, they have limitations related to scope, geographical coverage, and time frame. This concern is particularly significant in the context of political research on Central-Eastern Europe (CEE), for which such deployment-ready databases are few and far between. In this research note, we bridge part of this gap by making available a new database: ParlText CEE. The database, prepared under the auspices of the V-Shift Momentum project at the HUN-REN Centre for Social Sciences, covers almost 1.9 mil- lion text vectors and metadata for parliamentary speeches, bills, and laws for Czechia, Hungary, Poland, and Slovakia for the period from 1990-1991 to 2022-2024. The data- sets encompass relevant dates, texts, titles, and, in the case of the speech corpora, parliamentary agendas, speaker names, and parties. All data are also linked based on unique identifiers following the ParlLawSpeech standard. This paper introduces the specifics of the 1.0 release of ParlText CEE and contemplates its possible use cases.
引用
收藏
页码:106 / 125
页数:20
相关论文
共 1 条