Classification of Brazilian Supreme Federal Court Documents: A Comparative Study

Times cited: 0
Authors
Martins, Victor [1]
Silva, Cleison [2]
Affiliations
[1] PPCA UFPA, Tucuruí, Brazil
[2] PPCA UFPA, Tucuruí, Brazil
Source
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024 | 2024
Keywords
Machine Learning; Natural Language Processing; Document Classification; Law
DOI
10.1109/ICNLP60986.2024.10692383
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper evaluates the application of Artificial Intelligence algorithms to legal document classification. The algorithms studied are the linear classifiers SVM and Logistic Regression, the ULMFiT language model, and HAN. The dataset, called VICTOR, is composed of documents from the Brazilian Supreme Federal Court (STF). The article concludes that all of the machine learning algorithms tested can be applied to classify legal documents from this dataset. Additionally, despite being less complex, TF-IDF combined with linear classifiers outperforms the tested language model and HAN in terms of F1-score.
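As a rough illustration of the simplest approach named in the abstract, the sketch below pairs TF-IDF features with the two linear classifiers and scores them by F1. It assumes scikit-learn as the toolkit, which the record does not confirm. The toy corpus, the label names, and the choice of macro-averaged F1 are all hypothetical stand-ins, not the VICTOR data or the paper's actual setup.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy corpus (assumption): the real VICTOR documents are
# Portuguese legal texts and are not reproduced here.
train_texts = [
    "the appellant files an extraordinary appeal against the ruling",
    "appeal lodged by the appellant against the lower court decision",
    "the court rules and decides to dismiss the claim",
    "the panel rules unanimously and decides the matter",
    "order to notify the parties and forward the records",
    "order to forward the case records to the clerk",
]
train_labels = ["appeal", "appeal", "ruling", "ruling", "order", "order"]

test_texts = [
    "the appellant lodged an appeal against the decision",
    "the panel decides and rules on the claim",
    "order to notify the clerk and forward the records",
]
test_labels = ["appeal", "ruling", "order"]


def evaluate(classifier):
    """Fit a TF-IDF + linear classifier pipeline and return macro F1."""
    model = make_pipeline(TfidfVectorizer(), classifier)
    model.fit(train_texts, train_labels)
    predictions = model.predict(test_texts)
    # Macro averaging is an assumption; the abstract only says "F1-score".
    return f1_score(test_labels, predictions, average="macro")


scores = {
    "LinearSVC": evaluate(LinearSVC()),
    "LogisticRegression": evaluate(LogisticRegression(max_iter=1000)),
}

for name, score in scores.items():
    print(f"{name}: macro F1 = {score:.2f}")
```

Wrapping the vectorizer and classifier in one pipeline means the TF-IDF vocabulary is learned only from the training texts, so no information from the test split leaks into the features.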
Pages: 1 / 5
Page count: 5