Machine Learning Applied to Open Government Data for the Detection of Improprieties in the Application of Public Resources

被引:0
|
作者
Vaqueiro, Ramon Dantas [1 ]
Vargas, Ana Caroline G. [1 ]
Escovedo, Tatiana [1 ]
Kalinowski, Marcos [1 ]
机构
[1] Pontificia Univ Catolica Rio de Janeiro PUC Rio, Rio De Janeiro, RJ, Brazil
来源
PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS | 2023年
关键词
Public Purchases; Text Mining; Machine Learning;
D O I
10.1145/3592813.3592908
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Making government data publicly available is an important mechanism of transparency and social control. In this regard, inumerous laws have made it mandatory to divulgate government procurement data. Problem: The large volume of unstructured textual information available on government portals is an obstacle to effective social control. Making it difficult to do more in-depth analyzes of public spending. Solution: Use of Machine Learning algorithms to perform text mining and grouping items acquired by public administration. Labeling public purchases and grouping similar items, in order to facilitate the detection of improprieties in government purchases. IS Theory: This work is associated with the Theory of Computational Learning, which aims to understand the fundamental principles of learning and design better-automated methods. Method: The article is a case study, and its evaluation was executed with the support of specialists in the field. The results were analyzed based on a quantitative approach. Summary of Results: The results observed in the evaluated cases were promising, the resulting clusters from the application of the solution had sufficiently coherent semantic values, in order to allow more complex analyzes of government purchases. Contributions and Impact in the IS area: The results show that applying text mining and machine learning techniques can extract useful information from government purchases data and allowing to perform better analyzes of public spending.
引用
收藏
页码:213 / 220
页数:8
相关论文
共 50 条
  • [31] Application of machine learning methods to palaeoecological data
    Jeraj, M
    Dzeroski, S
    Todorovski, L
    Debeljak, M
    ECOLOGICAL MODELLING, 2006, 191 (01) : 159 - 169
  • [32] The Application of Machine Learning Algorithms in Data Mining
    Zhang, Wei
    2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 521 - 527
  • [33] The Application of the Machine Learning Method in Electromyographic Data
    Liu, Tao
    Li, Zechen
    Tang, Yuqi
    Yang, Dongdong
    Jin, Shuoguo
    Guan, Junwen
    IEEE ACCESS, 2020, 8 : 9196 - 9208
  • [34] Prediction of state anxiety by machine learning applied to photoplethysmography data
    Perpetuini, David
    Chiarelli, Antonio Maria
    Cardone, Daniela
    Filippini, Chiara
    Rinella, Sergio
    Massimino, Simona
    Bianco, Francesco
    Bucciarelli, Valentina
    Vinciguerra, Vincenzo
    Fallica, Piero
    Perciavalle, Vincenzo
    Gallina, Sabina
    Conoci, Sabrina
    Merla, Arcangelo
    PEERJ, 2021, 9
  • [35] Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers
    Xie, Wenxiu
    Ji, Meng
    Huang, Riliu
    Hao, Tianyong
    Chow, Chi-Yin
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (16)
  • [36] Smart equipment failure detection with machine learning applied to thermography inspection data in modern power systems
    Maria Garzon, Ana
    Laiton, Natalia
    Sicacha, Victor
    Celeita, David F.
    Trung Dung Le
    2023 11TH INTERNATIONAL CONFERENCE ON SMART GRID, ICSMARTGRID, 2023,
  • [37] Liberation of public data: Exploring central themes in open government data and freedom of information research
    Afful-Dadzie, Eric
    Afful-Dadzie, Anthony
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2017, 37 (06) : 664 - 672
  • [38] Detection of Suspected Collusion Cases in Public Biddings: an Application of the a Priori Machine Learning Algorithm for the State of Paraiba
    Brito Ramalho, Hilton Martins
    Cavalcanti de Almeida, Alessio Tony
    Fraga, Alcimar Alves
    TEORIA E PRATICA EM ADMINISTRACAO-TPA, 2020, 10 (02): : 5 - 22
  • [39] Application of machine learning in the development of databases of recommendation resources and music services
    Tcyguleva, Kseniya, V
    Stepanova, Anna E.
    Silnov, Dmitry S.
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [40] Collusion detection in public procurement auctions with machine learning algorithms
    Garcia Rodriguez, Manuel J.
    Rodriguez-Montequin, Vicente
    Ballesteros-Perez, Pablo
    Love, Peter E. D.
    Signor, Regis
    AUTOMATION IN CONSTRUCTION, 2022, 133