Prediction and Visualisation of SICONV Project Profiles Using Machine Learning

Cited by: 2
Authors
Andrade, Adriano de Oliveira [1]
Marques, Leonardo Garcia [2]
Resende, Osvaldo [3]
de Oliveira, Geraldo Andrade [3]
Souza, Leandro Rodrigues da Silva [3]
Pereira, Adriano Alves [1]
Affiliations
[1] Univ Fed Uberlandia, Ctr Innovat & Technol Assessment Hlth, Postgrad Program Elect & Biomed Engn, BR-38408100 Uberlandia, Brazil
[2] Inst Fed Educ Ciencia & Tecnol, Campus Itumbiara, BR-75524245 Itumbiara, Brazil
[3] Inst Fed Goiano, Campus Rio Verde, BR-75901970 Rio Verde, Brazil
Keywords
accountability; machine learning; t-SNE; PCA; BPMN; SICONV; MLR3; MANAGEMENT; SYSTEM; CLASSIFIERS; ALGORITHMS; MODEL; FRAUD
DOI
10.3390/systems10060252
Chinese Library Classification
C [Social Sciences, General]
Discipline classification code
03; 0303
Abstract
Background: Inefficient use of public funds can harm the lives of citizens. Machine learning-based technologies for data visualisation and prediction have opened the possibility of evaluating the accountability of publicly funded projects. Methods: This study describes the conception and evaluation of the architecture of a system for project profile definition and prediction. The system was used to analyse data from 20,942 government-funded projects registered in Brazil's System of Management of Agreements and Transfer Contracts (SICONV). SICONV is a Brazilian Government initiative that records the entire life cycle of agreements, transfer contracts, and partnership terms, from proposal formalisation to final accountability. Each project was represented by seven variables, all related to its timeline and budget. Project profiles were generated from data statistics and from clustering in a lower-dimensional space computed with t-SNE. Several classifier-based models for predicting project profiles were then tested and compared using performance measures. Results: Data clustering was achieved, and ten project profiles were defined as a result. Among 25 prediction models, k-nearest-neighbour (kknn) yielded the highest accuracy (0.991 ± 0.002). Conclusions: The system predicted SICONV project profiles accurately. It can help auditors and citizens evaluate the profiles of new and ongoing projects and identify inappropriate public funding.
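The prediction step in the abstract (a k-nearest-neighbour classifier scored by accuracy, reported as mean ± spread) can be illustrated with a minimal sketch. This is not the authors' implementation: the keywords suggest the study used MLR3, whereas this is plain Python. The function names (`knn_predict`, `cross_validated_accuracy`, `make_toy_profiles`), the choice of k = 3 and five folds, and the synthetic two-dimensional data standing in for the embedded project variables are all assumptions made for illustration.

```python
import math
import random
import statistics
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Return the majority label among the k training points closest to x."""
    nearest = sorted(
        range(len(train_X)), key=lambda i: math.dist(train_X[i], x)
    )[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


def cross_validated_accuracy(X, y, k=3, folds=5, seed=0):
    """Return (mean, sample standard deviation) of accuracy over the folds."""
    indices = list(range(len(X)))
    random.Random(seed).shuffle(indices)
    scores = []
    for f in range(folds):
        test_idx = indices[f::folds]          # every folds-th shuffled index
        held_out = set(test_idx)
        train_idx = [i for i in indices if i not in held_out]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(
            knn_predict(train_X, train_y, X[i], k) == y[i] for i in test_idx
        )
        scores.append(correct / len(test_idx))
    return statistics.mean(scores), statistics.stdev(scores)


def make_toy_profiles(n_per_profile=40, seed=1):
    """Hypothetical stand-in data: three well-separated 2-D clusters."""
    rng = random.Random(seed)
    centres = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
    X, y = [], []
    for label, (cx, cy) in enumerate(centres):
        for _ in range(n_per_profile):
            X.append((cx + rng.gauss(0, 1), cy + rng.gauss(0, 1)))
            y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_profiles()
    mean_acc, sd_acc = cross_validated_accuracy(X, y)
    print(f"accuracy: {mean_acc:.3f} +/- {sd_acc:.3f}")
```

In this sketch the labels play the role of the project profiles obtained from clustering, and the classifier is evaluated only on projects held out of its training set, which is what makes the reported accuracy a measure of prediction rather than of fit.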
Pages: 26