Fraud detection in social income transfer programs: a social data mining approach applied to data from Brazil

被引:0
作者
Diego de Castro Rodrigues
Márcio Dias de Lima
Rommel M. Barbosa
机构
[1] Instituto Federal Tocantins, TO, Dianópolis
[2] Instituto Federal de Educação Ciência e Tecnologia de Goiás, GO, Goiânia
[3] Universidade Federal de Goiás, GO, Goiânia
来源
SN Social Sciences | / 2卷 / 9期
关键词
CadÚnico; Data mining; Extreme poor; Fraud detection; Social data;
D O I
10.1007/s43545-022-00479-5
中图分类号
学科分类号
摘要
Several assistance policies have been adopted by the Brazilian government to minimize social problems. As a basis for these actions, the government created the Unified Registry for Social Programs of the Federal Government (CadÚnico). The CadÚnico database, comprising more than 20 million records and 65 attributes, was constructed with the aim of storing information about every person at a social risk in Brazil. This study aims to identify possible fraudulent cases in Brazilian social policy claims involving cash transfer using a social data mining approach. The approach takes into account the experiences of social workers besides implementing traditional data mining techniques (e.g., decision trees, generalized linear models, BayesNets, support vector machines, etc.). Via the proposed method, we identified more than 25 thousand cases of possible fraud with a success rate of 98.69%. We utilized the knowledge of groups of specialists in urban, state, and national social policies, together with data mining techniques, for validation. Identification of such cases is expected to aid the formulation of an approach that can address social demand based on correct social data. © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
引用
收藏
相关论文
共 75 条
[1]  
Abdallah A., Maarof M.A., Zainal A., Fraud detection system: a survey, J Netw Comput Appl, 68, pp. 90-113, (2016)
[2]  
Agrawal R., Singh J., Ghosh S.M., Performance appraisal of an educational institute using data mining techniques, Computing in engineering and technology, pp. 733-745, (2020)
[3]  
Aguiar G.F.M., Batista B.L., Rodrigues J.L., Silva L.R.S., Campiglia A.D., Barbosa R.M., Barbosa F., Determination of trace elements in bovine semen samples by inductively coupled plasma mass spectrometry and data mining techniques for identification of bovine class, J Dairy Sci, 95, 12, pp. 7066-7073, (2012)
[4]  
Ahmed M., Mahmood A.N., Islam M., A survey of anomaly detection techniques in financial domain, Futur Gener Comput Syst, 55, pp. 278-288, (2016)
[5]  
Anderson R., Mansingh G., Data mining approach to decision support in social welfare, Int J Bus Intell Res, 5, 2, pp. 39-61, (2014)
[6]  
Androutsopoulou A., Karacapilidis N., Loukis E., Charalabidis Y., Transforming the communication between citizens and government through AI-guided chatbots, Gov Inf Q, 36, 2, pp. 358-367, (2019)
[7]  
Barrientos A., Debowicz D., Woolard I., Heterogeneity in Bolsa Família outcomes, Q Rev Econ Finance, 62, pp. 33-40, (2016)
[8]  
Bauder R., Khoshgoftaar T., A survey of medicare data processing and integration for fraud detection, IEEE Int Conf Inf Reuse Integr, 2018, pp. 9-14, (2018)
[9]  
Bauder R., Khoshgoftaar T.M., Seliya N., A survey on the state of healthcare upcoding fraud analysis and detection, Health Serv Outcomes Res Method, 17, 1, pp. 31-55, (2017)
[10]  
Bedran-Martins A.M., Lemos M.C., Politics of drought under Bolsa Família program in Northeast Brazil, World Dev Perspect, 7-8, pp. 15-21, (2017)