Using the Benford's Law as a First Step to Assess the Quality of the Cancer Registry Data

被引:13
|
作者
Crocetti, Emanuele [1 ]
Randi, Giorgia [1 ]
机构
[1] European Commiss, JRC, Directorate Hlth Consumers & Reference Mat F, Hlth Soc Unit, Ispra, Italy
关键词
cancer registry; incidence; data quality; Benford; methodology; FRAUD;
D O I
10.3389/fpubh.2016.00225
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Benfords law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benfords law, and if this can be used in their data quality check process. Methods: We sampled 43 population-based cancer registry populations (CRPs) from the Cancer Incidence in 5 Continents-volume X (CI5-X). The distribution of cancer incidence rate FSD was evaluated overall, by sex, and by CRP. Several statistics, including Pearsons coefficient of correlation and distance measures, were applied to check the adherence to the Benfords law. Results: In the whole dataset (146,590 incidence rates) and for each sex (70,722 male and 75,868 female incidence rates), the FSD distributions were Benford-like. The coefficient of correlation between observed and expected FSD distributions was extremely high (0.999), and the distance measures low. Considering single CRP (from 933 to 7,222 incidence rates), the results were in agreement with the Benfords law, and only a few CRPs showed possible discrepancies from it. Conclusion: This study demonstrated for the first time that cancer incidence rates follow Benfords law. This characteristic can be used as a new, simple, and objective tool in data quality evaluation. The analyzed data had been already checked for publication in CI5-X. Therefore, their quality was expected to be good. In fact, only for a few CRPs several statistics were consistent with possible violations.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Evaluation of data quality at the Gambia national cancer registry
    Shimakawa, Yusuke
    Bah, Ebrima
    Wild, Christopher P.
    Hall, Andrew J.
    INTERNATIONAL JOURNAL OF CANCER, 2013, 132 (03) : 658 - 665
  • [22] Evaluation of data quality at the Hungarian National Cancer Registry, 2000-2019
    Weber, Andras
    Mery, Les
    Nagy, Peter
    Polgar, Csaba
    Bray, Freddie
    Kenessey, Istvan
    CANCER EPIDEMIOLOGY, 2023, 82
  • [23] Data quality at the Singapore Cancer Registry: An overview of comparability, completeness, validity and timeliness
    Fung, Janice Wing Mei
    Lim, Sandra Bee Lay
    Zheng, Huili
    Ho, William Ying Tat
    Lee, Bee Guat
    Chow, Khuan Yew
    Lee, Hin Peng
    CANCER EPIDEMIOLOGY, 2016, 43 : 76 - 86
  • [24] Data quality at the Cancer Registry of Norway: An overview of comparability, completeness, validity and timeliness
    Larsen, Inger Kristin
    Smastuen, Milada
    Johannesen, Tom Borge
    Langmark, Froydis
    Parkin, Donald Maxwell
    Bray, Freddie
    Moller, Bjorn
    EUROPEAN JOURNAL OF CANCER, 2009, 45 (07) : 1218 - 1231
  • [25] Evaluation of thyroid cancer data completeness and quality at a population-based cancer registry, Algeria
    Boukheris, Houda
    Brakni, Lila
    Boubezari, Reda Fihri
    Bettayeb, Arslan
    Bouaidjra, Noureddine Bachir
    Houari, Amina Bensetti
    Brahim, Farouk Mohamed
    Simerabet, Azeddine
    Achour, Zineb
    Attar, Sara
    Saim, Hafida
    Berber, Necib
    BULLETIN DU CANCER, 2023, 110 (09) : 873 - 882
  • [26] Utilization of cancer registry data for monitoring quality of care
    Beatty, J. David
    Adachi, Mariko
    Bonham, Candy
    Atwood, Mary
    Potts, Mary S.
    Hafterson, Jennifer L.
    Aye, Ralph W.
    AMERICAN JOURNAL OF SURGERY, 2011, 201 (05) : 640 - 644
  • [27] Analyzing the Financial Statements of Companies listed on the National Stock Exchange using the Benford's Law
    Ranade, Madhura
    Gandhi, Mahak
    CARDIOMETRY, 2022, (24): : 940 - 947
  • [28] First data from a population based cancer registry in Ethiopia
    Timotewos, Genebo
    Solomon, Asmare
    Mathewos, Assefa
    Addissie, Adamu
    Bogale, Solomon
    Wondemagegnehu, Tigeneh
    Aynalem, Abraha
    Ayalnesh, Bekele
    Dagnechew, Hailemariam
    Bireda, Wondatir
    Kroeber, Eric Sven
    Mikolajczyk, Rafael
    Bray, Freddie
    Jemal, Ahmedin
    Kantelhardt, Eva Johanna
    CANCER EPIDEMIOLOGY, 2018, 53 : 93 - 98
  • [29] Is quality of registry treatment data related to registrar experience and workload? A study of Taiwan cancer registry data
    Cheng, Chin-Ying
    Chiang, Chun-Ju
    Hsieh, Cheng-Hsing
    Chang, You-Kang
    Lai, Mei-Shu
    JOURNAL OF THE FORMOSAN MEDICAL ASSOCIATION, 2018, 117 (12) : 1093 - 1100
  • [30] Data quality at the Bulgarian National Cancer Registry: An overview of comparability, completeness, validity and timeliness
    Dimitrova, Nadya
    Parkin, Donald Maxwell
    CANCER EPIDEMIOLOGY, 2015, 39 (03) : 405 - 413