Using the Benford's Law as a First Step to Assess the Quality of the Cancer Registry Data

被引:13
|
作者
Crocetti, Emanuele [1 ]
Randi, Giorgia [1 ]
机构
[1] European Commiss, JRC, Directorate Hlth Consumers & Reference Mat F, Hlth Soc Unit, Ispra, Italy
关键词
cancer registry; incidence; data quality; Benford; methodology; FRAUD;
D O I
10.3389/fpubh.2016.00225
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Benfords law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benfords law, and if this can be used in their data quality check process. Methods: We sampled 43 population-based cancer registry populations (CRPs) from the Cancer Incidence in 5 Continents-volume X (CI5-X). The distribution of cancer incidence rate FSD was evaluated overall, by sex, and by CRP. Several statistics, including Pearsons coefficient of correlation and distance measures, were applied to check the adherence to the Benfords law. Results: In the whole dataset (146,590 incidence rates) and for each sex (70,722 male and 75,868 female incidence rates), the FSD distributions were Benford-like. The coefficient of correlation between observed and expected FSD distributions was extremely high (0.999), and the distance measures low. Considering single CRP (from 933 to 7,222 incidence rates), the results were in agreement with the Benfords law, and only a few CRPs showed possible discrepancies from it. Conclusion: This study demonstrated for the first time that cancer incidence rates follow Benfords law. This characteristic can be used as a new, simple, and objective tool in data quality evaluation. The analyzed data had been already checked for publication in CI5-X. Therefore, their quality was expected to be good. In fact, only for a few CRPs several statistics were consistent with possible violations.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Using Cognitive Interview Data to Assess Data Quality and the Cultural Norms of Survey Participants
    Biagas, David E., Jr.
    BMS-BULLETIN OF SOCIOLOGICAL METHODOLOGY-BULLETIN DE METHODOLOGIE SOCIOLOGIQUE, 2019, 144 (01): : 40 - 54
  • [42] An audit of cancer of unknown primary notifications: A cautionary tale for population health research using cancer registry data
    Vajdic, Claire M.
    Er, Chuang Ching
    Schaffer, Andrea
    Dobbins, Timothy
    Wyld, Lucy
    Meagher, Nicola S.
    Barrett, Jane
    Ward, Robyn L.
    Pearson, Sallie-Anne
    CANCER EPIDEMIOLOGY, 2014, 38 (04) : 460 - 464
  • [43] Data quality in population-based cancer registration: An assessment of the Merseyside and Cheshire Cancer Registry
    Seddon, DJ
    Williams, EMI
    BRITISH JOURNAL OF CANCER, 1997, 76 (05) : 667 - 674
  • [44] Claims Data Linked to Hospital Registry Data Enhance Evaluation of the Quality of Care of Breast Cancer
    Meguerditchian, Ari-Nareg
    Stewart, Andrew
    Roistacher, James
    Watroba, Nancy
    Cropp, Michael
    Edge, Stephen B.
    JOURNAL OF SURGICAL ONCOLOGY, 2010, 101 (07) : 593 - 599
  • [45] Demonstration of quality of care measurement using the Japanese liver cancer registry
    Higashi, Takahiro
    Hasegawa, Kiyoshi
    Kokudo, Norihiro
    Makuuchi, Masatoshi
    Izumi, Namiki
    Ichida, Takafumi
    Kudo, Masatoshi
    Ku, Yonson
    Sakamoto, Michiie
    Nakashima, Osamu
    Matsui, Osamu
    Matsuyama, Yutaka
    Sobue, Tomotaka
    HEPATOLOGY RESEARCH, 2011, 41 (12) : 1208 - 1215
  • [46] Incidence of prostate cancer in Sri Lanka using cancer registry data and comparisons with the incidence in South Asian men in England
    Ranasinghe, Weranja K. B.
    Sibanda, Thabani
    de Silva, M. V. C.
    Ranasinghe, Tamra I. J.
    Persad, Raj
    BJU INTERNATIONAL, 2011, 108 (8B) : E184 - E189
  • [47] Monitoring the data quality of data streams using a two-step control scheme
    Yu, Miaomiao
    Wu, Chunjie
    Tsung, Fugee
    IISE TRANSACTIONS, 2019, 51 (09) : 985 - 998
  • [48] Data Quality or Differences in Oncological Care? - Standards of Reporting for Cancer Survival Analyses Based on Registry Data A Proposal by the Association of Population-Based Cancer Registries in Germany
    Nennecke, A.
    Barnes, B.
    Brenner, H.
    Eberle, A.
    Emrich, K.
    Eisemann, N.
    Geiss, K.
    Hentschel, S.
    Holleczek, B.
    Kraywinkel, K.
    Stabenow, R.
    Hense, H. -W.
    GESUNDHEITSWESEN, 2013, 75 (02) : 94 - 98
  • [49] Cancer survival analysis for patients using population-based cancer registry data
    Ito, Yuri
    CANCER SCIENCE, 2018, 109 : 70 - 70
  • [50] Data quality and auditing within the Netherlands Heart Registration: using the PCI registry as an example
    S. Houterman
    A. van Dullemen
    M. Versteegh
    W. Aengevaeren
    P. Danse
    E. Brinkman
    D. Schuurman
    D. van Veghel
    Netherlands Heart Journal, 2023, 31 : 334 - 339