Exploring Cancer Incidence, Risk Factors, and Mortality in the Lleida Region: Interactive, Open-source R Shiny Application for Cancer Data Analysis

Cited by: 0
Authors
Florensa, Didac [1, 2, 3]
Mateo-Fornes, Jordi [1]
Sorribes, Sergi Lopez [1]
Tuca, Anna Torres [1]
Solsona, Francesc [1]
Godoy, Pere [2, 3, 4]
Affiliations
[1] Univ Lleida, Dept Comp Engn, C Jaume II 69, Lleida 25002, Spain
[2] Santa Maria Univ Hosp, Populat Based Canc Registry, Lleida, Spain
[3] Lleida Biomed Res Inst, Field Epidemiol Unit, Lleida, Spain
[4] Hlth Inst Carlos III, CIBER Epidemiol & Publ Hlth CIBERESP, Madrid, Spain
Source
JMIR CANCER | 2023 / Volume 9 / Issue 01
Keywords
R Shiny; cloud computing; microservices; Docker; decision support system; cancer incidence; cancer risk factors; cancer mortality; TRENDS; SMOKING;
DOI
10.2196/44695
Chinese Library Classification number
R73 [Oncology];
Discipline classification code
100214;
Abstract
Background: The cancer incidence rate is essential to public health surveillance. Analyzing this information allows authorities to understand the cancer situation in their regions, in particular to determine cancer patterns, monitor cancer trends, and help prioritize the allocation of health resources.
Objective: This study aimed to present the design and implementation of an R Shiny application that helps cancer registries conduct rapid descriptive and predictive analytics in a user-friendly, intuitive, portable, and scalable way. Moreover, we wanted to describe the design and implementation road map to inspire other population registries to exploit their data sets and develop similar tools and models.
Methods: The first step was to consolidate the data into the population registry cancer database. These data were cross-validated by ASEDAT software, checked later, and reviewed by experts. Next, we developed an online tool under the R Shiny framework to visualize the data and generate reports to assist decision-making. Currently, the application can generate descriptive analytics using population variables, such as age, sex, and cancer type; cancer incidence in region-level geographical heat maps; line plots to visualize temporal trends; and typical risk factor plots. The application also shows descriptive plots about cancer mortality in the Lleida region. This web platform was built as a microservices cloud platform. The web back end consists of an application programming interface and a database, implemented with NodeJS and MongoDB, respectively. All these parts were encapsulated and deployed with Docker and Docker Compose.
Results: The results provide a successful case study in which the tool was applied to the cancer registry of the Lleida region. The study illustrates how researchers and cancer registries can use the application to analyze cancer databases. Furthermore, the results highlight the analytics related to risk factors, second tumors, and cancer mortality. The application shows the incidence and evolution of each cancer during a specific period by gender, age group, and cancer location, among other functionalities. The risk factors view allowed us to detect that approximately 60% of cancer patients had excess weight at diagnosis. Regarding mortality, the application showed that lung cancer registered the highest number of deaths for both genders. Breast cancer was the lethal cancer in women. Finally, a customization guide for deploying the architecture presented was included as a result of this implementation.
Conclusions: This paper aimed to document a successful methodology for exploiting the data in population cancer registries and to propose guidelines that other registries can follow to develop similar tools. We intend to inspire other entities to build an application that can help decision-making and make data more accessible and transparent for the community of users.
Pages: 11