Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis

被引:0
作者
Ferreira, Patricia Pita [1 ,2 ,3 ]
Simoes, Diogo Godinho [1 ,4 ]
de Carvalho, Constanca Pinto [1 ,5 ]
Duarte, Francisco
Fernandes, Eugenia [1 ]
Carvalho, Pedro Casaca [1 ]
Loff, Jose Francisco [6 ]
Soares, Ana Paula [1 ]
Albuquerque, Maria Joao [1 ]
Pinto-Leite, Pedro [1 ]
Peralta-Santos, Andre [1 ,7 ,8 ]
机构
[1] Direcao Geral Saude, Direcao Serv Informacao & Anal, Alameda D Afonso Henriques 45, P-1049005 Lisbon, Portugal
[2] Adm Reg Saude Lisboa & Vale Tejo, Agrupamento Ctr Saude Oeste Norte, Unidade Saude Publ Ze Povinho, Caldas Da Rainha, Portugal
[3] Univ NOVA Lisboa, NOVA Natl Sch Publ Hlth, Lisbon, Portugal
[4] Adm Reg Saude Lisboa & Vale Tejo, Agrupamento Ctr Saude Almada Seixal, Unidade Saude Publ Higeia, Almada, Portugal
[5] Adm Reg Saude Alentejo, Unidade Saude Publ Litoral Alentejano, Unidade Local Saude Litoral Alentejano, Santiago Do Cacem, Portugal
[6] phiStat Stat Consulting, Lisbon, Portugal
[7] Univ NOVA Lisboa, Publ Hlth Res Ctr, NOVA Natl Sch Publ Hlth, Lisbon, Portugal
[8] Univ NOVA Lisboa, Comprehens Hlth Res Ctr, Lisbon, Portugal
来源
JMIR AI | 2023年 / 2卷
关键词
artificial intelligence; AI; mortality; deep neural networks; evaluation; machine learning; deep learning; mortality statistics; underlying cause of death; EXCESS MORTALITY; SYSTEM; MODELS;
D O I
10.2196/40965
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians' death certificates (DCs). Although AUTOCOD's performance has been established, it remains unclear whether its performance remains consistent over time, particularly during periods of excess mortality. Objective: This study aims to assess the sensitivity and other performance metrics of AUTOCOD in classifying underlying causes of death compared with manual coding to identify specific causes of death during periods of excess mortality. Methods: We included all DCs between 2016 and 2019. AUTOCOD's performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F 1-score, using a confusion matrix. This compared International Statistical Classification of Diseases and Health-Related Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score>4 SDs, and extreme excess mortality as 2 consecutive days with a Z score>6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters focusing on block-level classification. Results: We analyzed a large data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (>= 0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the more prevalent chapters (chapter II-"Neoplasms," chapter IX-"Diseases of the circulatory system," and chapter X-"Diseases of the respiratory system"), accounting for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. When considering block classification within the 3 most common ICD-10 chapters, AUTOCOD maintained a high performance, demonstrating high sensitivity (>= 0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality. Conclusions: Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD's performance remains unaffected by potential text quality degradation because of pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.
引用
收藏
页数:19
相关论文
共 37 条
[1]  
[Anonymous], 2021, Passo 1: check-list de auto-avaliacao etica
[2]  
[Anonymous], 2022, Westgard QC03-21
[3]  
[Anonymous], Graphs and maps
[4]  
[Anonymous], 2012, Lei n. 15/2012. Diario da Republica
[5]  
[Anonymous], 2014, Impact of the Implementation of IRIS software for ICD-10 cause of death coding on mortality statistics, England and Wales
[6]  
Della Mea V., 2020, Inform Med Unlocked, V21, DOI [10.1016/j.imu.2020.100456, DOI 10.1016/J.IMU.2020.100456]
[7]  
Drost H-G., 2018, J Open Source Software, V3, P765, DOI [10.21105/joss.00765, DOI 10.21105/JOSS.00765]
[8]  
Duarte FR, 2017, Thesis
[9]   Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text [J].
Duarte, Francisco ;
Martins, Bruno ;
Pinto, Catia Sousa ;
Silva, Mario J. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 80 :64-77
[10]   Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment [J].
Falissard, Louis ;
Morgand, Claire ;
Ghosn, Walid ;
Imbaud, Claire ;
Bounebache, Karim ;
Rey, Gregoire .
JMIR MEDICAL INFORMATICS, 2022, 10 (04) :115-129