A Multi-modal Deep Learning Method for Classifying Chest Radiology Exams

被引：7

作者：

Nunes, Nelson ^{[1
]}

Martins, Bruno ^{[1
]}

da Silva, Nuno Andre ^{[2
]}

Leite, Francisca ^{[2
]}

Silva, Mario J. ^{[1
]}

机构：

[1] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal

[2] Luz Saude, Hosp Luz Learning Hlth, Lisbon, Portugal

来源：

PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2019, PT I | 2019年 / 11804卷

关键词：

Classification of radiology exams; Machine learning in medicine; Learning from multi-modal data; Deep learning;

D O I：

10.1007/978-3-030-30241-2_28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Non-invasive medical imaging techniques, such as radiography or computed tomography, are extensively used in hospitals and clinics for the diagnosis of diverse injuries or diseases. However, the interpretation of these images, which often results in a free-text radiology report and/or a classification, requires specialized medical professionals, leading to high labor costs and waiting lists. Automatic inference of thoracic diseases from the results of chest radiography exams, e.g. for the purpose of indexing these documents, is still a challenging task, even if combining images with the free-text reports. Deep neural architectures can contribute to a more efficient indexing of radiology exams (e.g., associating the data to diagnostic codes), providing interpretable classification results that can guide the domain experts. This work proposes a novel multi-modal approach, combining a dual path convolutional neural network for processing images with a bidirectional recurrent neural network for processing text, enhanced with attention mechanisms and leveraging pre-trained clinical word embeddings. The experimental results show interesting patterns, e.g. validating the high performance of the individual components, and showing promising results for the multi-modal processing of radiology examination data, particularly when pre-training the components of the model with large pre-existing datasets (i.e., a 10% increase in terms of the average value for the areas under the receiver operating characteristic curves).

引用

页码：323 / 335

页数：13

共 33 条

[1] Alsentzer E, 2019, P 2 CLIN NAT LANG PR, P72, DOI [DOI 10.18653/V1/W19-1909, 10.18653]
[2] [Anonymous], 2016, ARXIV160205980
[3] Chen, 2017, ADV NEUR IN
[4] Chen Q., 2018, COLING
[5] Preparing a collection of radiology examinations for distribution and retrieval
Demner-Fushman, Dina
Kohli, Marc D.
Rosenman, Marc B.
Shooshan, Sonya E.
Rodriguez, Laritza
Antani, Sameer
Thoma, George R.
McDonald, Clement J.
[J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (02) : 304 - 310
[6] Devlin J., 2018, ARXIV
[7] Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text
Duarte, Francisco
Martins, Bruno
Pinto, Catia Sousa
Silva, Mario J.
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 80 : 64 - 77
[8] Eger S., 2018, P C EMP METH NAT LAN
[9] Goldberg Y., 2017, SYNTHESIS LECT HUMAN, V37
[10] Fleischner Society:: Glossary of terms tor thoracic imaging
Hansell, David M.
Bankier, Alexander A.
MacMahon, Heber
McLoud, Theresa C.
Mueller, Nestor L.
Remy, Jacques
[J]. RADIOLOGY, 2008, 246 (03) : 697 - 722

← 1 2 3 4 →