Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports

Cited by: 49

Authors:

Bressem, Keno K. [1,2,3,4,5]

Adams, Lisa C. [1,2,3,4,5]

Gaudin, Robert A. [6]

Troeltzsch, Daniel [6]

Hamm, Bernd [1]

Makowski, Marcus R. [7]

Schuele, Chan-Yong [1]

Vahldiek, Janis L. [1]

Niehues, Stefan M. [1]

Affiliations:

[1] Charite, Dept Radiol, D-12203 Berlin, Germany

[2] Charite Univ Med Berlin, D-10117 Berlin, Germany

[3] Free Univ Berlin, D-10117 Berlin, Germany

[4] Humboldt Univ, D-10117 Berlin, Germany

[5] Berlin Inst Hlth, D-10117 Berlin, Germany

[6] Charite, Dept Oral & Maxillofacial Surg, D-12203 Berlin, Germany

[7] Tech Univ Munich, Sch Med, Dept Diagnost & Intervent Radiol, D-81675 Munich, Germany

Source:

BIOINFORMATICS | 2020 / Volume 36 / Issue 21

Keywords:

RADIOLOGY;

DOI:

10.1093/bioinformatics/btaa668

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Motivation: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) has led to improved results on several Natural Language Processing (NLP) benchmarks. In radiology especially, large amounts of free-text data are generated in the daily clinical workflow. These report texts could be particularly useful for generating labels in machine learning, especially for image classification. However, because report texts are mostly unstructured, advanced NLP methods are needed to enable accurate text classification. While neural networks can be used for this purpose, they must first be trained on large amounts of manually labelled data to achieve good results. In contrast, BERT models can be pre-trained on unlabelled data and then require fine-tuning on only a small amount of manually labelled data to achieve even better results.

Results: Using BERT to identify the most important findings in intensive care chest radiograph reports, we achieve areas under the receiver operating characteristic curve of 0.98 for congestion, 0.97 for effusion, 0.97 for consolidation and 0.99 for pneumothorax, surpassing the accuracy of previous approaches with comparatively little annotation effort. Our approach could therefore help to improve information extraction from free-text medical reports.
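The per-finding metric reported in the abstract, the area under the receiver operating characteristic curve (AUC), can be illustrated with a minimal sketch. This is not the authors' evaluation code: the helper `roc_auc`, the `reports` dictionary, and all labels and scores below are hypothetical and made up for illustration. The sketch uses the standard rank-sum identity, in which the AUC equals the fraction of positive/negative pairs that the classifier orders correctly.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical ground-truth labels and classifier scores per finding
# (illustrative values only, not data from the study).
reports = {
    "congestion": ([1, 0, 1, 0, 0], [0.9, 0.2, 0.7, 0.4, 0.1]),
    "pneumothorax": ([0, 0, 1, 0, 1], [0.1, 0.6, 0.8, 0.2, 0.5]),
}

# One AUC per finding, mirroring how the abstract reports one value per label.
aucs = {finding: roc_auc(y, s) for finding, (y, s) in reports.items()}
for finding, auc in aucs.items():
    print(f"{finding}: AUC = {auc:.2f}")
```

Computing one AUC per finding treats the task as a set of independent binary classifications, which matches the way the abstract lists a separate value for each of the four findings.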


Pages: 5255-5261

Page count: 7
