Natural Language Generation Model for Mammography Reports Simulation

被引:15
|
作者
Hoogi, Assaf [1 ]
Mishra, Arjun [2 ]
Gimenez, Francisco [1 ]
Dong, Jeffrey [1 ]
Rubin, Daniel [1 ]
机构
[1] Stanford Univ, Biomed Data Sci, Stanford, CA 94305 USA
[2] Berkeley City Coll, Comp Sci, Berkeley, CA 94704 USA
关键词
Natural language generation; mammography reports; RNN-LSTM; simulation;
D O I
10.1109/JBHI.2020.2980118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extending the size of labeled corpora of medical reports is a major step towards a successful training of machine learning algorithms. Simulating new text reports is a key solution for reports augmentation, which extends the cohort size. However, text generation in the medical domain is challenging because it needs to preserve both content and style that are typical for real reports, without risking the patients' privacy. In this paper, we present a conditioned LSTM-RNN architecture for simulating realistic mammography reports. We evaluated the performance by analyzing the characteristics of the simulated reports and classifying them into benign and malignant classes. An average classification AUC was calculated over two distinct test sets. A qualitative analysis was also performed in which a masked radiologist classified 0.75 of the simulated reports as real reports, showing that both the style and content of the simulated reports were similar to real reports. Finally, we compared our RNN-LSTM generative model with Markov Random Fields. The RNN-LSTM provided significantly better and more stable performance than MRFs (p< 0.01, Wilcoxon).
引用
收藏
页码:2711 / 2717
页数:7
相关论文
共 50 条
  • [11] EXPERIENCES WITH PLANNING FOR NATURAL LANGUAGE GENERATION
    Koller, Alexander
    Petrick, Ronald P. A.
    COMPUTATIONAL INTELLIGENCE, 2011, 27 (01) : 23 - 40
  • [12] Natural language generation of surgical procedures
    Wagner, JC
    Rogers, JE
    Baud, RH
    Scherrer, JR
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 1999, 53 (2-3) : 175 - 192
  • [13] Automatic Generation of Natural Language Explanations
    Costa, Felipe
    Ouyang, Sixun
    Dolog, Peter
    Lawlor, Aonghus
    COMPANION OF THE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI'18), 2018,
  • [14] Natural Language Generation from Graphs
    Dong, Ngan T.
    Holder, Lawrence B.
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2014, 8 (03) : 335 - 384
  • [15] A KNOWLEDGE REPRESENTATION LANGUAGE FOR NATURAL LANGUAGE PROCESSING, SIMULATION AND REASONING
    McShane, Marjorie
    Nirenburg, Sergei
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2012, 6 (01) : 3 - 23
  • [16] Selectional restrictions in natural language sentence generation
    Kozlowski, R
    McCoy, KF
    Vijay-Shanker, K
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMPUTER SCIENCE II, 2002, : 127 - 132
  • [17] A SYSTEM FOR NATURAL-LANGUAGE SENTENCE GENERATION
    LEVISON, M
    LESSARD, G
    COMPUTERS AND THE HUMANITIES, 1992, 26 (01): : 43 - 58
  • [18] Natural Language Generation: Revision of the State of the Art
    Vicente, Marta
    Barros, Cristina
    Peregrino, Fernando S.
    Agullo, Francisco
    Lloret, Elena
    COMPUTACION Y SISTEMAS, 2015, 19 (04): : 721 - 756
  • [20] Roles of Natural Language Generation for Requirements Engineering
    de Brock, Bert
    BUSINESS MODELING AND SOFTWARE DESIGN, BMSD 2024, 2024, 523 : 229 - 239