The Automated Generation of Medical Reports from Polydactyly X-ray Images Using CNNs and Transformers

被引:0
作者
Vieira, Pablo de Abreu [1 ,2 ]
Mathew, Mano Joseph [2 ]
Neto, Pedro de Alcantara dos Santos [1 ]
Veloso e Silva, Romuere Rodrigues [1 ]
机构
[1] Fed Univ Piaui UFPI, Dept Comp, BR-64049550 Teresina, Piaui, Brazil
[2] EFREI, Ecole Ingn Gen Numerique, EFREI Res Lab, F-75003 Paris, France
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期
关键词
polydactyly; X-ray; generative artificial intelligence; CONVOLUTIONAL NEURAL-NETWORKS; FOOT; CLASSIFICATION; RADIOGRAPHS;
D O I
10.3390/app14156566
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Pododactyl radiography is a non-invasive procedure that enables the detection of foot pathologies, as it provides detailed images of structures such as the metatarsus and phalanges, among others. This examination holds potential for employment in CAD systems. Our proposed methodology employs generative artificial intelligence to analyze pododactyl radiographs and generate automatic medical reports. We used a dataset comprising 16,710 exams, including images and medical reports on pododactylys. We implemented preprocessing of the images and text, as well as data augmentation techniques to improve the representativeness of the dataset. The proposed CAD system integrates pre-trained CNNs for feature extraction from the images and Transformers for report interpretation and generation. Our objective is to provide reports describing pododactyl pathologies, such as plantar fasciitis, bunions, heel spurs, flat feet, and lesions, among others, offering a second opinion to the specialist. The results are promising, with BLEU scores (1 to 4) of 0.612, 0.552, 0.507, and 0.470, respectively, a METEOR score of 0.471, and a ROUGE-L score of 0.633, demonstrating the model's ability to generate reports with qualities close to those produced by specialists. We demonstrate that generative AI trained with pododactyl radiographs has the potential to assist in diagnoses from these examinations.
引用
收藏
页数:26
相关论文
共 69 条
  • [41] BLEU: a method for automatic evaluation of machine translation
    Papineni, K
    Roukos, S
    Ward, T
    Zhu, WJ
    [J]. 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 311 - 318
  • [42] Diagnostic captioning: a survey
    Pavlopoulos, John
    Kougia, Vasiliki
    Androutsopoulos, Ion
    Papamichail, Dimitris
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (07) : 1691 - 1722
  • [43] Pensec VD, 2004, J RHEUMATOL, V31, P66
  • [44] Pizer S. M., 1990, Proceedings of the First Conference on Visualization in Biomedical Computing (Cat. No.90TH0311-1), P337, DOI 10.1109/VBC.1990.109340
  • [45] Foot osteoarthritis: latest evidence and developments
    Roddy, Edward
    Menz, Hylton B.
    [J]. THERAPEUTIC ADVANCES IN MUSCULOSKELETAL DISEASE, 2018, 10 (04) : 91 - 103
  • [46] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [47] COMPLEXITIES OF FOOT ARCHITECTURE AS A BASE OF SUPPORT
    SALTZMAN, CL
    NAWOCZENSKI, DA
    [J]. JOURNAL OF ORTHOPAEDIC & SPORTS PHYSICAL THERAPY, 1995, 21 (06) : 354 - 360
  • [48] Saraiva A. A., 2019, BIOIMAGING P 12 INT, V2, P112, DOI [10.5220/0007404301120119, DOI 10.5220/0007404301120119]
  • [49] Selvaraju R.R., 2016, arXiv
  • [50] Gated contextual transformer network for multi-modal retinal image clinical description generation
    Shaik, Nagur Shareef
    Cherukuri, Teja Krishna
    [J]. IMAGE AND VISION COMPUTING, 2024, 143