The Automated Generation of Medical Reports from Polydactyly X-ray Images Using CNNs and Transformers

被引:0
作者
Vieira, Pablo de Abreu [1 ,2 ]
Mathew, Mano Joseph [2 ]
Neto, Pedro de Alcantara dos Santos [1 ]
Veloso e Silva, Romuere Rodrigues [1 ]
机构
[1] Fed Univ Piaui UFPI, Dept Comp, BR-64049550 Teresina, Piaui, Brazil
[2] EFREI, Ecole Ingn Gen Numerique, EFREI Res Lab, F-75003 Paris, France
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期
关键词
polydactyly; X-ray; generative artificial intelligence; CONVOLUTIONAL NEURAL-NETWORKS; FOOT; CLASSIFICATION; RADIOGRAPHS;
D O I
10.3390/app14156566
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Pododactyl radiography is a non-invasive procedure that enables the detection of foot pathologies, as it provides detailed images of structures such as the metatarsus and phalanges, among others. This examination holds potential for employment in CAD systems. Our proposed methodology employs generative artificial intelligence to analyze pododactyl radiographs and generate automatic medical reports. We used a dataset comprising 16,710 exams, including images and medical reports on pododactylys. We implemented preprocessing of the images and text, as well as data augmentation techniques to improve the representativeness of the dataset. The proposed CAD system integrates pre-trained CNNs for feature extraction from the images and Transformers for report interpretation and generation. Our objective is to provide reports describing pododactyl pathologies, such as plantar fasciitis, bunions, heel spurs, flat feet, and lesions, among others, offering a second opinion to the specialist. The results are promising, with BLEU scores (1 to 4) of 0.612, 0.552, 0.507, and 0.470, respectively, a METEOR score of 0.471, and a ROUGE-L score of 0.633, demonstrating the model's ability to generate reports with qualities close to those produced by specialists. We demonstrate that generative AI trained with pododactyl radiographs has the potential to assist in diagnoses from these examinations.
引用
收藏
页数:26
相关论文
共 69 条
  • [1] Administration D, 2019, Medical X-ray Imaging
  • [2] Bergmann J N, 1990, Clin Podiatr Med Surg, V7, P243
  • [3] Brown TB, 2020, ADV NEUR IN, V33
  • [4] From Image to Text Classification: A Novel Approach based on Clustering Word Embeddings
    Butnaru, Andrei M.
    Ionescu, Radu Tudor
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 1783 - 1792
  • [5] Deep learning for chest X-ray analysis: A survey
    calli, Erdi
    Sogancioglu, Ecem
    van Ginneken, Bram
    van Leeuwen, Kicky G.
    Murphy, Keelin
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 72 (72)
  • [6] A review on lung boundary detection in chest X-rays
    Candemir, Sema
    Antani, Sameer
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (04) : 563 - 576
  • [7] Cao YM, 2023, AAAI CONF ARTIF INTE, P277
  • [8] The relationship of static foot structure to dynamic foot function
    Cavanagh, PR
    Morag, E
    Boulton, AJM
    Young, MJ
    Deffner, KT
    Pammer, SE
    [J]. JOURNAL OF BIOMECHANICS, 1997, 30 (03) : 243 - 250
  • [9] A review of medical image data augmentation techniques for deep learning applications
    Chlap, Phillip
    Min, Hang
    Vandenberg, Nym
    Dowling, Jason
    Holloway, Lois
    Haworth, Annette
    [J]. JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2021, 65 (05) : 545 - 563
  • [10] Deep learning approach for disease detection in lumbosacral spine radiographs using ConvNet
    de Abreu Vieira, Pablo
    Vogado, Luis
    Lopes, Lucas
    Ricardo, Ricardo
    Santos Neto, Pedro
    Mathew, Mano Joseph
    Magalhaes, Deborah
    Silva, Romuere
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (06) : 2560 - 2575