A multimodal deep learning model for detecting endoscopic images of near-infrared fluorescence capsules

被引:0
作者
Wang, Junhao [1 ,2 ]
Zhou, Cheng [1 ]
Wang, Wei [1 ]
Zhang, Hanxiao [2 ]
Zhang, Amin [4 ]
Cui, Daxiang [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst & Med Robot, Shanghai 200240, Peoples R China
[3] Henan Univ, Med & Engn Cross Res Inst, Sch Med, Kaifeng 475004, Peoples R China
[4] Shanghai Jiao Tong Univ, Sch Agr & Biol, Dept Food Sci & Technol, Shanghai 200240, Peoples R China
基金
国家自然科学基金国际合作与交流项目; 上海市自然科学基金; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Fluorescence endoscopy; Multimodal deep learning; Disease detection; CANCER;
D O I
10.1016/j.bios.2025.117251
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Early screening for gastrointestinal (GI) diseases is critical for preventing cancer development. With the rapid advancement of deep learning technology, artificial intelligence (AI) has become increasingly prominent in the early detection of GI diseases. Capsule endoscopy is a non-invasive medical imaging technique used to examine the gastrointestinal tract. In our previous work, we developed a near-infrared fluorescence capsule endoscope (NIRF-CE) capable of exciting and capturing near-infrared (NIR) fluorescence images to specifically identify subtle mucosal microlesions and submucosal abnormalities while simultaneously capturing conventional white- light images to detect lesions with significant morphological changes. However, limitations such as low camera resolution and poor lighting within the gastrointestinal tract may lead to misdiagnosis and other medical errors. Manually reviewing and interpreting large volumes of capsule endoscopy images is time-consuming and prone to errors. Deep learning models have shown potential in automatically detecting abnormalities in NIRF-CE images. This study focuses on an improved deep learning model called Retinex-Attention-YOLO (RAY), which is based on single-modality image data and built on the YOLO series of object detection models. RAY enhances the accuracy and efficiency of anomaly detection, especially under low-light conditions. To further improve detection performance, we also propose a multimodal deep learning model, Multimodal-Retinex-Attention-YOLO (MRAY), which combines both white-light and fluorescence image data. The dataset used in this study consists of images of pig stomachs captured by our NIRF-CE system, simulating the human GI tract. In conjunction with a targeted fluorescent probe, which accumulates at lesion sites and releases fluorescent signals for imaging when abnormalities are present, a bright spot indicates a lesion. The MRAY model achieved an impressive precision of 96.3%, outperforming similar object detection models. To further validate the model's performance, ablation experiments were conducted, and comparisons were made with publicly available datasets. MRAY shows great promise for the automated detection of GI cancers, ulcers, inflammations, and other medical conditions in clinical practice.
引用
收藏
页数:12
相关论文
共 47 条
  • [1] Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy ?
    Ali, Sharib
    Dmitrieva, Mariia
    Ghatwary, Noha
    Bano, Sophia
    Polat, Gorkem
    Temizel, Alptekin
    Krenzer, Adrian
    Hekalo, Amar
    Guo, Yun Bo
    Matuszewski, Bogdan
    Gridach, Mourad
    Voiculescu, Irina
    Yoganand, Vishnusai
    Chavan, Arnav
    Raj, Aryan
    Nguyen, Nhan T.
    Tran, Dat Q.
    Huynh, Le Duy
    Boutry, Nicolas
    Rezvy, Shahadate
    Chen, Haijian
    Choi, Yoon Ho
    Subramanian, Anand
    Balasubramanian, Velmurugan
    Gao, Xiaohong W.
    Hu, Hongyu
    Liao, Yusheng
    Stoyanov, Danail
    Daul, Christian
    Realdon, Stefano
    Cannizzaro, Renato
    Lamarque, Dominique
    Tran-Nguyen, Terry
    Bailey, Adam
    Braden, Barbara
    East, James E.
    Rittscher, Jens
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 70 (70)
  • [2] EndoL2H: Deep Super-Resolution for Capsule Endoscopy
    Almalioglu, Yasin
    Bengisu Ozyoruk, Kutsev
    Gokce, Abdulkadir
    Incetan, Kagan
    Irem Gokceler, Guliz
    Ali Simsek, Muhammed
    Ararat, Kivanc
    Chen, Richard J.
    Durr, Nicholas J.
    Mahmood, Faisal
    Turan, Mehmet
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) : 4297 - 4309
  • [3] Araujo Andre, 2019, Distill
  • [4] Bing Ma, 2019, Artificial Intelligence and Security. 5th International Conference, ICAIS 2019. Proceedings: Lecture Notes in Computer Science (LNCS 11632), P227, DOI 10.1007/978-3-030-24274-9_20
  • [5] Gastric polyp detection in gastroscopic images using deep neural network
    Cao, Chanting
    Wang, Ruilin
    Yu, Yao
    Zhang, Hui
    Yu, Ying
    Sun, Changyin
    [J]. PLOS ONE, 2021, 16 (04):
  • [6] Recent Progress in NIR-II Contrast Agent for Biological Imaging
    Cao, Jie
    Zhu, Binling
    Zheng, Kefang
    He, Songguo
    Meng, Liang
    Song, Jibin
    Yang, Huanghao
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 7
  • [7] Near-infrared luminescence high-contrast in vivo biomedical imaging
    Chen, Ying
    Wang, Shangfeng
    Zhang, Fan
    [J]. NATURE REVIEWS BIOENGINEERING, 2023, 1 (01): : 60 - 78
  • [8] 基于纳米技术的胃癌预警与早期诊疗系统
    崔大祥
    [J]. 上海交通大学学报, 2018, (10) : 1396 - 1403
  • [9] Classification of Lung Sounds With CNN Model Using Parallel Pooling Structure
    Demir, Fatih
    Ismael, Aras Masood
    Sengur, Abdulkadir
    [J]. IEEE ACCESS, 2020, 8 : 105376 - 105383
  • [10] A new medical image enhancement algorithm using adaptive parameters
    Dinh, Phu-Hung
    Giang, Nguyen Long
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (06) : 2198 - 2218