Global field of view-based pixel-level recognition method for medical images

Citations: 5
Authors
He, Keke [1]
Tang, Haojun [2]
Gou, Fangfang [3]
Wu, Jia [2, 4]
Affiliations
[1] Changsha Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[3] Guizhou Univ, Coll Comp Sci & Technol, State Key Lab Publ Big Data, Guiyang, Peoples R China
[4] Monash Univ, Res Ctr Artificial Intelligence, Melbourne, Australia
Keywords
Tumor recognition; image analysis; attention; companion diagnostics; global view; OSTEOSARCOMA SEGMENTATION; NETWORK
DOI
10.3233/JIFS-231053
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Artificial intelligence-based image processing has attracted research interest for tumor identification and diagnosis. Magnetic resonance imaging (MRI) is the technique of choice for clinical tumor detection because of advantages such as accurate localization with tomography in any orientation. Nevertheless, owing to the complexity of the images and the heterogeneity of tumors, existing methods have an insufficient field of view and require expensive computation to capture semantic information, which limits their general applicability. This work therefore develops a medical image segmentation algorithm based on a global field of view attention network (GVANet). It replaces the original convolution with a transformer structure and operates over a larger field-of-view domain to build a global view at each layer, capturing refined pixel information and category information in the region of interest with fewer parameters, so as to address poor segmentation of tumor edges. The method exploits the pixel-level information of the input image and the category information of the tumor region and the normal tissue region to segment the MRI image and assign weights to the pixel representations. The algorithm can handle the segmentation of ambiguous tumor edges with low computational complexity while maximizing segmentation accuracy and model performance. Nearly four thousand MRI images from the Monash University Research Center for Artificial Intelligence were used in the experiments. The results indicate that the approach achieves outstanding classification capability on the data set: mask IoU and DSC improved by 7.6% and 6.3%, respectively, over the strong baseline.
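The abstract reports results as mask IoU (intersection over union) and DSC (Dice similarity coefficient). As a minimal sketch of how these two overlap metrics are commonly computed for binary segmentation masks, the following may help. The function name `iou_and_dice`, the nested-list mask format, and the convention of returning 1.0 for two empty masks are assumptions made for illustration; they are not taken from the paper.

```python
def iou_and_dice(pred, target):
    """Return (IoU, DSC) for two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration only; not the authors' evaluation code.
    """
    # Flatten both masks into lists of booleans.
    p = [bool(v) for row in pred for v in row]
    t = [bool(v) for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must contain the same number of pixels")

    intersection = sum(a and b for a, b in zip(p, t))
    pred_area = sum(p)
    target_area = sum(t)
    union = pred_area + target_area - intersection

    # Assumed convention: two empty masks count as a perfect match.
    if union == 0:
        return 1.0, 1.0

    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice


pred_mask = [[1, 1, 0],
             [0, 1, 0]]
true_mask = [[1, 0, 0],
             [0, 1, 1]]
iou, dice = iou_and_dice(pred_mask, true_mask)
print(f"IoU={iou:.3f}, DSC={dice:.3f}")  # IoU=0.500, DSC=0.667
```

For a single pair of masks the two metrics are related by DSC = 2·IoU / (1 + IoU), so they rank predictions identically, although averaged improvements (such as the 7.6% and 6.3% reported above) need not match.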
Pages: 4009-4021
Page count: 13
Related papers
22 in total
  • [1] Encoder-decoder with pyramid region attention for pixel-level pavement crack recognition
    Yao, Hui
    Liu, Yanhao
    Lv, Haotian
    Huyan, Ju
    You, Zhanping
    Hou, Yue
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1490 - 1506
  • [2] View-aligned pixel-level feature aggregation for 3D shape classification
    Xu, Yong
    Pan, Shaohui
    Xu, Ruotao
    Ling, Haibin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [3] Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection
    Feng, Mingtao
    Liu, Kendong
    Zhang, Liang
    Yu, Hongshan
    Wang, Yaonan
    Mian, Ajmal
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1746 - 1756
  • [4] Pixel-Level Analysis for Enhancing Threat Detection in Large-Scale X-ray Security Images
    Dumagpi, Joanna Kazzandra
    Jeong, Yong-Jin
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [5] Pixel-Level Grasp Detection based on EfficientNet and Multi-scale Feature Fusion Network
    Gao, Junli
    Luo, Yinming
    Huang, Xianxin
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, CIS AND IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, RAM, CIS-RAM 2024, 2024, : 486 - 491
  • [6] View-Based Knowledge-Augmented Multimodal Semantic Understanding for Optical Remote Sensing Images
    Zhu, Lilu
    Su, Xiaolu
    Tang, Jiaxuan
    Hu, Yanfeng
    Wang, Yang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [7] Method for the automatic recognition of cropland headland images based on deep learning
    Qiao, Yujie
    Liu, Hui
    Meng, Zhijun
    Chen, Jingping
    Ma, Luyao
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND BIOLOGICAL ENGINEERING, 2023, 16 (02) : 216 - 224
  • [8] Multi-view Isolated sign language recognition based on cross-view and multi-level transformer
    Guan, Zhong
    Hu, Yongli
    Jiang, Huajie
    Sun, Yanfeng
    Yin, Baocai
    MULTIMEDIA SYSTEMS, 2025, 31 (03)
  • [9] Research on an Optimal Path Planning Method Based on A* Algorithm for Multi-View Recognition
    Li, Xinning
    He, Qun
    Yang, Qin
    Wang, Neng
    Wu, Hu
    Yang, Xianhai
    ALGORITHMS, 2022, 15 (05)
  • [10] Constructing global view with an ontology-based method for information sharing in the virtual organization
    张英朝
    张维明
    肖卫东
    黄金才
    沙基昌
    Journal of Systems Engineering and Electronics, 2005, (03) : 566 - 573