Global field of view-based pixel-level recognition method for medical images

Cited by: 5

Authors:

He, Keke [1]

Tang, Haojun [2]

Gou, Fangfang [3]

Wu, Jia [2,4]

Affiliations:

[1] Changsha Univ, Sch Comp Sci & Engn, Changsha, Peoples R China

[2] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China

[3] Guizhou Univ, Coll Comp Sci & Technol, State Key Lab Publ Big Data, Guiyang, Peoples R China

[4] Monash Univ, Res Ctr Artificial Intelligence, Melbourne, Australia

Source:

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2023 / Vol. 45 / No. 03

Keywords:

Tumor recognition; image analysis; attention; companion diagnostics; global view; OSTEOSARCOMA SEGMENTATION; NETWORK;

DOI:

10.3233/JIFS-231053

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Artificial intelligence-based image processing has attracted research interest for tumor identification and diagnosis. Magnetic resonance imaging (MRI) is the technique of choice for clinical tumor detection because of advantages such as accurate localization with tomography in any orientation. Nevertheless, owing to the complexity of the images and the heterogeneity of tumors, existing methods have an insufficient field of view and require expensive computation to capture semantic information within it, which limits their general applicability. This paper therefore develops a medical image segmentation algorithm based on a global field of view attention network (GVANet). It replaces the original convolution with a transformer structure and operates over a larger field-of-view domain to build a global view at each layer, capturing refined pixel information and category information in the region of interest with fewer parameters, so as to address poor segmentation of tumor edges. The method exploits the pixel-level information of the input image and the category information of the tumor and normal tissue regions to segment the MRI image and assign weights to the pixel representations. The algorithm can handle the ambiguous tumor edge segmentation task with low computational complexity while maximizing segmentation accuracy and model performance. Nearly four thousand MRI images from the Monash University Research Center for Artificial Intelligence were used in the experiments. The results indicate that the approach achieves outstanding classification capability on this data set: mask IoU and DSC improved by 7.6% and 6.3%, respectively, over the strong baseline.
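The abstract reports results with two overlap metrics, mask IoU (intersection over union) and DSC (Dice similarity coefficient). As a minimal sketch of how these are commonly defined for binary segmentation masks, assuming flattened 0/1 masks and a convention of returning 1.0 when both masks are empty, the two scores could be computed as follows. The function name `iou_and_dice` and the toy masks are hypothetical and are not taken from the paper, which may compute its metrics differently.

```python
from typing import Sequence, Tuple


def iou_and_dice(pred: Sequence[int], target: Sequence[int]) -> Tuple[float, float]:
    """Return (IoU, DSC) for two flattened binary masks of equal length."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels marked as tumor in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    if union == 0:
        # Both masks are empty; treating this as perfect agreement is an
        # assumed convention, not something stated in the abstract.
        return 1.0, 1.0

    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice


# Toy 2x3 masks flattened row by row (illustrative data only).
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
iou, dice = iou_and_dice(pred, target)
print(f"IoU = {iou:.3f}, DSC = {dice:.3f}")
```

For a single pair of masks the two scores are related by DSC = 2 * IoU / (1 + IoU), so they always rank that pair the same way. Gains averaged over a data set, such as the 7.6% and 6.3% reported above, need not follow that relation exactly.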

Pages: 4009-4021

Page count: 13

