MLANet: Multi-Layer Anchor-free Network for generic lesion detection

被引:5
作者
Liu, Zhe [1 ]
Xie, Xi [1 ]
Song, Yuqing [1 ]
Zhang, Yang [2 ]
Liu, Xuesheng [3 ,4 ]
Zhang, Jiawen [5 ,6 ]
Sheng, Victor S. [7 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212003, JS, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, 10 Xitucheng Rd, Bg 100875, Peoples R China
[3] Anhui Med Univ, Affiliated Hosp 1, Dept Anesthesiol, Ah 230022, Peoples R China
[4] Anhui Med Univ, Anhui Higher Educ Inst, Key Lab Anesthesiol & Perioperat Med, Ah 230022, Peoples R China
[5] Fudan Univ, Dept Radiol, Sh 200433, Peoples R China
[6] Huashan Hosp, Sh 200040, Peoples R China
[7] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Medical imaging; Lesion detection; Deep learning; Anchor-free detector;
D O I
10.1016/j.engappai.2021.104255
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In medical image processing, detecting lesions from computed tomography (CT) scans becomes an important research problem with increasing attention. However, this problem is nontrivial because lesions from different organs and parts reflect different characteristics as well as different sizes. Most conventional methods only use a single-scale architecture to detect lesion areas. To get rid of the drawbacks above in medical imaging, a multi-scale framework called MLANet is proposed. To deal with the scale imbalance problem, we design a new backbone & mdash;a mixed hourglass network, in which each hourglass module share different input sizes and orders to extract features from different scales. And then the information is sent to the proposed Strengthen Weighted Feature Pyramid Network (SWFPN), a multi-layer weighted feature fusion module, to combine more semantic and spatial information, especially for the case where the number of layers is small. Finally, a Center-to-Corner (C2C) transformation is proposed to deal with the inaccurate size prediction of lesions. It is a non-linear transformation function, aiming to make the predictions more stable and accurate. MLANet is an end-to-end network and is easy to train. In our experiment, it achieves 65.2% AP50, as well as 88.3% in the sensitivity of FPs@4.0 on the DeepLesion dataset, which exceeds many state-of-the-art detectors.
引用
收藏
页数:12
相关论文
共 34 条
[1]  
[Anonymous], 2016, P 29 IEEE C COMPUTER
[2]  
Borji A., 2019, ARXIV PREPRINT ARXIV
[3]   One stage lesion detection based on 3D context convolutional neural networks [J].
Cai, Guorong ;
Chen, Jinshan ;
Wu, Zebiao ;
Tang, Haoming ;
Liu, Yujun ;
Wang, Senyuan ;
Su, Songzhi .
COMPUTERS & ELECTRICAL ENGINEERING, 2019, 79
[4]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[5]  
Dai J., 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), V37, P9
[6]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[7]   NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].
Ghiasi, Golnaz ;
Lin, Tsung-Yi ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]  
He K., 2017, IEEE INT C COMPUT VI, P2961, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[10]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001