CephaNN: A Multi-Head Attention Network for Cephalometric Landmark Detection

被引:27
作者
Qian, Jiahong [1 ]
Luo, Weizhi [1 ]
Cheng, Ming [1 ]
Tao, Yubo [1 ,2 ]
Lin, Jun [3 ]
Lin, Hai [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Innovat Ctr Minimally Invas Tech & Device, Hangzhou 310058, Peoples R China
[3] Zhejiang Univ, Coll Med, Affiliated Hosp 1, Dept Stomatol, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Heating systems; Neural networks; Kernel; Feature extraction; Annotations; Two dimensional displays; Deep learning; Cephalometric landmark detection; multi-head attention; neural network; intermediate supervision; region enhance;
D O I
10.1109/ACCESS.2020.3002939
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cephalometric landmark detection is a crucial step in orthodontic and orthognathic treatments. To detect cephalometric landmarks accurately, we propose a novel multi-head attention neural network (CephaNN). CephaNN is an end-to-end network based on the heatmaps of annotated landmarks, and it consists of two parts, the multi-head part and the attention part. In the multi-head part, we adopt multi-head subnets to gain comprehensive knowledge of various subspaces of a cephalogram. The intermediate supervision is applied to accelerate the convergence. Based on the feature maps learned from the multi-head Part, the attention part applies the multi-attention mechanism to obtain a refined detection. For solving the class imbalance problem, we propose a region enhancing (RE) loss, to enhance the efficient regions on the regressed heatmaps. Experiments in the benchmark dataset demonstrate that CephaNN is state-of-the-art with the detection accuracy of 87.61% in the clinically accepted 2.0-mm range. Furthermore, CephaNN is efficient in classifying the anatomical types and robust in a real application on a 75-landmark dataset.
引用
收藏
页码:112633 / 112641
页数:9
相关论文
共 20 条
[1]  
[Anonymous], 2017, P IEEE INT C COMPUTE
[2]   Fully automated quantitative cephalometry using convolutional neural networks [J].
Arik S.Ö. ;
Ibragimov B. ;
Xing L. .
Journal of Medical Imaging, 2017, 4 (01)
[3]   Cephalometric Landmark Detection by Attentive Feature Pyramid Fusion and Regression-Voting [J].
Chen, Runnan ;
Ma, Yuexin ;
Chen, Nenglun ;
Lee, Daniel ;
Wang, Wenping .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT III, 2019, 11766 :873-881
[4]   Cascaded Pyramid Network for Multi-Person Pose Estimation [J].
Chen, Yilun ;
Wang, Zhicheng ;
Peng, Yuxiang ;
Zhang, Zhiqiang ;
Yu, Gang ;
Sun, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112
[5]   Locating Anatomical Landmarks on 2D Lateral Cephalograms Through Adversarial Encoder-Decoder Networks [J].
Dai, Xiubin ;
Zhao, Hao ;
Liu, Tianliang ;
Cao, Dan ;
Xie, Lizhe .
IEEE ACCESS, 2019, 7 :132738-132747
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Ibragimov B., 2015, P INT S BIOM IM ISBI
[9]  
King DB, 2015, ACS SYM SER, V1214, P1
[10]   Cephalometric Landmark Detection in Dental X-ray Images Using Convolutional Neural Networks [J].
Lee, Hansang ;
Park, Minseok ;
Kim, Junmo .
MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134