Towards Better Cephalometric Landmark Detection With Diffusion Data Generation

被引：0

作者：

Guo, Dongqian ^{[1
]}

Han, Wencheng ^{[1
]}

Lyu, Pang ^{[2
]}

Zhou, Yuxi ^{[3
,4
]}

Shen, Jianbing ^{[1
]}

机构：

[1] Univ Macau, Dept Comp & Informat Sci, State Key Lab Internet Things Smart City, Macau, Peoples R China

[2] Fudan Univ, Zhongshan Hosp, Dept Orthopaed Surg, Shanghai 200032, Peoples R China

[3] Justus Liebig Univ Giessen, Dept Periodontol, D-35390 Giessen, Germany

[4] Guangzhou Med Univ, Stomatol Hosp, Dept Periodont, Guangzhou 511436, Peoples R China

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2025年 / 44卷 / 07期

基金：

上海市科技启明星计划;

关键词：

X-ray imaging; Data models; Annotations; Training; Generators; Topology; Image synthesis; Pipelines; Medical diagnostic imaging; Data collection; Landmark detection; cephalometric X-ray; diffusion; anatomy-informed topology; data generation;

D O I：

10.1109/TMI.2025.3557430

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Cephalometric landmark detection is essential for orthodontic diagnostics and treatment planning. Nevertheless, the scarcity of samples in data collection and the extensive effort required for manual annotation have significantly impeded the availability of diverse datasets. This limitation has restricted the effectiveness of deep learning-based detection methods, particularly those based on large-scale vision models. To address these challenges, we have developed an innovative data generation method capable of producing diverse cephalometric X-ray images along with corresponding annotations without human intervention. To achieve this, our approach initiates by constructing new cephalometric landmark annotations using anatomical priors. Then, we employ a diffusion-based generator to create realistic X-ray images that correspond closely with these annotations. To achieve precise control in producing samples with different attributes, we introduce a novel prompt cephalometric X-ray image dataset. This dataset includes real cephalometric X-ray images and detailed medical text prompts describing the images. By leveraging these detailed prompts, our method improves the generation process to control different styles and attributes. Facilitated by the large, diverse generated data, we introduce large-scale vision detection models into the cephalometric landmark detection task to improve accuracy. Experimental results demonstrate that training with the generated data substantially enhances the performance. Compared to methods without using the generated data, our approach improves the Success Detection Rate (SDR) by 6.5%, attaining a notable 82.2%. All code and data are available at: https://um-lab.github.io/cepha-generation/

引用

页码：2784 / 2794

页数：11

共 49 条

[1] Synthetic data in machine learning for medicine and healthcare [J].

Chen, Richard J. ;

Lu, Ming Y. ;

Chen, Tiffany Y. ;

Williamson, Drew F. K. ;

Mahmood, Faisal .

NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) :493-497

[2] Cephalometric Landmark Detection by Attentive Feature Pyramid Fusion and Regression-Voting [J].

Chen, Runnan ;

Ma, Yuexin ;

Chen, Nenglun ;

Lee, Daniel ;

Wang, Wenping .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT III, 2019, 11766 :873-881

[3]

Ching -Wei W., 2023, Cephalometric Landmark Detection in Lateral X-ray Images 2023

[4]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[5] Data augmentation for medical imaging: A systematic literature review [J].

Garcea, Fabio ;

Serra, Alessio ;

Lamberti, Fabrizio ;

Morra, Lia .

COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152

[6]

Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8] Diverse Data Generation for Retinal Layer Segmentation With Potential Structure Modeling [J].

Huang, Kun ;

Ma, Xiao ;

Zhang, Zetian ;

Zhang, Yuhan ;

Yuan, Songtao ;

Fu, Huazhu ;

Chen, Qiang .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (10) :3584-3595

[9] Cross-Modality Image Synthesis via Weakly Coupled and Geometry Co-Regularized Joint Dictionary Learning [J].

Huang, Yawen ;

Shao, Ling ;

Frangi, Alejandro F. .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (03) :815-827

[10]

Ibragimov B., 2014, P IEEE 11 INT S BIOM, P1

← 1 2 3 4 5 →