Segmentation of 2D cardiac ultrasound with deep learning: simpler models for a simple task

被引：1

作者：

Chernyshov, Artem ^{[1
]}

Ostvik, Andreas ^{[2
]}

Smistad, Erik ^{[2
]}

Lovstakken, Lasse ^{[2
]}

机构：

[1] Norwegian Univ Sci & Technol, ProCardio Ctr Innovat, Trondheim, Norway

[2] Norwegian Univ Sci & Technol, Ctr Innovat Ultrasound Solut, SINTEF Med Technol, Trondheim, Norway

来源：

2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS) | 2022年

关键词：

Deep learning; echocardiography; segmentation;

D O I：

10.1109/IUS54386.2022.9957618

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Low-complexity convolutional neural networks have been shown to be sufficient for segmentation of cardiac US images in A2C and A4C views. The performance of 24 varying-complexity implementations of U-Net and DeepLabV3+ (popular segmentation architectures) has been tested on cardiac US data (CAMUS data set) and street view data (Cityscapes data set). The inference speed of the models has also been measured before and after post-training optimization. The models systematically differed in their structural components: the number of layers and convolutional filters as well as the receptive field size. All models trained to maximize the Dice Coefficient. The Dice Coefficient was consistently high (0.86-0.90) on CAMUS data and low (0.48-0.67) on Cityscapes data for all models. Each ten-fold reduction in the number of model parameters tended to reduce the score by approximate to 0.01 on CAMUS and by 0.030.05 on Cityscapes. Likewise, low-parameter models, especially the ones based on U-Net, had yielded predictions with higher (worse) Hausdorff Distance values. Increasing the receptive field size of the models partially mitigated this effect. Without post-training optimization, the inference speed mostly varied with the number of layers in the networks. The least complex U-Net model was 83% faster than the most complex one; for the DeepLab models the difference was 53%. With post-training optimization, any reduction in the number of parameters led to increased speed: up to more than 700% for both architecture types.

引用

页数：4

共 4 条

[1] Chen LC, 2018, Arxiv, DOI [arXiv:1802.02611, DOI 10.18550/ARXIV.1802.02611]
[2] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[3] Deep Learning for Segmentation Using an Open Large-Scale Dataset in 2D Echocardiography
Leclerc, Sarah
Smistad, Erik
Pedrosa, Joao
Ostvik, Andreas
Cervenansky, Frederic
Espinosa, Florian
Espeland, Torvald
Berg, Erik Andreas Rye
Jodoin, Pierre-Marc
Grenier, Thomas
Lartizien, Carole
D'hooge, Jan
Lovstakken, Lasse
Bernard, Olivier
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (09) : 2198 - 2210
[4] Ronneberger O, 2015, Arxiv, DOI [arXiv:1505.04597, DOI 10.48550/ARXIV.1505.04597]

← 1 →