Segmentation of 2D cardiac ultrasound with deep learning: simpler models for a simple task

Cited by: 1
Authors
Chernyshov, Artem [1]
Ostvik, Andreas [2]
Smistad, Erik [2]
Lovstakken, Lasse [2]
Affiliations
[1] Norwegian Univ Sci & Technol, ProCardio Ctr Innovat, Trondheim, Norway
[2] Norwegian Univ Sci & Technol, Ctr Innovat Ultrasound Solut, SINTEF Med Technol, Trondheim, Norway
Source
2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS) | 2022
Keywords
Deep learning; echocardiography; segmentation
DOI
10.1109/IUS54386.2022.9957618
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Low-complexity convolutional neural networks have been shown to be sufficient for segmentation of cardiac ultrasound (US) images in apical two-chamber (A2C) and apical four-chamber (A4C) views. The performance of 24 implementations of U-Net and DeepLabV3+ (popular segmentation architectures) of varying complexity was tested on cardiac US data (CAMUS data set) and street view data (Cityscapes data set). The inference speed of the models was also measured before and after post-training optimization. The models differed systematically in their structural components: the number of layers, the number of convolutional filters, and the receptive field size. All models were trained to maximize the Dice Coefficient. The Dice Coefficient was consistently high (0.86-0.90) on CAMUS data and low (0.48-0.67) on Cityscapes data for all models. Each ten-fold reduction in the number of model parameters tended to reduce the score by approximately 0.01 on CAMUS and by 0.03-0.05 on Cityscapes. Likewise, low-parameter models, especially the ones based on U-Net, yielded predictions with higher (worse) Hausdorff Distance values. Increasing the receptive field size of the models partially mitigated this effect. Without post-training optimization, the inference speed varied mostly with the number of layers in the networks. The least complex U-Net model was 83% faster than the most complex one; for the DeepLab models the difference was 53%. With post-training optimization, any reduction in the number of parameters led to increased speed: up to more than 700% for both architecture types.
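The two quality metrics named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' evaluation code: it assumes binary masks given as nested lists, uses the hard (non-differentiable) Dice Coefficient rather than whatever training loss the paper used, and computes the Hausdorff Distance over all foreground pixels in pixel units, whereas a real evaluation may use contours and physical spacing. The function names `foreground`, `dice_coefficient`, and `hausdorff_distance` are hypothetical.

```python
from math import dist


def foreground(mask):
    """Return the set of (row, col) coordinates where the mask is non-zero."""
    return {(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v}


def dice_coefficient(pred, target):
    """Dice = 2 * |P intersect T| / (|P| + |T|); 1.0 is a perfect overlap."""
    p, t = foreground(pred), foreground(target)
    if not p and not t:
        return 1.0  # assumption: two empty masks count as a perfect match
    return 2 * len(p & t) / (len(p) + len(t))


def hausdorff_distance(pred, target):
    """Symmetric Hausdorff distance in pixels; lower is better."""
    p, t = foreground(pred), foreground(target)
    if not p or not t:
        return float("inf")  # assumption: undefined case reported as infinity

    def directed(a, b):
        # Largest distance from any point in a to its nearest point in b.
        return max(min(dist(x, y) for y in b) for x in a)

    return max(directed(p, t), directed(t, p))


# Toy example: the prediction misses one column of the target region.
pred = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
target = [
    [0, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]
print(dice_coefficient(pred, target))    # 4 shared pixels, 4 + 6 total
print(hausdorff_distance(pred, target))  # missed column is 1 pixel away
```

The brute-force distance search is quadratic in the number of foreground pixels, which is acceptable for a toy mask but would likely need a spatial index or a library routine for full-resolution images.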
Pages: 4
Related papers
4 in total
  • [1] Chen LC, 2018, Arxiv, DOI [arXiv:1802.02611, DOI 10.48550/ARXIV.1802.02611]
  • [2] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [3] Deep Learning for Segmentation Using an Open Large-Scale Dataset in 2D Echocardiography
    Leclerc, Sarah
    Smistad, Erik
    Pedrosa, Joao
    Ostvik, Andreas
    Cervenansky, Frederic
    Espinosa, Florian
    Espeland, Torvald
    Berg, Erik Andreas Rye
    Jodoin, Pierre-Marc
    Grenier, Thomas
    Lartizien, Carole
    D'hooge, Jan
    Lovstakken, Lasse
    Bernard, Olivier
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (09) : 2198 - 2210
  • [4] Ronneberger O, 2015, Arxiv, DOI [arXiv:1505.04597, DOI 10.48550/ARXIV.1505.04597]