Cardiac Phase Detection in Echocardiograms With Densely Gated Recurrent Neural Networks and Global Extrema Loss

被引:40
作者
Dezaki, Fatemeh Taheri [1 ]
Liao, Zhibin [1 ]
Luong, Christina [2 ]
Girgis, Hany [2 ]
Dhungel, Neeraj [1 ]
Abdi, Amir H. [1 ]
Behnami, Delaram [1 ]
Gin, Ken [2 ]
Rohling, Robert [1 ,3 ]
Abolmaesumi, Purang [1 ]
Tsang, Teresa [2 ,4 ,5 ]
机构
[1] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[2] Univ British Columbia, Dept Med, Div Cardiol, Vancouver Gen Hosp,Echocardiog Lab, Vancouver, BC V5Z 1M9, Canada
[3] Univ British Columbia, Dept Mech Engn, Vancouver, BC V6T 1Z4, Canada
[4] Univ British Columbia, Vancouver Gen Hosp, Vancouver, BC V5Z 1M9, Canada
[5] Univ British Columbia, Echocardiog Labs, Vancouver, BC V5Z 1M9, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
Deep residual neural networks; densely-connected networks; recurrent neural networks; long short term memory; gated recurrent unit; bi-directional RNN; echocardiography; cardiac cycle phase detection; AUTOMATIC DETECTION; QUANTIFICATION;
D O I
10.1109/TMI.2018.2888807
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Accurate detection of end-systolic (ES) and end-diastolic (ED) frames in an echocardiographic cine series can be difficult but necessary pre-processing step for the development of automatic systems to measure cardiac parameters. The detection task is challenging due to variations in cardiac anatomy and heart rate often associated with pathological conditions. We formulate this problem as a regression problem and propose several deep learning-based architectures that minimize a novel global extrema structured loss function to localize the ED and ES frames. The proposed architectures integrate convolution neural networks (CNNs)-based image feature extraction model and recurrent neural networks (RNNs) to model temporal dependencies between each frame in a sequence. We explore two CNN architectures: DenseNet and ResNet, and four RNN architectures: long short-term memory, bi-directional LSTM, gated recurrent unit (GRU), and Bi-GRU, and compare the performance of these models. The optimal deep learning model consists of a DenseNet and GRU trained with the proposed loss function. On average, we achieved 0.20 and 1.43 frame mismatch for the ED and ES frames, respectively, which are within reported inter-observer variability for the manual detection of these frames.
引用
收藏
页码:1821 / 1832
页数:12
相关论文
共 36 条
[1]   Echocardiography without electrocardiogram [J].
Aase, Svein Arne ;
Snare, Sten Roar ;
Dalen, Havard ;
Stoylen, Asbjorn ;
Orderud, Fredrik ;
Torp, Hans .
EUROPEAN JOURNAL OF ECHOCARDIOGRAPHY, 2011, 12 (01) :3-10
[2]  
Abboud Anas A., 2015, Journal of Computer Science, V11, P230, DOI 10.3844/jcssp.2015.230.240
[3]  
[Anonymous], 2017, CORR
[4]  
[Anonymous], PROC CVPR IEEE
[5]  
[Anonymous], 1995, HDB BRAIN THEORY NEU
[6]  
[Anonymous], ADV NEURAL INFORM PR
[7]  
[Anonymous], 2014, WORKSHOP SYNTAX SEMA
[8]   Automatic computation of left ventricle ejection fraction from dynamic ultrasound images [J].
Barcaro U. ;
Moroni D. ;
Salvetti O. .
Pattern Recogn. Image Anal., 2008, 2 (351-358) :351-358
[9]  
Bengio S, 2015, ADV NEUR IN, V28
[10]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166