CacheTrack-YOLO: Real-Time Detection and Tracking for Thyroid Nodules and Surrounding Tissues in Ultrasound Videos

被引:32
作者
Wu, Xiangqiong [1 ]
Tan, Guanghua [1 ]
Zhu, Ningbo [1 ]
Chen, Zhilun [1 ]
Yang, Yan [2 ,3 ]
Wen, Huaxuan [4 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Wenzhou Med Univ, Affiliated Hosp 2, Dept Ultrason Diag, Wenzhou 325088, Peoples R China
[3] Wenzhou Med Univ, Yuying Childrens Hosp, Wenzhou 325088, Peoples R China
[4] Southern Med Univ sity, Affiliated Shenzhen Maternal & Child Healthcare, Dept Ultrasound, Shenzhen 518047, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Thyroid; Ultrasonic imaging; Videos; Solid modeling; Feature extraction; Image segmentation; Cancer; Deep learning; thyroid nodules; computer-aided diagnosis; ultrasound videos; detection and tracking; CONVOLUTIONAL NEURAL-NETWORKS; CLASSIFICATION; IMAGES;
D O I
10.1109/JBHI.2021.3084962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To accurately detect and track the thyroid nodules in a video is a crucial step in the thyroid screening for identification of benign and malignant nodules in computer-aided diagnosis (CAD) systems. Most existing methods just perform excellent on static frames selected manually from ultrasound videos. However, manual acquisition is labor-intensive work. To make the thyroid screening process in a more natural way with less labor operations, we develop a well-designed framework suitable for practical applications for thyroid nodule detection in ultrasound videos. Particularly, in order to make full use of the characteristics of thyroid videos, we propose a novel post-processing approach, called Cache-Track, which exploits the contextual relation among video frames to propagate the detection results into adjacent frames to refine the detection results. Additionally, our method can not only detect and count thyroid nodules, but also track and monitor surrounding tissues, which can greatly reduce the labor work and achieve computer-aided diagnosis. Experimental results show that our method performs better in balancing accuracy and speed.
引用
收藏
页码:3812 / 3823
页数:12
相关论文
共 58 条
[1]   Tracking without bells and whistles [J].
Bergmann, Philipp ;
Meinhardt, Tim ;
Leal-Taixe, Laura .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :941-951
[2]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[3]  
Bochkovskiy A., 2020, PREPRINT
[4]   Data-Driven Analysis of Radiologists Behavior for Diagnosing Thyroid Nodules [J].
Chang, Leilei ;
Fu, Chao ;
Wu, Zijian ;
Liu, Weiyong ;
Yang, Shanlin .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (11) :3111-3123
[5]   Thyroid Nodule Classification in Ultrasound Images by Fine-Tuning Deep Convolutional Neural Network [J].
Chi, Jianning ;
Walia, Ekta ;
Babyn, Paul ;
Wang, Jimmy ;
Groot, Gary ;
Eramian, Mark .
JOURNAL OF DIGITAL IMAGING, 2017, 30 (04) :477-486
[6]  
Deng J., 2020, ARXIVABS190408900
[7]   A Deep Learning Framework for Single-Sided Sound Speed Inversion in Medical Ultrasound [J].
Feigin, Micha ;
Freedman, Daniel ;
Anthony, Brian W. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (04) :1142-1151
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]  
Han W, 2016, ARXIV160208465
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778