Towards Robust Object Detection in Unseen Catheterization Laboratories

被引：2

作者：

Wang, Zipeng ^{[1
]}

Butler, Rick ^{[1
]}

van den Dobbelsteen, John J. ^{[1
]}

Hendriks, Benno H. W. ^{[1
,2
]}

Van der Elst, Maarten ^{[1
,3
]}

Dauwels, Justin ^{[1
]}

机构：

[1] Delft Univ Technol, Delft, Netherlands

[2] Philips Res Labs, Eindhoven, Netherlands

[3] Reinier de Graaf Grp, Delft, Netherlands

来源：

2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024 | 2024年

关键词：

Object Detection; Catheterization Laboratory; Domain Shift; Clinical Workflow Analysis;

D O I：

10.1109/MEMEA60663.2024.10596906

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Deep learning-based object detectors, while offering exceptional performance, are data-dependent and can suffer from generalization issues. In this work, we investigated deep neural networks for detecting people and medical instruments for the vision-based workflow analysis system inside Catheterization Laboratories (Cath Labs). The central problem explored in this paper is the fact that the performance of the detector can degrade drastically if it is trained and tested on data from different Cath Labs. Our research aimed to investigate the underlying causes of this specific performance degradation and find solutions to mitigate this issue. We employed the YOLOv8 object detector and created datasets from clinical procedures recorded at Reinier de Graaf Hospital (RdGG) and Philips Best Campus, supplemented with publicly accessible images. Through a series of experiments complemented by data visualization, we discovered that the performance degradation primarily stems from data distribution shifts in the feature space. Notably, the object detector trained on non-sensitive online images can generalize to unseen Cath Labs, outperforming the model trained on a procedure recording from a different Cath Lab. The detector trained on the online images achieved an mAP@0.5 of 0.517 on the RdGG dataset. Furthermore, by switching to the most suitable camera for each object in the Cath Lab, the multi-camera system can further improve the detection performance significantly. An aggregated 1-camera mAP@0.5 of 0.679 is achieved for single-object classes on the RdGG dataset.

引用

页数：6

共 26 条

[1] Parsing human skeletons in an operating room [J].

Belagiannis, Vasileios ;

Wang, Xinchao ;

Ben Shitrit, Horesh Beny ;

Hashimoto, Kiyoshi ;

Stauder, Ralf ;

Aoki, Yoshimitsu ;

Kranzfelder, Michael ;

Schneider, Armin ;

Fua, Pascal ;

Ilic, Slobodan ;

Feussner, Hubertus ;

Navab, Nassir .

MACHINE VISION AND APPLICATIONS, 2016, 27 (07) :1035-1046

[2]

Cao TS, 2020, Arxiv, DOI [arXiv:2007.04250, DOI 10.48550/ARXIV.2007.04250]

[3] Radiation Safety Program for the Cardiac Catheterization Laboratory [J].

Chambers, Charles E. ;

Fetterly, Kenneth A. ;

Holzer, Ralf ;

Lin, Pei-Jan Paul ;

Blankenship, James C. ;

Balter, Stephen ;

Laskey, Warren K. .

CATHETERIZATION AND CARDIOVASCULAR INTERVENTIONS, 2011, 77 (04) :546-556

[4] A human fall detection framework based on multi-camera fusion [J].

Ezatzadeh, Shabnam ;

Keyvanpour, Mohammad Reza ;

Shojaedini, Seyed Vahab .

JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022, 34 (06) :905-924

[5]

Fan Q., 11 INT C LEARN REPR

[6] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[7]

Jocher G., 2023, YOLO by ultralytics

[8] Diagnostic cardiac catheterization [J].

Lange, RA ;

Hillis, LD .

CIRCULATION, 2003, 107 (17) :E111-E113

[9]

Liu JS, 2023, Arxiv, DOI arXiv:2108.13624

[10] Deep Learning for Generic Object Detection: A Survey [J].

Liu, Li ;

Ouyang, Wanli ;

Wang, Xiaogang ;

Fieguth, Paul ;

Chen, Jie ;

Liu, Xinwang ;

Pietikainen, Matti .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (02) :261-318

← 1 2 3 →