Surgical Instrument Detection Algorithm Based on Improved YOLOv7x

被引：10

作者：

Ran, Boping ^{[1
]}

Huang, Bo ^{[1
]}

Liang, Shunpan ^{[1
]}

Hou, Yulei ^{[2
]}

机构：

[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066000, Peoples R China

[2] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066000, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 11期

关键词：

deep learning; YOLOV7x; surgical instrument detection; computer vision;

D O I：

10.3390/s23115037

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The counting of surgical instruments is an important task to ensure surgical safety and patient health. However, due to the uncertainty of manual operations, there is a risk of missing or miscounting instruments. Applying computer vision technology to the instrument counting process can not only improve efficiency, but also reduce medical disputes and promote the development of medical informatization. However, during the counting process, surgical instruments may be densely arranged or obstruct each other, and they may be affected by different lighting environments, all of which can affect the accuracy of instrument recognition. In addition, similar instruments may have only minor differences in appearance and shape, which increases the difficulty of identification. To address these issues, this paper improves the YOLOv7x object detection algorithm and applies it to the surgical instrument detection task. First, the RepLK Block module is introduced into the YOLOv7x backbone network, which can increase the effective receptive field and guide the network to learn more shape features. Second, the ODConv structure is introduced into the neck module of the network, which can significantly enhance the feature extraction ability of the basic convolution operation of the CNN and capture more rich contextual information. At the same time, we created the OSI26 data set, which contains 452 images and 26 surgical instruments, for model training and evaluation. The experimental results show that our improved algorithm exhibits higher accuracy and robustness in surgical instrument detection tasks, with F1, AP, AP50, and AP75 reaching 94.7%, 91.5%, 99.1%, and 98.2%, respectively, which are 4.6%, 3.1%, 3.6%, and 3.9% higher than the baseline. Compared to other mainstream object detection algorithms, our method has significant advantages. These results demonstrate that our method can more accurately identify surgical instruments, thereby improving surgical safety and patient health.

引用

页数：18

共 39 条

[1] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[2] Synthetic CT generation from CBCT images via deep learning
Chen, Liyuan
Liang, Xiao
Shen, Chenyang
Jiang, Steve
Wang, Jing
[J]. MEDICAL PHYSICS, 2020, 47 (03) : 1115 - 1125
[3] Dynamic Convolution: Attention over Convolution Kernels
Chen, Yinpeng
Dai, Xiyang
Liu, Mengchen
Chen, Dongdong
Yuan, Lu
Liu, Zicheng
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11027 - 11036
[4] Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Ding, Xiaohan
Zhang, Xiangyu
Han, Jungong
Ding, Guiguang
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11953 - 11965
[5] Deep learning-enabled medical computer vision
Esteva, Andre
Chou, Katherine
Yeung, Serena
Naik, Nikhil
Madani, Ali
Mottaghi, Ali
Liu, Yun
Topol, Eric
Dean, Jeff
Socher, Richard
[J]. NPJ DIGITAL MEDICINE, 2021, 4 (01)
[6] Haowei Ma, 2021, 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT), P52, DOI 10.1109/ISRIMT53730.2021.9597049
[7] Hooshangnejad H, 2023, Arxiv, DOI arXiv:2301.11085
[8] Hua R.F, 2014, P 2014 HENAN PROVINC, P61
[9] Huang X.F., 2007, J NURSE ED, V20, P1835, DOI [10.16821/j.cnki.hsjx.2007.20.005, DOI 10.16821/J.CNKI.HSJX.2007.20.005]
[10] Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks
Jin, Amy
Yeung, Serena
Jopling, Jeffrey
Krause, Jonathan
Azagury, Dan
Milstein, Arnold
Li Fei-Fei
[J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 691 - 699

← 1 2 3 4 →