Surgical Instrument Detection Algorithm Based on Improved YOLOv7x

被引:10
作者
Ran, Boping [1 ]
Huang, Bo [1 ]
Liang, Shunpan [1 ]
Hou, Yulei [2 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066000, Peoples R China
[2] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066000, Peoples R China
关键词
deep learning; YOLOV7x; surgical instrument detection; computer vision;
D O I
10.3390/s23115037
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The counting of surgical instruments is an important task to ensure surgical safety and patient health. However, due to the uncertainty of manual operations, there is a risk of missing or miscounting instruments. Applying computer vision technology to the instrument counting process can not only improve efficiency, but also reduce medical disputes and promote the development of medical informatization. However, during the counting process, surgical instruments may be densely arranged or obstruct each other, and they may be affected by different lighting environments, all of which can affect the accuracy of instrument recognition. In addition, similar instruments may have only minor differences in appearance and shape, which increases the difficulty of identification. To address these issues, this paper improves the YOLOv7x object detection algorithm and applies it to the surgical instrument detection task. First, the RepLK Block module is introduced into the YOLOv7x backbone network, which can increase the effective receptive field and guide the network to learn more shape features. Second, the ODConv structure is introduced into the neck module of the network, which can significantly enhance the feature extraction ability of the basic convolution operation of the CNN and capture more rich contextual information. At the same time, we created the OSI26 data set, which contains 452 images and 26 surgical instruments, for model training and evaluation. The experimental results show that our improved algorithm exhibits higher accuracy and robustness in surgical instrument detection tasks, with F1, AP, AP50, and AP75 reaching 94.7%, 91.5%, 99.1%, and 98.2%, respectively, which are 4.6%, 3.1%, 3.6%, and 3.9% higher than the baseline. Compared to other mainstream object detection algorithms, our method has significant advantages. These results demonstrate that our method can more accurately identify surgical instruments, thereby improving surgical safety and patient health.
引用
收藏
页数:18
相关论文
共 39 条
  • [1] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [2] Synthetic CT generation from CBCT images via deep learning
    Chen, Liyuan
    Liang, Xiao
    Shen, Chenyang
    Jiang, Steve
    Wang, Jing
    [J]. MEDICAL PHYSICS, 2020, 47 (03) : 1115 - 1125
  • [3] Dynamic Convolution: Attention over Convolution Kernels
    Chen, Yinpeng
    Dai, Xiyang
    Liu, Mengchen
    Chen, Dongdong
    Yuan, Lu
    Liu, Zicheng
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11027 - 11036
  • [4] Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
    Ding, Xiaohan
    Zhang, Xiangyu
    Han, Jungong
    Ding, Guiguang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11953 - 11965
  • [5] Deep learning-enabled medical computer vision
    Esteva, Andre
    Chou, Katherine
    Yeung, Serena
    Naik, Nikhil
    Madani, Ali
    Mottaghi, Ali
    Liu, Yun
    Topol, Eric
    Dean, Jeff
    Socher, Richard
    [J]. NPJ DIGITAL MEDICINE, 2021, 4 (01)
  • [6] Haowei Ma, 2021, 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT), P52, DOI 10.1109/ISRIMT53730.2021.9597049
  • [7] Hooshangnejad H, 2023, Arxiv, DOI arXiv:2301.11085
  • [8] Hua R.F, 2014, P 2014 HENAN PROVINC, P61
  • [9] Huang X.F., 2007, J NURSE ED, V20, P1835, DOI [10.16821/j.cnki.hsjx.2007.20.005, DOI 10.16821/J.CNKI.HSJX.2007.20.005]
  • [10] Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks
    Jin, Amy
    Yeung, Serena
    Jopling, Jeffrey
    Krause, Jonathan
    Azagury, Dan
    Milstein, Arnold
    Li Fei-Fei
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 691 - 699