Object Tracking Based Surgical Incision Region Encoding using Scalable High Efficiency Video Coding for Surgical Telementoring Applications

被引:3
作者
Sanagavarapu, Karthik Sairam [1 ]
Pullakandam, Muralidhar [1 ]
机构
[1] NIT Warangal, Dept ECE, Warangal, Andhra Pradesh, India
关键词
Surgical telementoring; object tracking; KCF tracker; region of interest; High Efficiency Video Coding; HEVC; EXTENSIONS; SYSTEM;
D O I
10.13164/re.2022.0231
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Surgical telementoring is an advanced tele-medicine concept where the expert surgeon guides the onsite novice present at the remote location. The efficient telementoring system requires the wireless transmission of high-quality surgical video with less bitrate in less time. The bit rate of the surgical video can be decreased by segmenting the surgical incision region and removing the background region. The High Efficiency Video Coding (HEVC) standard has provided promising results for surgical telementoring applications. But the Rate-Distortion Optimization (RDO) search process in HEVC increases the complexity that in turn increases the encoding time. We propose the method which involves the segmentation of the surgical incision region using the Kernelized Correlation Filter (KCF) object tracking technique. The segmented region is encoded by the complexity-efficient Scalable HEVC (SHVC) to meet the resolution of an end-user device. The complexity of SHVC is decreased by using the Convolutional Neural Network (CNN) and Long- and Short- Term Memory (LSTM) to predict the Coding Tree Unit (CTU) structure. The results show that the proposed method decreases the bitrate significantly for segmented surgical video sequences without degradation in Peak Signal-to-Noise Ratio (PSNR). These results are obtained for the surgical video sequences with slow-moving objects. Furthermore, the CNN+LSTM approach reduces the encoding time of standard SHVC by 51% with negligible Rate-Distortion (RD) performance loss.
引用
收藏
页码:231 / 242
页数:12
相关论文
共 45 条
  • [1] [Anonymous], 2017, Int J Adv Res Basic Eng Sci Technol (IJARBEST)
  • [2] [Anonymous], 2010, 2010 IFIP Wireless Days
  • [3] [Anonymous], 2012, P IEEE MIL COMM C
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Barsakar T, 2016, 2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), P212
  • [6] Bjontegaard G, 2008, ITU T SG16Q6 35 VCEG
  • [7] Bjontegaard G., 2001, P 13 VCEG M AUST TX
  • [8] Overview of SHVC: Scalable Extensions of the High Efficiency Video Coding Standard
    Boyce, Jill M.
    Ye, Yan
    Chen, Jianle
    Ramasubramonian, Adarsh K.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) : 20 - 34
  • [9] Chen C., 2014, J TRAVEL RES, P1
  • [10] Mean shift: A robust approach toward feature space analysis
    Comaniciu, D
    Meer, P
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 603 - 619