GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object Tracking

被引:8
作者
Kumar, Abhishek [1 ]
Ajani, Oladayo S. [1 ]
Das, Swagatam [2 ]
Mallipeddi, Rammohan [1 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 37224, South Korea
[2] Indian Stat Inst Kolkata, Elect & Commun Sci Unit, Kolkata 700108, India
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022年
基金
新加坡国家研究基金会;
关键词
MEAN-SHIFT;
D O I
10.1109/CVPR52688.2022.00796
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning and computer vision, mean shift (MS) qualifies as one of the most popular mode-seeking algorithms used for clustering and image segmentation. It iteratively moves each data point to the weighted mean of its neighborhood data points. The computational cost required to find the neighbors of each data point is quadratic to the number of data points. Consequently, the vanilla MS appears to be very slow for large-scale datasets. To address this issue, we propose a mode-seeking algorithm called GridShift, with significant speedup and principally based on MS. To accelerate, GridShift employs a grid-based approach for neighbor search, which is linear in the number of data points. In addition, GridShift moves the active grid cells (grid cells associated with at least one data point) in place of data points towards the higher density, a step that provides more speedup. The runtime of GridShift is linear in the number of active grid cells and exponential in the number of features. Therefore, it is ideal for large-scale low-dimensional applications such as object tracking and image segmentation. Through extensive experiments, we showcase the superior performance of GridShift compared to other MS-based as well as state-of-the-art algorithms in terms of accuracy and runtime on benchmark datasets for image segmentation. Finally, we provide a new object-tracking algorithm based on GridShift and show promising results for object tracking compared to CamShift and meanshift++.
引用
收藏
页码:8121 / 8129
页数:9
相关论文
共 28 条
[1]   SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].
Achanta, Radhakrishna ;
Shaji, Appu ;
Smith, Kevin ;
Lucchi, Aurelien ;
Fua, Pascal ;
Suesstrunk, Sabine .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281
[2]  
Allen J. G., 2004, Proceedings of the Pan-Sydney area workshop on Visual information processing, V100, P3
[3]   A common framework for nonlinear diffusion, adaptive smoothing, bilateral filtering and mean shift [J].
Barash, D ;
Comaniciu, D .
IMAGE AND VISION COMPUTING, 2004, 22 (01) :73-81
[4]  
Bigdeli S.A., 2017, NIPS
[5]  
Bradski GaryR., 1998, Computer vision face tracking for use in a perceptual user interface
[6]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[7]   Kernel-based object tracking [J].
Comaniciu, D ;
Ramesh, V ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (05) :564-577
[8]   Efficient graph-based image segmentation [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) :167-181
[9]   A METHOD FOR COMPARING 2 HIERARCHICAL CLUSTERINGS [J].
FOWLKES, EB ;
MALLOWS, CL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (383) :553-569
[10]   Ultra-Scalable Spectral Clustering and Ensemble Clustering [J].
Huang, Dong ;
Wang, Chang-Dong ;
Wu, Jian-Sheng ;
Lai, Jian-Huang ;
Kwoh, Chee-Keong .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (06) :1212-1226