Multi-scale HOG Feature Used in Object Detection

Cited by: 0
Authors
Li, Jin [1]
Zhang, Hong [1]
Zhang, Lei [1]
Li, Yawei [1]
Kang, Qiaochu [2]
Luo, Zhaohui [1]
Wu, Yujie [1]
Affiliations
[1] Beihang Univ, Image Ctr, Sch Astronaut, Beijing, Peoples R China
[2] Univ Massachusetts, Amherst, MA 01003 USA
Source
Tenth International Conference on Graphics and Image Processing (ICGIP 2018) | 2019 / Vol. 11069
Funding
National Natural Science Foundation of China
Keywords
multi-scale; HOG; object detection;
DOI
10.1117/12.2524169
Chinese Library Classification number
O43 [Optics]
Discipline classification code
070207; 0803
Abstract
Object detection is one of the most popular and difficult fields in computer vision. Although deep learning methods perform very well on object detection, algorithms that use hand-crafted features are still widely used in specific applications. One main problem in object detection is scale. Algorithms usually use an image pyramid to cover as many scales as possible, but gaps still exist between the scale levels of the pyramid. Our work adds sub-scale levels to fill these gaps. To this end, we use a Gaussian scale pyramid to generate sub-scale images and extract HOG features on them. We build on the framework provided by the DPM algorithm and modify it. We compare the results of our method with the DPM baseline on the Pascal VOC dataset. Our method performs strongly on some categories and improves the overall performance. It can also be used in other object detection frameworks. We apply the multi-scale HOG feature in the pre-processing stage of our own detection framework and test it on our own dataset; the framework gains precision and recall in the pre-processing stage compared with the original HOG feature architecture.
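The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it describes: blur a patch at a few Gaussian scales and compute a gradient-orientation histogram at each. The function names, the sigma values (1.0 and 1.6), the 9-bin unsigned histogram, and the treatment of the whole patch as a single cell are all assumptions made for illustration. This is not the authors' method and not the DPM feature pipeline.

```python
import math


def gaussian_kernel(sigma):
    """Return a normalized 1-D Gaussian kernel with radius of about 3 sigma."""
    radius = max(1, int(math.ceil(3 * sigma)))
    weights = [math.exp(-(x * x) / (2.0 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur(image, sigma):
    """Separable Gaussian blur of a 2-D list of floats, clamping at the borders."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    height, width = len(image), len(image[0])

    def clamp(i, n):
        return min(max(i, 0), n - 1)

    # Horizontal pass, then vertical pass.
    rows = [[sum(kernel[k] * image[y][clamp(x + k - radius, width)]
                 for k in range(len(kernel)))
             for x in range(width)]
            for y in range(height)]
    return [[sum(kernel[k] * rows[clamp(y + k - radius, height)][x]
                 for k in range(len(kernel)))
             for x in range(width)]
            for y in range(height)]


def orientation_histogram(image, bins=9):
    """Unsigned gradient-orientation histogram over the patch interior,
    weighted by gradient magnitude (one HOG-style cell, no block normalization)."""
    hist = [0.0] * bins
    height, width = len(image), len(image[0])
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            magnitude = math.hypot(gx, gy)
            angle = math.atan2(gy, gx) % math.pi  # fold into [0, pi)
            index = min(int(angle / math.pi * bins), bins - 1)
            hist[index] += magnitude
    return hist


def multi_scale_cell_features(image, sigmas=(1.0, 1.6)):
    """Histogram of the original patch followed by one per assumed sub-scale."""
    features = [orientation_histogram(image)]
    for sigma in sigmas:
        features.append(orientation_histogram(blur(image, sigma)))
    return features
```

Calling `multi_scale_cell_features(patch)` on a 2-D list of floats returns one histogram for the unblurred patch and one per sigma. A real detector would additionally split the image into cells, normalize over blocks, and combine these sub-scale features with a downsampled image pyramid.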
Pages: 7