SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL OBJECT DETECTION

被引:205
作者
Akyon, Fatih Cagatay [1 ,2 ]
Altinuc, Sinan Onur [1 ,2 ]
Temizel, Alptekin [2 ]
机构
[1] OBSS AI, Ankara, Turkey
[2] Middle East Tech Univ, Grad Sch Informat, Ankara, Turkey
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
关键词
small object detection; sliced inference; windowed inference; visdrone; xview; NETWORK;
D O I
10.1109/ICIP46576.2022.9897990
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detection of small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by small number of pixels in the image and lack sufficient details, making them difficult to detect using conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing aided inference and finetuning pipeline for small object detection. The proposed technique is generic in the sense that it can be applied on top of any available object detector without any fine-tuning. Experimental evaluations, using object detection baselines on the Visdrone and xView aerial object detection datasets show that the proposed inference method can increase object detection AP by 6.8%, 5.1% and 5.3% for FCOS, VFNet and TOOD detectors, respectively. Moreover, the detection accuracy can be further increased with a slicing aided fine-tuning, resulting in a cumulative increase of 12.7%, 13.4% and 14.5% AP in the same order. Proposed technique has been integrated with Detectron2, MMDetection and YOLOv5 models and it is publicly available at https://github.com/obss/sahi.git
引用
收藏
页码:966 / 970
页数:5
相关论文
共 26 条
[1]   Deep Attention Neural Tensor Network for Visual Question Answering [J].
Bai, Yalong ;
Fu, Jianlong ;
Zhao, Tiejun ;
Mei, Tao .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :21-37
[2]  
Bochkovskiy A, 2020, ARXIV, DOI 10.48550/ARXIV.2004.10934
[3]  
Bosquet B., 2018, P BMVC
[4]  
Chen K., 2019, CoRR abs/1906.07155
[5]   SSD-MSN: An Improved Multi-Scale Object Detection Network Based on SSD [J].
Chen, Zuge ;
Wu, Kehe ;
Li, Yuanbo ;
Wang, Minjian ;
Li, Wei .
IEEE ACCESS, 2019, 7 :80622-80632
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Du D., 2019, P IEEE CVF ICCV WORK, P0
[8]  
E. C. for Electro-technical Standardization, 2012, AL SYST CCTV SURV SY
[9]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[10]   TOOD: Task-aligned One-stage Object Detection [J].
Feng, Chengjian ;
Zhong, Yujie ;
Gao, Yu ;
Scott, Matthew R. ;
Huang, Weilin .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3490-3499