Synthetic region screening and adaptive feature fusion for constructing a flexible object detection database

被引：0

作者：

Guan, Licong ^{[1
]}

Yuan, Xue ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2021年 / 30卷 / 05期

基金：

中国国家自然科学基金;

关键词：

synthetic region screening; adaptive feature fusion; image synthesis; object detection;

D O I：

10.1117/1.JEI.30.5.053027

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A large amount of training data (including samples and labels) are required to achieve the task of object detection. However, the construction of databases generally has several problems. The samples are not balanced. In some scenarios, very few or even no samples can be collected; manual collection and labeling are expensive. Obtaining samples from public object detection data sets is simple, but the categories that can be obtained are limited and cannot meet the needs of other detection tasks. Therefore, we propose a method to automatically construct a flexible object detection database and complete automatic labeling. Our main advantage lies in the proposed synthetic region screening and adaptive feature fusion algorithm, which generates sample images and corresponding annotation files that are similar to the real collection. Our algorithm has a stronger and more realistic synthesis ability for three-dimensional objects than other data synthesis methods. Our method improves the system's object detection accuracy in the following ways: (1) it synthesizes a wider range of images for certain categories with few samples in the data set to solve the problem of sample imbalance; (2) it adds new categories to the public data set to meet the needs of rapid deployment of special object detection tasks; and (3) it fuses different foregrounds and backgrounds to enrich the diversity of database samples. To effectively evaluate the proposed method, we conducted experiments separately, and the experimental results proved that our method is superior to other existing data synthesis and data enhancement methods. Combining the generated samples with the real data set, the object detection accuracy value increased from 9.47% to 58.92%. Extending the objects of the self-collected data set to the public data set and automatically generating high-quality annotations, the detection accuracy of extended objects reached more than 50%. (C) 2021 SPIE and IS&T

引用

页数：17

共 48 条

[1] Amodei D, 2016, PR MACH LEARN RES, V48
[2] Contour Detection and Hierarchical Image Segmentation
Arbelaez, Pablo
Maire, Michael
Fowlkes, Charless
Malik, Jitendra
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 898 - 916
[3] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Bell, Sean
Zitnick, C. Lawrence
Bala, Kavita
Girshick, Ross
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
[4] Bochkovskiy A., 2020, PREPRINT, DOI DOI 10.48550/ARXIV.2004.10934
[5] DeveshWalawalkar Zhiqiang Shen, 2020, ICASSP 2020, P3642
[6] DeVries T., 2017, ARXIV
[7] Dubrofsky E., 2009, Homography estimation, P5
[8] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
Dwibedi, Debidatta
Misra, Ishan
Hebert, Martial
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1310 - 1319
[9] Everingham M., 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
[10] InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting
Fang, Hao-Shu
Sun, Jianhua
Wang, Runzhong
Gou, Minghao
Li, Yong-Lu
Lu, Cewu
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 682 - 691

← 1 2 3 4 5 →