Synthetic region screening and adaptive feature fusion for constructing a flexible object detection database

被引:0
作者
Guan, Licong [1 ]
Yuan, Xue [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
synthetic region screening; adaptive feature fusion; image synthesis; object detection;
D O I
10.1117/1.JEI.30.5.053027
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A large amount of training data (including samples and labels) are required to achieve the task of object detection. However, the construction of databases generally has several problems. The samples are not balanced. In some scenarios, very few or even no samples can be collected; manual collection and labeling are expensive. Obtaining samples from public object detection data sets is simple, but the categories that can be obtained are limited and cannot meet the needs of other detection tasks. Therefore, we propose a method to automatically construct a flexible object detection database and complete automatic labeling. Our main advantage lies in the proposed synthetic region screening and adaptive feature fusion algorithm, which generates sample images and corresponding annotation files that are similar to the real collection. Our algorithm has a stronger and more realistic synthesis ability for three-dimensional objects than other data synthesis methods. Our method improves the system's object detection accuracy in the following ways: (1) it synthesizes a wider range of images for certain categories with few samples in the data set to solve the problem of sample imbalance; (2) it adds new categories to the public data set to meet the needs of rapid deployment of special object detection tasks; and (3) it fuses different foregrounds and backgrounds to enrich the diversity of database samples. To effectively evaluate the proposed method, we conducted experiments separately, and the experimental results proved that our method is superior to other existing data synthesis and data enhancement methods. Combining the generated samples with the real data set, the object detection accuracy value increased from 9.47% to 58.92%. Extending the objects of the self-collected data set to the public data set and automatically generating high-quality annotations, the detection accuracy of extended objects reached more than 50%. (C) 2021 SPIE and IS&T
引用
收藏
页数:17
相关论文
共 48 条
  • [1] Amodei D, 2016, PR MACH LEARN RES, V48
  • [2] Contour Detection and Hierarchical Image Segmentation
    Arbelaez, Pablo
    Maire, Michael
    Fowlkes, Charless
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 898 - 916
  • [3] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
    Bell, Sean
    Zitnick, C. Lawrence
    Bala, Kavita
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
  • [4] Bochkovskiy A., 2020, PREPRINT, DOI DOI 10.48550/ARXIV.2004.10934
  • [5] DeveshWalawalkar Zhiqiang Shen, 2020, ICASSP 2020, P3642
  • [6] DeVries T., 2017, ARXIV
  • [7] Dubrofsky E., 2009, Homography estimation, P5
  • [8] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
    Dwibedi, Debidatta
    Misra, Ishan
    Hebert, Martial
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1310 - 1319
  • [9] Everingham M., 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
  • [10] InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting
    Fang, Hao-Shu
    Sun, Jianhua
    Wang, Runzhong
    Gou, Minghao
    Li, Yong-Lu
    Lu, Cewu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 682 - 691