Unsupervised Object Detection using Patch Based Image Classifier and Gradient Importance Map

被引:0
|
作者
Vanita Jain [1 ]
Manu S. Pillai [2 ]
Achin Jain [3 ]
Arun Kumar Dubey [3 ]
机构
[1] University of Delhi,Department of Electronic Science
[2] University of Central Florida,Center for Research in Computer Vision
[3] Bharati Vidyapeeth’s College of Engineering,undefined
关键词
Image classification; Object detection; Deep learning; Grad-CAM;
D O I
10.1007/s41870-025-02412-4
中图分类号
学科分类号
摘要
Image classification in computer vision has seen tremendous amount of success in recent years. Deep learning has played a pivotal role in achieving human level performance in many image recognition challenges and benchmarks. Even though, image classification has been so successful, no other closely related domains have taken advantage from the efforts put into development of image classification methods. One such closely related field is of Object Detection. Object detection or localisation is a computer vision problem whose solutions have not been victorious enough to human level performance. Many challenges arise when developing object detection models for newly generated domains, one of which is labelling of datasets. Preparation of dataset is one of the most cumbersome and expensive task to accomplish while developing an object detection model. Although, image classifiers are used as a feature extractor in object detection training regimes, their localisation abilities are barely studied. In this paper, we propose an object detection training regime, that does not rely on bounding box labelled datasets, hence unsupervised in nature, and is solely based on trained image classifiers. We build up on our hypothesis, that, "if an image classifier is able to predict what object is in the input image, then it must have information about where the object is, we just need a mechanism to extract that information from it". Precisely, we divide the input image into patches of same size and employ a parameter restricted convolutional classifier on each patches to predict whether it contains the object or not, we call this our patch-based image classifier (the object here is the prediction of the trained image classifier). The training of the patch-based classifier is not straightforward as there is no true labels for each patches on which we can reduce the binary cross-entropy. Therefore, we propose a loss function weighted by the importance map, which we generate using Grad-CAM, that when minimized detects patches only containing objects that the classifier predicted. Empirically, we show that our method performs competitively on the ILSVRC-2017 object localisation benchmark.
引用
收藏
页码:2407 / 2416
页数:9
相关论文
共 50 条
  • [1] Out-of-Plane Rotated Object Detection using Patch Feature based Classifier
    Mustafah, Yasir Mohd
    Azman, Amelia Wong
    INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS 2012 (IRIS 2012), 2012, 41 : 170 - 174
  • [2] Object Detection based on Saliency Map using Reference Image Containing Complex Background
    Tamura, Yasuto
    Masuta, Hiroyuki
    Takanishi, Atsuo
    Lim, Hun-ok
    2014 WORLD AUTOMATION CONGRESS (WAC): EMERGING TECHNOLOGIES FOR A NEW PARADIGM IN SYSTEM OF SYSTEMS ENGINEERING, 2014,
  • [3] Cross-domain object detection using unsupervised image translation
    Arruda, Vinicius F.
    Berriel, Rodrigo F.
    Paixao, Thiago M.
    Badue, Claudine
    De Souza, Alberto F.
    Sebe, Nicu
    Oliveira-Santos, Thiago
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
  • [4] Unsupervised Domain Adaptation for Image Classification and Object Detection Using Guided Transfer Learning Approach and JS']JS Divergence
    Goel, Parth
    Ganatra, Amit
    SENSORS, 2023, 23 (09)
  • [5] Binary Image Filtering for Object Detection Based on Haar Feature Density Map
    Li, Chengqi
    Ren, Zhigang
    Yang, Bo
    2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND MACHINE VISION, 2017, 10613
  • [6] Square Patch Feature: Faster Weak-Classifier for Robust Object Detection
    Mustafah, Yasir M.
    Bigdeli, Abbas
    Azman, Amelia W.
    Dadgostar, Farhad
    Lovell, Brian C.
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2073 - 2077
  • [7] An image caption method based on object detection
    Danyang Cao
    Menggui Zhu
    Lei Gao
    Multimedia Tools and Applications, 2019, 78 : 35329 - 35350
  • [8] An image caption method based on object detection
    Cao, Danyang
    Zhu, Menggui
    Gao, Lei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (24) : 35329 - 35350
  • [9] A Novel Pornographic Visual Content Classifier based on Sensitive Object Detection
    Dinh-Duy Phan
    Thanh-Thien Nguyen
    Quang-Huy Nguyen
    Hoang-Loc Tran
    Khac-Ngoc-Khoi Nguyen
    Duc-Lung Vu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 787 - 795
  • [10] Generic Object Detection Using Improved Gentleboost Classifier
    Guo, Li
    Liao, Yu
    Luo, Daisheng
    Liao, Honghua
    INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1528 - 1535