Unsupervised Object Detection using Patch Based Image Classifier and Gradient Importance Map

Cited by: 0

Authors:

Vanita Jain [1]

Manu S. Pillai [2]

Achin Jain [3]

Arun Kumar Dubey [3]

Affiliations:

[1] University of Delhi, Department of Electronic Science

[2] University of Central Florida, Center for Research in Computer Vision

[3] Bharati Vidyapeeth’s College of Engineering

Source:

International Journal of Information Technology | 2025 / Vol. 17 / Issue 4

Keywords:

Image classification; Object detection; Deep learning; Grad-CAM

DOI:

10.1007/s41870-025-02412-4

Abstract:

Image classification in computer vision has seen tremendous success in recent years. Deep learning has played a pivotal role in achieving human-level performance in many image recognition challenges and benchmarks. Even though image classification has been so successful, no other closely related domain has taken advantage of the effort put into developing image classification methods. One such closely related field is object detection. Object detection, or localisation, is a computer vision problem whose solutions have not yet reached human-level performance. Many challenges arise when developing object detection models for new domains, one of which is the labelling of datasets. Dataset preparation is one of the most cumbersome and expensive tasks in developing an object detection model. Although image classifiers are used as feature extractors in object detection training regimes, their localisation abilities are barely studied. In this paper, we propose an object detection training regime that does not rely on bounding-box-labelled datasets, and is hence unsupervised in nature, and that is based solely on trained image classifiers. We build on our hypothesis that "if an image classifier is able to predict what object is in the input image, then it must have information about where the object is; we just need a mechanism to extract that information from it". Precisely, we divide the input image into patches of the same size and apply a parameter-restricted convolutional classifier to each patch to predict whether it contains the object or not; we call this our patch-based image classifier (the object here is the prediction of the trained image classifier). Training the patch-based classifier is not straightforward, as there are no true labels for the patches against which we can minimise the binary cross-entropy. Therefore, we propose a loss function weighted by the importance map, which we generate using Grad-CAM, that, when minimised, detects only the patches containing the object that the classifier predicted. Empirically, we show that our method performs competitively on the ILSVRC-2017 object localisation benchmark.
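
The abstract does not give the exact form of the importance-weighted loss, so the following is only a minimal sketch under stated assumptions, not the authors' formulation. It assumes the Grad-CAM importance map is already computed and scaled to [0, 1], averages it over equal-sized patches, and uses each patch's mean importance as a soft target in a binary cross-entropy. The function names `patch_importance` and `importance_weighted_bce` are hypothetical.

```python
import math


def patch_importance(cam, patch):
    """Average a 2D importance map over non-overlapping patch x patch cells.

    `cam` is a list of rows with values in [0, 1] (assumed to come from
    Grad-CAM); its height and width must be multiples of `patch`.
    """
    height, width = len(cam), len(cam[0])
    if height % patch or width % patch:
        raise ValueError("map size must be a multiple of the patch size")
    grid = []
    for top in range(0, height, patch):
        row = []
        for left in range(0, width, patch):
            cell = [cam[y][x]
                    for y in range(top, top + patch)
                    for x in range(left, left + patch)]
            row.append(sum(cell) / len(cell))
        grid.append(row)
    return grid


def importance_weighted_bce(probs, importance, eps=1e-7):
    """Assumed loss: binary cross-entropy with patch importance as soft target.

    `probs` holds the patch classifier's per-patch object probabilities and
    `importance` holds the per-patch mean importance, both as 2D lists of the
    same shape. This is an illustrative choice, not the paper's loss.
    """
    total, count = 0.0, 0
    for prob_row, weight_row in zip(probs, importance):
        for p, w in zip(prob_row, weight_row):
            p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
            total -= w * math.log(p) + (1.0 - w) * math.log(1.0 - p)
            count += 1
    return total / count
```

Under this assumption, patch predictions that agree with the importance map yield a lower loss than predictions that contradict it, which is the behaviour the abstract describes qualitatively.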

Pages: 2407-2416

Page count: 9

