Unsupervised Logo Detection Using Adversarial Learning From Synthetic to Real Images

被引：1

作者：

Jain, Rahul Kumar ^{[1
]}

Sato, Takahiro ^{[2
]}

El-Sayed, Ahmed M. ^{[2
]}

Watasue, Taro ^{[2
]}

Nakagawa, Tomohiro ^{[2
]}

Iwamoto, Yutaro ^{[3
]}

Ruan, Xiang ^{[2
]}

Chen, Yen-Wei ^{[4
]}

机构：

[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu, Shiga 6038577, Japan

[2] Tiwaki Co Ltd, Kusatsu 5250047, Japan

[3] Osaka Electrocommun Univ, Informat & Commun Engn, Osaka 5720833, Japan

[4] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu, Shiga 6038577, Japan

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 01期

关键词：

Logo detection; unsupervised domain adaptation; adversarial learning; synthesize images; anchorless object detectors; entropy minimization; maximum square loss;

D O I：

10.1109/TETCI.2023.3256475

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most existing deep learning-based methods for logo detection require a large amount of object-level annotated training data. However, it takes a lot of time and effort to create a good object-level annotated logo detection dataset. Many researchers have tried to utilize synthesized images with automatically generated annotations for training to deal with data annotation problem. However, in real applications, model trained using synthesized images suffer from performance loss due to domain shift between synthesized and real-world images. To this end, in this paper, we propose an adversarial learning-based unsupervised domain adaptation method for logo detection. We only use labeled synthesized logo images for model training and adapt target domain knowledge using unlabeled real-world logo images. We propose entropy minimization of mid-level output feature maps in order to effectively align the domain gap between synthetic and real images for the logo detection task. Additionally, we have generated coherent synthesized logo images with automatically constructed bounding box annotations for different datasets to perform unsupervised training for the experiments. Our experiments show that the proposed method improves performance on different logo datasets compared to direct transfer from source to target domain (synthetic-to-real images) without any labeling cost and increasing network parameters.

引用

页码：710 / 723

页数：14

共 52 条

[1] Dual Discriminator-Based Unsupervised Domain Adaptation Using Adversarial Learning for Liver Segmentation on Multiphase CT Images [J].

Ananda, Swathi ;

Iwamoto, Yutaro ;

Han, Xianhua ;

Lin, Lanfen ;

Hu, Hongjie ;

Chen, Yen-Wei .

2022 44TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2022, :1552-1555

[2]

[Anonymous], 2014, P INT C MULT RETR

[3]

[Anonymous], 2013, IET INT C SMART SUST, DOI DOI 10.1049/CP.2013.1994

[4]

Bernabeu Marisa, 2022, ARXIV

[5] Domain Adaptation for Semantic Segmentation with Maximum Squares Loss [J].

Chen, Minghao ;

Xue, Hongyang ;

Cai, Deng .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2090-2099

[6] Domain Adaptive Faster R-CNN for Object Detection in the Wild [J].

Chen, Yuhua ;

Li, Wen ;

Sakaridis, Christos ;

Dai, Dengxin ;

Van Gool, Luc .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3339-3348

[7]

Chen YW, 2020, Deep learning in healthcare

[8] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[9] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[10] Scalable Logo Recognition using Proxies [J].

Fehervari, Istvan ;

Appalaraju, Srikar .

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :715-725

← 1 2 3 4 5 6 →