Robust Signboard Detection and Recognition in Real Environments

被引：11

作者：

Cheewaprakobkit, Pimpa ^{[1
]}

Lin, Chih-Yang ^{[2
]}

Lin, Kuan-Hung ^{[1
]}

Shih, Timothy K. K. ^{[1
]}

机构：

[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan

[2] Natl Cent Univ, Dept Mech Engn, Taoyuan 32001, Taiwan

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2023年 / 69卷 / 03期

关键词：

Cyclical generative adversarial networks; signboard detection; one-stage detector;

D O I：

10.1109/TCE.2023.3257288

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The detection and recognition of signboards have become increasingly important in the consumer electronics industry due to its wide range of potential applications. These applications include aiding visually impaired consumers in navigating through unfamiliar areas, identifying location landmarks for wayfinding, and providing targeted advertising and marketing services to consumers. However, the accuracy of signboard detection remains challenging due to the diversity of designs, which may incorporate text and images, and the complexity of environments, such as occlusion, shooting angles, and lighting conditions. Most existing detection methods struggle to distinguish small and similar signboards. In this paper, we propose robust signboard detection and recognition based on template generation. We also collected a new dataset that contains about 30,000 images, in 14 categories of signboards in Taiwan for training and free public use. The proposed method is a one-stage detector, which utilizes multi-scale features in the Darknet-19 network to learn object features effectively, detecting tiny and large objects. In addition, the proposed template generation method was designed to improve the overall accuracy. We compare our results with the Yolo series models. The results show that our proposed method more efficiently detects and recognizes signboards, achieving an mAP score of 95.99%, total parameters of 62.7M, and FPS of 8.3.

引用

页码：421 / 430

页数：10

共 40 条

[1] CDGAN: Cyclic Discriminative Generative Adversarial Networks for image-to-image transformation [J].

Babu, Kancharagunta Kishan ;

Dubey, Shiv Ram .

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 82

[2]

Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]

[3] STDnet-ST: Spatio-temporal ConvNet for small object detection [J].

Bosquet, Brais ;

Mucientes, Manuel ;

Brea, Victor M. .

PATTERN RECOGNITION, 2021, 116 (116)

[4] Cascade R-CNN: Delving into High Quality Object Detection [J].

Cai, Zhaowei ;

Vasconcelos, Nuno .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162

[5] Scale-aware Automatic Augmentation for Object Detection [J].

Chen, Yukang ;

Li, Yanwei ;

Kong, Tao ;

Qi, Lu ;

Chu, Ruihang ;

Li, Lei ;

Jia, Jiaya .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :9558-9567

[6]

Chen-Ya Hong, 2019, 2019 Twelfth International Conference on Ubi-Media Computing (Ubi-Media). Proceedings, P256, DOI 10.1109/Ubi-Media.2019.00057

[7] A Multiscale Recognition Method for the Optimization of Traffic Signs Using GMM and Category Quality Focal Loss [J].

Gao, Mingyu ;

Chen, Chao ;

Shi, Jie ;

Lai, Chun Sing ;

Yang, Yuxiang ;

Dong, Zhekang .

SENSORS, 2020, 20 (17) :1-20

[8]

He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]

[9] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916

[10] A Survey of Deep Learning-Based Object Detection [J].

Jiao, Licheng ;

Zhang, Fan ;

Liu, Fang ;

Yang, Shuyuan ;

Li, Lingling ;

Feng, Zhixi ;

Qu, Rong .

IEEE ACCESS, 2019, 7 :128837-128868

← 1 2 3 4 →