Transformer-Based Parking Slot Detection Using Fixed Anchor Points

被引:4
作者
Bui, Quang Huy [1 ]
Suhr, Jae Kyu [1 ]
机构
[1] Sejong Univ, Sch Intelligent Mechatron Engn, Seoul 05006, South Korea
基金
新加坡国家研究基金会;
关键词
Automatic parking system; parking slot detection; deep learning; transformers; convolutional neural network (CNN); around view monitor (AVM);
D O I
10.1109/ACCESS.2023.3315738
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transformer-based architectures have recently gained significant attention in various computer vision tasks. Their ability to capture non-local dependencies and intricate characteristics makes them a promising complement to CNNs. However, their application in parking slot detection tasks is still limited. Thus, this paper proposes an appropriate way to apply transformer-based architectures to parking slot detection tasks. The proposed method adopts the Detection Transformer (DETR) architecture, which employs a standard transformer encoder-decoder framework. Since this approach requires a long training time, this paper suggests utilizing fixed anchor points to replace object queries in the original DETR architecture. Each anchor point is assigned a known location and focuses only on a predefined area of the feature map, resulting in a considerable reduction in training time. In addition, this paper suggests using a more suitable and efficient two-point parking slot representation to improve detection performance. In experiments, the proposed method was evaluated with the public large-scale SNU dataset and showed comparable detection performance to the state-of-the-art CNN-based methods with 96.11% recall and 96.61% precision.
引用
收藏
页码:104417 / 104427
页数:11
相关论文
共 31 条
[1]  
[Anonymous], Multi-Camera System.
[2]  
[Anonymous], Surround View Camera
[3]  
[Anonymous], Intelligent Around View Monitor
[4]   CNN-Based Two-Stage Parking Slot Detection Using Region-Specific Multi-Scale Feature Extraction [J].
Bui, Quang Huy ;
Suhr, Jae Kyu .
IEEE ACCESS, 2023, 11 :58491-58505
[5]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[6]   Context-Based Parking Slot Detection With a Realistic Dataset [J].
Do, Hoseok ;
Choi, Jin Young .
IEEE ACCESS, 2020, 8 :171551-171559
[7]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110
[8]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[9]   DMPR-PS: A NOVEL APPROACH FOR PARKING-SLOT DETECTION USING DIRECTIONAL MARKING-POINT REGRESSION [J].
Huang, Junhao ;
Zhang, Lin ;
Shen, Ying ;
Zhang, Huijuan ;
Zhao, Shengjie ;
Yang, Yukai .
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, :212-217
[10]   Semantic segmentation-based parking space detection with standalone around view monitoring system [J].
Jang, Chulhoon ;
Sunwoo, Myoungho .
MACHINE VISION AND APPLICATIONS, 2019, 30 (02) :309-319