Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards

被引：0

作者：

Nguyen, Manh-Hung ^{[1
]}

Chao, Tzu-Yin ^{[2
]}

Hsiao, Ching-Chun ^{[2
]}

Li, Yung-Hui ^{[3
]}

Huang, Ching-Chun ^{[2
]}

机构：

[1] HCM Univ Technol & Educ, Fac Elect & Elect Engn, Ho Chi Minh City 700000, Vietnam

[2] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 300093, Taiwan

[3] Hon Hai Res Inst, AI Res Ctr, New Taipei City 114699, Taiwan

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 02期

关键词：

Task-consistency; reinforcement learning; learn from corrupted rewards; domain transfer; MODEL; INFERENCE;

D O I：

10.1109/TITS.2023.3319531

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

This paper proposes a novel task-consistency learning method that enables us to train a vacant space detection network (target task) based on the logic consistency with the semantic outcomes from a flow-based motion behavior classifier (source task) in a parking lot. Note that the source task can introduce false detection during task-consistency learning, which implies noisy rewards or supervision. The target network can be trained in a reinforcement learning setting by appropriately designing the reward mechanism upon semantic consistency. We also introduce a novel symmetric constraint to detect corrupted samples and reduce the effect of noisy rewards. Unlike conventional corrupted learning methods that use only training losses to identify corrupted samples, our symmetric constraint also explores the relationship among training samples to improve performance. Compared with conventional supervised detection methods, the main contribution of our work is the ability to learn a vacant space detector via semantic consistency rather than supervised labels. The dynamic learning property allows the proposed detector to be easily deployed and updated in various lots without heavy human loads. Experiments demonstrate that our noisy task consistency mechanism can be successfully applied to train a vacant space detector from scratch.

引用

页码：1346 / 1363

页数：18

共 42 条

[1] Parking Space Detection using Textural Descriptors
Almeida, Paulo
Oliveira, Luiz S.
Silva, Eunelson, Jr.
Britto, Alceu, Jr.
Koerich, Alessandro
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 3603 - 3608
[2] Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking
Dong, Xingping
Shen, Jianbing
Wang, Wenguan
Shao, Ling
Ling, Haibin
Porikli, Fatih
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1515 - 1529
[3] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
FISCHLER, MA
BOLLES, RC
[J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
[4] TROVE: A Context-Awareness Trust Model for VANETs Using Reinforcement Learning
Guo, Jingjing
Li, Xinghua
Liu, Zhiquan
Ma, Jianfeng
Yang, Chao
Zhang, Junwei
Wu, Dapeng
[J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6647 - 6662
[5] Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking
Guo, Minghao
Lu, Jiwen
Zhou, Jie
[J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 783 - 799
[6] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[7] Parking Space Status Inference Upon a Deep CNN and Multi-Task Contrastive Network With Spatial Transform
Hoang Tran Vu
Huang, Ching-Chun
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1194 - 1208
[8] Vacant Parking Space Detection Based on a Multilayer Inference Framework
Huang, Ching-Chun
Hoang Tran Vu
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (09) : 2041 - 2054
[9] Vacant Parking Space Detection Based on Plane-based Bayesian Hierarchical Framework
Huang, Ching-Chun
Tai, Yu-Shu
Wang, Sheng-Jyh
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) : 1598 - 1610
[10] A Hierarchical Bayesian Generation Framework for Vacant Parking Space Detection
Huang, Ching-Chun
Wang, Sheng-Jyh
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (12) : 1770 - 1785

← 1 2 3 4 5 →