Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards

被引:0
作者
Nguyen, Manh-Hung [1 ]
Chao, Tzu-Yin [2 ]
Hsiao, Ching-Chun [2 ]
Li, Yung-Hui [3 ]
Huang, Ching-Chun [2 ]
机构
[1] HCM Univ Technol & Educ, Fac Elect & Elect Engn, Ho Chi Minh City 700000, Vietnam
[2] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 300093, Taiwan
[3] Hon Hai Res Inst, AI Res Ctr, New Taipei City 114699, Taiwan
关键词
Task-consistency; reinforcement learning; learn from corrupted rewards; domain transfer; MODEL; INFERENCE;
D O I
10.1109/TITS.2023.3319531
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper proposes a novel task-consistency learning method that enables us to train a vacant space detection network (target task) based on the logic consistency with the semantic outcomes from a flow-based motion behavior classifier (source task) in a parking lot. Note that the source task can introduce false detection during task-consistency learning, which implies noisy rewards or supervision. The target network can be trained in a reinforcement learning setting by appropriately designing the reward mechanism upon semantic consistency. We also introduce a novel symmetric constraint to detect corrupted samples and reduce the effect of noisy rewards. Unlike conventional corrupted learning methods that use only training losses to identify corrupted samples, our symmetric constraint also explores the relationship among training samples to improve performance. Compared with conventional supervised detection methods, the main contribution of our work is the ability to learn a vacant space detector via semantic consistency rather than supervised labels. The dynamic learning property allows the proposed detector to be easily deployed and updated in various lots without heavy human loads. Experiments demonstrate that our noisy task consistency mechanism can be successfully applied to train a vacant space detector from scratch.
引用
收藏
页码:1346 / 1363
页数:18
相关论文
共 42 条
  • [1] Parking Space Detection using Textural Descriptors
    Almeida, Paulo
    Oliveira, Luiz S.
    Silva, Eunelson, Jr.
    Britto, Alceu, Jr.
    Koerich, Alessandro
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 3603 - 3608
  • [2] Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking
    Dong, Xingping
    Shen, Jianbing
    Wang, Wenguan
    Shao, Ling
    Ling, Haibin
    Porikli, Fatih
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1515 - 1529
  • [3] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
    FISCHLER, MA
    BOLLES, RC
    [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
  • [4] TROVE: A Context-Awareness Trust Model for VANETs Using Reinforcement Learning
    Guo, Jingjing
    Li, Xinghua
    Liu, Zhiquan
    Ma, Jianfeng
    Yang, Chao
    Zhang, Junwei
    Wu, Dapeng
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6647 - 6662
  • [5] Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking
    Guo, Minghao
    Lu, Jiwen
    Zhou, Jie
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 783 - 799
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] Parking Space Status Inference Upon a Deep CNN and Multi-Task Contrastive Network With Spatial Transform
    Hoang Tran Vu
    Huang, Ching-Chun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1194 - 1208
  • [8] Vacant Parking Space Detection Based on a Multilayer Inference Framework
    Huang, Ching-Chun
    Hoang Tran Vu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (09) : 2041 - 2054
  • [9] Vacant Parking Space Detection Based on Plane-based Bayesian Hierarchical Framework
    Huang, Ching-Chun
    Tai, Yu-Shu
    Wang, Sheng-Jyh
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) : 1598 - 1610
  • [10] A Hierarchical Bayesian Generation Framework for Vacant Parking Space Detection
    Huang, Ching-Chun
    Wang, Sheng-Jyh
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (12) : 1770 - 1785