Label-Free Event-based Object Recognition via Joint Learning with Image Reconstruction from Events

被引:8
作者
Cho, Hoonhee [1 ]
Kim, Hyeonseong [1 ]
Chae, Yujeong [1 ]
Yoon, Kuk-Jin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
基金
新加坡国家研究基金会;
关键词
VISION;
D O I
10.1109/ICCV51070.2023.01819
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing objects from sparse and noisy events becomes extremely difficult when paired images and category labels do not exist. In this paper, we study label-free event-based object recognition where category labels and paired images are not available. To this end, we propose a joint formulation of object recognition and image reconstruction in a complementary manner. Our method first reconstructs images from events and performs object recognition through Contrastive Language-Image Pre-training (CLIP), enabling better recognition through a rich context of images. Since the category information is essential in reconstructing images, we propose category-guided attraction loss and category-agnostic repulsion loss to bridge the textual features of predicted categories and the visual features of reconstructed images using CLIP. Moreover, we introduce a reliable data sampling strategy and local-global reconstruction consistency to boost joint learning of two tasks. To enhance the accuracy of prediction and quality of reconstruction, we also propose a prototype-based approach using unpaired images. Extensive experiments demonstrate the superiority of our method and its extensibility for zero-shot object recognition. Our project code is available at https://github.com/Chohoonhee/Ev-LaFOR.
引用
收藏
页码:19809 / 19820
页数:12
相关论文
共 63 条
[1]  
Amir Arnon, 2017, P IEEE C COMP VIS PA, P7243
[2]  
[Anonymous], 2021, Front Neurosci, DOI [DOI 10.3389/FNINS.2015.00481, DOI 10.3389/FNINS]
[3]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.102
[4]   Mapping from Frame-Driven to Frame-Free Event-Driven Vision Systems by Low-Rate Rate Coding and Coincidence Processing-Application to Feedforward ConvNets [J].
Antonio Perez-Carrasco, Jose ;
Zhao, Bo ;
Serrano, Carmen ;
Acha, Begona ;
Serrano-Gotarredona, Teresa ;
Chen, Shouchun ;
Linares-Barranco, Bernabe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2706-2719
[5]   Graph-Based Object Classification for Neuromorphic Vision Sensing [J].
Bi, Yin ;
Chadha, Aaron ;
Abbas, Alhabib ;
Bourtsoulatze, Eirina ;
Andreopoulos, Yiannis .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :491-501
[6]   A Differentiable Recurrent Surface for Asynchronous Event-Based Data [J].
Cannici, Marco ;
Ciccone, Marco ;
Romanoni, Andrea ;
Matteucci, Matteo .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :136-152
[7]   Image-Based CLIP-Guided Essence Transfer [J].
Chefer, Hila ;
Benaim, Sagie ;
Paiss, Roni ;
Wolf, Lior .
COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 :695-711
[8]  
Chen Runnan, 2023, ARXIV230104926
[9]  
Cheng W., 2019, P IEEE CVF C COMP VI
[10]   Learning Adaptive Dense Event Stereo from the Image Domain [J].
Cho, Hoonhee ;
Cho, Jegyeong ;
Yoon, Kuk-Jin .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :17797-17807