Event-LSTM: An Unsupervised and Asynchronous Learning-Based Representation for Event-Based Data

被引:10
作者
Annamalai, Lakshmi [1 ,2 ]
Ramanathan, Vignesh [2 ]
Thakur, Chetan Singh [2 ]
机构
[1] Defence Res & Dev Org, Bangalore 560093, Karnataka, India
[2] Indian Inst Sci, Bangalore 560093, Karnataka, India
关键词
Deep learning methods; deep learning for visual perception; representation learning; event camera; LSTM;
D O I
10.1109/LRA.2022.3151426
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Event cameras are activity-driven bio-inspired vision sensors that respond asynchronously to intensity changes resulting in sparse data known as events. It has potential advantages over conventional cameras, such as high temporal resolution, low latency, and low power consumption. Given the sparse and asynchronous spatio-temporal nature of the data, event processing is predominantly solved by transforming events into a 2D spatial grid representation and applying standard vision pipelines. In this work, we propose an auto-encoder architecture named as Event-LSTM to generate 2D spatial grid representation. Ours has the following main advantages 1) Unsupervised, task-agnostic learning of 2D spatial grid. Ours is ideally suited for the event domain, where task-specific labeled data is scarce, 2) Asynchronous sampling of event 2D spatial grid. This leads to speed invariant and energy-efficient representation. Evaluations on appearance-based and motion-based tasks demonstrate that our approach yields improvement over state-of-the-art techniques while providing the flexibility to learn spatial grid representation from unlabelled data.
引用
收藏
页码:4678 / 4685
页数:8
相关论文
共 39 条
[1]   ACE: An Efficient Asynchronous Corner Tracker for Event Cameras [J].
Alzugaray, Ignacio ;
Chli, Margarita .
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, :653-661
[2]   A Low Power, Fully Event-Based Gesture Recognition System [J].
Amir, Arnon ;
Taba, Brian ;
Berg, David ;
Melano, Timothy ;
McKinstry, Jeffrey ;
Di Nolfo, Carmelo ;
Nayak, Tapan ;
Andreopoulos, Alexander ;
Garreau, Guillaume ;
Mendoza, Marcela ;
Kusnitz, Jeff ;
Debole, Michael ;
Esser, Steve ;
Delbruck, Tobi ;
Flickner, Myron ;
Modha, Dharmendra .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7388-7397
[3]   Graph-Based Spatio-Temporal Feature Learning for Neuromorphic Vision Sensing [J].
Bi, Yin ;
Chadha, Aaron ;
Abbas, Alhabib ;
Bourtsoulatze, Eirina ;
Andreopoulos, Yiannis .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :9084-9098
[4]   Graph-Based Object Classification for Neuromorphic Vision Sensing [J].
Bi, Yin ;
Chadha, Aaron ;
Abbas, Alhabib ;
Bourtsoulatze, Eirina ;
Andreopoulos, Yiannis .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :491-501
[5]   DHP19: Dynamic Vision Sensor 3D Human Pose Dataset [J].
Calabrese, Enrico ;
Taverni, Gemma ;
Easthope, Christopher Awai ;
Skriabine, Sophie ;
Corradi, Federico ;
Longinotti, Luca ;
Eng, Kynan ;
Delbruck, Tobi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1695-1704
[6]   A Differentiable Recurrent Surface for Asynchronous Event-Based Data [J].
Cannici, Marco ;
Ciccone, Marco ;
Romanoni, Andrea ;
Matteucci, Matteo .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :136-152
[7]   Attention Mechanisms for Object Recognition with Event-Based Cameras [J].
Cannici, Marco ;
Ciccone, Marco ;
Romanoni, Andrea ;
Matteucci, Matteo .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1127-1136
[8]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[9]   AutoAugment: Learning Augmentation Strategies from Data [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Mane, Dandelion ;
Vasudevan, Vijay ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :113-123
[10]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497