Mapping from Frame-Driven to Frame-Free Event-Driven Vision Systems by Low-Rate Rate Coding and Coincidence Processing-Application to Feedforward ConvNets

被引:240
作者
Antonio Perez-Carrasco, Jose [1 ]
Zhao, Bo [2 ]
Serrano, Carmen [1 ]
Acha, Begona [1 ]
Serrano-Gotarredona, Teresa [3 ]
Chen, Shouchun [2 ]
Linares-Barranco, Bernabe [3 ]
机构
[1] Univ Seville, Dept Teoria Senal & Comunicac, Escuela Super Ingn, Seville 41092, Spain
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Inst Microelect Sevilla IMSE CNM CSIC, Seville 41092, Spain
关键词
Feature extraction; convolutional neural networks; object recognition; spiking neural networks; event-driven neural networks; bioinspired vision; high speed vision; OPTIC-NERVE SIGNALS; ON-CHIP; FACE DETECTION; RETINA; LATENCY;
D O I
10.1109/TPAMI.2013.71
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event-driven visual sensors have attracted interest from a number of different research communities. They provide visual information in quite a different way from conventional video systems consisting of sequences of still images rendered at a given "frame rate." Event-driven vision sensors take inspiration from biology. Each pixel sends out an event (spike) when it senses something meaningful is happening, without any notion of a frame. A special type of event-driven sensor is the so-called dynamic vision sensor (DVS) where each pixel computes relative changes of light or "temporal contrast." The sensor output consists of a continuous flow of pixel events that represent the moving objects in the scene. Pixel events become available with microsecond delays with respect to "reality." These events can be processed "as they flow" by a cascade of event (convolution) processors. As a result, input and output event flows are practically coincident in time, and objects can be recognized as soon as the sensor provides enough meaningful events. In this paper, we present a methodology for mapping from a properly trained neural network in a conventional frame-driven representation to an event-driven representation. The method is illustrated by studying event-driven convolutional neural networks (ConvNet) trained to recognize rotating human silhouettes or high speed poker card symbols. The event-driven ConvNet is fed with recordings obtained from a real DVS camera. The event-driven ConvNet is simulated with a dedicated event-driven simulator and consists of a number of event-driven processing modules, the characteristics of which are obtained from individually manufactured hardware modules.
引用
收藏
页码:2706 / 2719
页数:14
相关论文
共 50 条
[1]  
[Anonymous], PRODOCUMENT RECOGNIT
[2]  
[Anonymous], P SNOWB LEARN WORKSH
[3]  
[Anonymous], IEEE T BIOM IN PRESS
[4]  
[Anonymous], 2011, NEUFLOW RUNTIME RECO, DOI DOI 10.1109/CVPRW.2011.5981829
[5]  
[Anonymous], 2002, Advances in Neural Information Processing Systems
[6]   A Five-Decade Dynamic-Range Ambient-Light-Independent Calibrated Signed-Spatial-Contrast AER Retina With 0.1-ms Latency and Optional Time-to-First-Spike Mode [J].
Antonio Lenero-Bardallo, Juan ;
Serrano-Gotarredona, Teresa ;
Linares-Barranco, Bernabe .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2010, 57 (10) :2632-2643
[7]  
Azadmehr M, 2005, IEEE INT SYMP CIRC S, P2751
[8]   Error-backpropagation in temporally encoded networks of spiking neurons [J].
Bohte, SM ;
Kok, JN ;
La Poutré, H .
NEUROCOMPUTING, 2002, 48 :17-37
[9]   A gradient descent rule for spiking neurons emitting multiple spikes [J].
Booij, O ;
Nguyen, HT .
INFORMATION PROCESSING LETTERS, 2005, 95 (06) :552-558
[10]  
Camuñas-Mesa L, 2010, IEEE INT SYMP CIRC S, P249, DOI 10.1109/ISCAS.2010.5537918