Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition

被引:1
|
作者
Sanchez-Garcia, Melani [1 ]
Chauhan, Tushar [2 ,3 ]
Cottereau, Benoit R. [3 ,4 ]
Beyeler, Michael [1 ,5 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[2] MIT, Picower Inst Learning & Memory, Dept Brain & Cognit Sci, Boston, MA USA
[3] Univ Toulouse III Paul Sabatier, CerCo CNRS UMR5549, Toulouse, France
[4] IPAL, CNRS IRL 2955, Singapore, Singapore
[5] Univ Calif Santa Barbara, Dept Psychol & Brain Sci, Santa Barbara, CA USA
关键词
Spiking neural networks; Spike-timing-dependent-plasticity; Multi-scale processing; Spike-latency code; Winner-take-all inhibition; TIMING-DEPENDENT PLASTICITY; RETINAL GANGLION-CELLS; RESPONSE PROPERTIES; FREQUENCY; CATEGORIZATION; STATISTICS; NETWORK; CORTEX;
D O I
10.1007/s00422-023-00956-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks have surpassed human performance in key visual challenges such as object recognition, but require a large amount of energy, computation, and memory. In contrast, spiking neural networks (SNNs) have the potential to improve both the efficiency and biological plausibility of object recognition systems. Here we present a SNN model that uses spike-latency coding and winner-take-all inhibition (WTA-I) to efficiently represent visual stimuli using multi-scale parallel processing. Mimicking neuronal response properties in early visual cortex, images were preprocessed with three different spatial frequency (SF) channels, before they were fed to a layer of spiking neurons whose synaptic weights were updated using spike-timing-dependent-plasticity. We investigate how the quality of the represented objects changes under different SF bands and WTA-I schemes. We demonstrate that a network of 200 spiking neurons tuned to three SFs can efficiently represent objects with as little as 15 spikes per neuron. Studying how core object recognition may be implemented using biologically plausible learning rules in SNNs may not only further our understanding of the brain, but also lead to novel and efficient artificial vision systems.
引用
收藏
页码:95 / 111
页数:17
相关论文
共 2 条
  • [1] Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition
    Melani Sanchez-Garcia
    Tushar Chauhan
    Benoit R. Cottereau
    Michael Beyeler
    Biological Cybernetics, 2023, 117 : 95 - 111
  • [2] Multi-Scale Super-Resolution Reconstruction via a Deep Winner-Take-All Cascaded NetWork
    Wang, Wei
    Wang, Fei
    Qiu, Zhiliang
    Jin, Ruizhi
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC), 2018, : 352 - 359