Core Interface Optimization for Multi-core Neuromorphic Processors

被引:1
作者
Su, Zhe [1 ,2 ]
Hwang, Hyunjung [2 ]
Torchet, Tristan [2 ]
Indiveri, Giacomo [1 ,2 ]
机构
[1] Univ Zurich, Inst Neuroinformat, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Zurich, Switzerland
来源
2023 28TH IEEE INTERNATIONAL SYMPOSIUM ON ASYNCHRONOUS CIRCUITS AND SYSTEMS, ASYNC | 2023年
基金
欧洲研究理事会;
关键词
Multi-core neuromorphic processors; core interface; arbitration architecture; asynchronous CAM; CIRCUITS; ARCHITECTURE;
D O I
10.1109/ASYNC58294.2023.10239574
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hardware implementations of Spiking Neural Networks (SNNs) represent a promising approach to edge-computing for applications that require low-power and low-latency, and which cannot resort to external cloud-based computing services. However, most solutions proposed so far either support only relatively small networks, or take up significant hardware resources, to implement large networks. To realize large-scale and scalable SNNs it is necessary to develop an efficient asynchronous communication and routing fabric that enables the design of multi-core architectures. In particular the core interface that manages inter-core spike communication is a crucial component as it represents the bottleneck of Power-Performance-Area (PPA) especially for the arbitration architecture and the routing memory. In this paper we present an arbitration mechanism with the corresponding asynchronous encoding pipeline circuits, based on hierarchical arbiter trees. The proposed scheme reduces the latency by more than 70% in sparse-event mode, compared to the state-of-the-art arbitration architectures, with lower area cost. The routing memory makes use of asynchronous Content Addressable Memory (CAM) with Current Sensing Completion Detection (CSCD), which saves approximately 46% energy, and achieves a 40% increase in throughput against conventional asynchronous CAM using configurable delay lines, at the cost of only a slight increase in area. In addition as it radically reduces the core interface resources in multi-core neuromorphic processors, the arbitration architecture and CAM architecture we propose can be also applied to a wide range of general asynchronous circuits and systems.
引用
收藏
页码:89 / 98
页数:10
相关论文
共 18 条
[1]   Cost-Effective and Flexible Asynchronous Interconnect Technology for GALS Systems [J].
Bertozzi, Davide ;
Miorandi, Gabriele ;
Ghiribaldi, Alberto ;
Burleson, Wayne ;
Sadowski, Greg ;
Bhardwaj, Kshitij ;
Jiang, Weiwei ;
Nowick, Steven M. .
IEEE MICRO, 2021, 41 (01) :69-81
[2]   Point-to-point connectivity between neuromorphic chips using address events [J].
Boahen, KA .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2000, 47 (05) :416-434
[3]   Neuromorphic Electronic Circuits for Building Autonomous Cognitive Systems [J].
Chicca, Elisabetta ;
Stefanini, Fabio ;
Bartolozzi, Chiara ;
Indiveri, Giacomo .
PROCEEDINGS OF THE IEEE, 2014, 102 (09) :1367-1388
[4]   Loihi: A Neuromorphic Manycore Processor with On-Chip Learning [J].
Davies, Mike ;
Srinivasa, Narayan ;
Lin, Tsung-Han ;
Chinya, Gautham ;
Cao, Yongqiang ;
Choday, Sri Harsha ;
Dimou, Georgios ;
Joshi, Prasad ;
Imam, Nabil ;
Jain, Shweta ;
Liao, Yuyun ;
Lin, Chit-Kwan ;
Lines, Andrew ;
Liu, Ruokun ;
Mathaikutty, Deepak ;
Mccoy, Steve ;
Paul, Arnab ;
Tse, Jonathan ;
Venkataramanan, Guruguhanathan ;
Weng, Yi-Hsin ;
Wild, Andreas ;
Yang, Yoonseok ;
Wang, Hong .
IEEE MICRO, 2018, 38 (01) :82-99
[5]   SELF-TIMED LOGIC USING CURRENT-SENSING COMPLETION DETECTION (CSCD) [J].
DEAN, ME ;
DILL, DL ;
HOROWITZ, M .
JOURNAL OF VLSI SIGNAL PROCESSING, 1994, 7 (1-2) :7-16
[6]  
Ghiribaldi A, 2013, DES AUT TEST EUROPE, P332
[7]   An Asynchronous Bundled-Data Template With Current Sensing Completion Detection Technique [J].
Huang, Yuhao ;
Xiao, Shanlin ;
Li, Zhiyu ;
Yu, Zhiyi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (09) :3904-3908
[8]   A High-Throughput Asynchronous Multi-Resource Arbiter Using a Pipelined Assignment Approach [J].
Jiang, Weiwei ;
Nowick, Steven M. .
2017 23RD IEEE INTERNATIONAL SYMPOSIUM ON ASYNCHRONOUS CIRCUITS AND SYSTEMS (ASYNC), 2017, :1-9
[9]   A 65-nm 0.6-fJ/Bit/Search Ternary Content Addressable Memory Using an Adaptive Match-Line Discharge [J].
Lee, Kyeongho ;
Choi, Woong ;
Park, Jongsun .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2021, 56 (08) :2574-2584
[10]  
Leite Vanessa R. C., 2022, 2022 IEEE Biomedical Circuits and Systems Conference (BioCAS), P364, DOI 10.1109/BioCAS54905.2022.9948653