Ferroelectric ternary content-addressable memory for one-shot learning

被引:285
作者
Ni, Kai [1 ]
Yin, Xunzhao [1 ]
Laguna, Ann Franchesca [1 ]
Joshi, Siddharth [1 ]
Duenkel, Stefan [2 ]
Trentzsch, Martin [2 ]
Mueeller, Johannes [2 ]
Beyer, Sven [2 ]
Niemier, Michael [1 ]
Hu, Xiaobo Sharon [1 ]
Datta, Suman [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] GLOBALFOUNDRIES Fab1 LLC & Co KG, Dresden, Germany
关键词
SEARCH-TIME; TCAM; CELL;
D O I
10.1038/s41928-019-0321-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep neural networks are efficient at learning from large sets of labelled data, but struggle to adapt to previously unseen data. In pursuit of generalized artificial intelligence, one approach is to augment neural networks with an attentional memory so that they can draw on already learnt knowledge patterns and adapt to new but similar tasks. In current implementations of such memory augmented neural networks (MANNs), the content of a network's memory is typically transferred from the memory to the compute unit (a central processing unit or graphics processing unit) to calculate similarity or distance norms. The processing unit hardware incurs substantial energy and latency penalties associated with transferring the data from the memory and updating the data at random memory addresses. Here, we show that ternary content-addressable memories (TCAMs) can be used as attentional memories, in which the distance between a query vector and each stored entry is computed within the memory itself, thus avoiding data transfer. Our compact and energy-efficient TCAM cell is based on two ferroelectric field-effect transistors. We evaluate the performance of our ferroelectric TCAM array prototype for one- and few-shot learning applications. When compared with a MANN where cosine distance calculations are performed on a graphics processing unit, the ferroelectric TCAM approach provides a 60-fold reduction in energy and 2,700-fold reduction in latency for a single memory search operation.
引用
收藏
页码:521 / 529
页数:9
相关论文
共 51 条
[1]  
Ahn S. J., 2005, 2004 International Electron Devices Meeting (IEEE Cat. No.04CH37602), P907
[2]  
Andoni A, 2006, ANN IEEE SYMP FOUND, P459
[3]   A 0.7-fJ/bit/search 2.2-ns search time hybrid-type TCAM architecture [J].
Choi, S ;
Sohn, K ;
Yoo, HJ .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2005, 40 (01) :254-260
[4]  
Dong Q, 2017, ISSCC DIG TECH PAP I, P198, DOI 10.1109/ISSCC.2017.7870329
[5]  
Dunkel S., 2018, 2017 IEEE INT EL DEV, P485
[6]  
Fedorov VV, 2014, PR IEEE COMP DESIGN, P55, DOI 10.1109/ICCD.2014.6974662
[7]  
Franklin, 2017, NVIDIA Accelerated Computing| Parallel Forall
[8]  
Govoreanu B., 2012, 2011 INT EL DEV M, P729
[9]  
Graves A., 2014, Generating sequences with recurrent neural networks
[10]   Hybrid computing using a neural network with dynamic external memory [J].
Graves, Alex ;
Wayne, Greg ;
Eynolds, Malcolm R. ;
Harley, Tim ;
Danihelka, Ivo ;
Grabska-Barwinska, Agnieszka ;
Colmenarejo, Sergio Gomez ;
Grefenstette, Edward ;
Amalho, Tiago R. ;
Agapiou, John ;
Badia, Adria Puigdomenech ;
Hermann, Karl Moritz ;
Zwols, Yori ;
Strovski, Georg O. ;
Ain, Adam C. ;
King, Helen ;
Summerfield, Christopher ;
Lunsom, Phil B. ;
Kavukcuoglu, Koray ;
Hassabis, Demis .
NATURE, 2016, 538 (7626) :471-+