Automatic Learning Rate Adaption for Memristive Deep Learning Systems

被引:2
作者
Zhang, Yang [1 ,2 ,3 ]
Shen, Linlin [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Comp Vis Inst, Sch Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[3] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Memristors; Adaptive learning; Neural networks; Adaptation models; Image recognition; Computer architecture; Tuning; Adaptive learning rate; deep learning (DL); fuzzy logic; image recognition; memristor; quantized neural network; DESIGN; ARCHITECTURE; MEMORY; MODEL;
D O I
10.1109/TNNLS.2023.3244006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a possible device to further enhance the performance of the hybrid complementary metal oxide semiconductor (CMOS) technology in the hardware, the memristor has attracted widespread attention in implementing efficient and compact deep learning (DL) systems. In this study, an automatic learning rate tuning method for memristive DL systems is presented. Memristive devices are utilized to adjust the adaptive learning rate in deep neural networks (DNNs). The speed of the learning rate adaptation process is fast at first and then becomes slow, which consist of the memristance or conductance adjustment process of the memristors. As a result, no manual tuning of learning rates is required in the adaptive back propagation (BP) algorithm. While cycle-to-cycle and device-to-device variations could be a significant issue in memristive DL systems, the proposed method appears robust to noisy gradients, various architectures, and different datasets. Moreover, fuzzy control methods for adaptive learning are presented for pattern recognition, such that the over-fitting issue can be well addressed. To our best knowledge, this is the first memristive DL system using an adaptive learning rate for image recognition. Another highlight of the presented memristive adaptive DL system is that quantized neural network architecture is utilized, and there is therefore a significant increase in the training efficiency, without the loss of testing accuracy.
引用
收藏
页码:10791 / 10802
页数:12
相关论文
共 71 条
[1]   YodaNN: An Architecture for Ultralow Power Binary-Weight CNN Acceleration [J].
Andri, Renzo ;
Cavigelli, Lukas ;
Rossi, Davide ;
Benini, Luca .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (01) :48-60
[2]   A Hybrid CMOS-Memristor Neuromorphic Synapse [J].
Azghadi, Mostafa Rahimi ;
Linares-Barranco, Bernabe ;
Abbott, Derek ;
Leong, Philip H. W. .
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2017, 11 (02) :434-445
[3]   Neuromorphic Hardware System for Visual Pattern Recognition With Memristor Array and CMOS Neuron [J].
Chu, Myonglae ;
Kim, Byoungho ;
Park, Sangsu ;
Hwang, Hyunsang ;
Jeon, Moongu ;
Lee, Byoung Hun ;
Lee, Byung-Geun .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (04) :2410-2419
[4]   MEMRISTOR - MISSING CIRCUIT ELEMENT [J].
CHUA, LO .
IEEE TRANSACTIONS ON CIRCUIT THEORY, 1971, CT18 (05) :507-+
[5]   Light-tuned selective photosynthesis of azo- and azoxy-aromatics using graphitic C3N4 [J].
Dai, Yitao ;
Li, Chao ;
Shen, Yanbin ;
Lim, Tingbin ;
Xu, Jian ;
Li, Yongwang ;
Niemantsverdriet, Hans ;
Besenbacher, Flemming ;
Lock, Nina ;
Su, Ren .
NATURE COMMUNICATIONS, 2018, 9
[6]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[7]   Demonstration of Convolution Kernel Operation on Resistive Cross-Point Array [J].
Gao, Ligang ;
Chen, Pai-Yu ;
Yu, Shimeng .
IEEE ELECTRON DEVICE LETTERS, 2016, 37 (07) :870-873
[8]   Learning Deep Gradient Descent Optimization for Image Deconvolution [J].
Gong, Dong ;
Zhang, Zhen ;
Shi, Qinfeng ;
van den Hengel, Anton ;
Shen, Chunhua ;
Zhang, Yanning .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) :5468-5482
[9]   A SPICE Compact Model of Metal Oxide Resistive Switching Memory With Variations [J].
Guan, Ximeng ;
Yu, Shimeng ;
Wong, H. -S. Philip .
IEEE ELECTRON DEVICE LETTERS, 2012, 33 (10) :1405-1407
[10]   Rearrangement of 1D Conducting Nanomaterials towards Highly Electrically Conducting Nanocomposite Fibres for Electronic Textiles [J].
Han, Joong Tark ;
Choi, Sua ;
Jang, Jeong In ;
Seol, Seung Kwon ;
Woo, Jong Seok ;
Jeong, Hee Jin ;
Jeong, Seung Yol ;
Baeg, Kang-Jun ;
Lee, Geon-Woong .
SCIENTIFIC REPORTS, 2015, 5