A conditional branch predictor based on weightless neural networks

被引:3
作者
Villon, Luis A. Q. [1 ]
Susskind, Zachary [2 ]
Bacellar, Alan T. L. [1 ]
Miranda, Igor D. S. [5 ]
de Araujo, Leandro S. [3 ]
Lima, Priscila M. V. [1 ]
Breternitz, Mauricio [4 ]
John, Lizy K. [2 ]
Franca, Felipe M. G. [1 ,6 ]
Dutra, Diego L. C. [1 ]
机构
[1] Univ Fed Rio de Janeiro UFRJ, Rio De Janeiro, RJ, Brazil
[2] Univ Texas Austin, Austin, TX USA
[3] Univ Fed Fluminense UFF, Niteroi, RJ, Brazil
[4] Inst Univ Lisboa ISCTE IUL, ISTAR, Lisbon, Portugal
[5] Univ Fed Reconcavo Bahia UFRB, Salvador, BA, Brazil
[6] Inst Telecomunicacoes, Lisbon, Portugal
关键词
Weightless neural network; WiSARD; Branch prediction; Binary classification;
D O I
10.1016/j.neucom.2023.126637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conditional branch prediction allows the speculative fetching and execution of instructions before knowing the direction of conditional statements. As in other areas, machine learning techniques are a promising approach to building branch predictors, e.g., the Perceptron predictor. However, those traditional solutions demand large input sizes, which impose a considerable area overhead. We propose a conditional branch predictor based on the WiSARD (Wilkie, Stoneham, and Aleksander's Recognition Device) weightless neural network model. The WiSARD-based predictor implements one-shot online training designed to address branch prediction as a binary classification problem. We compare the WiSARD-based predictor with two state-of-the-art predictors: TAGESC-L (TAgged GEometric-Statistical Corrector-Loop) and the Multiperspective Perceptron. Our experimental evaluation shows that our proposed predictor, with a smaller input size, outperforms the perceptron-based predictor by about 0.09% and achieves similar accuracy to that of TAGE-SC-L. In addition, we perform an experimental sensitivity analysis to find the best predictor for each dataset, and based on these results, we designed new specialized predictors using a particular parameter composition for each dataset. The results show that the specialized WiSARD-based predictor outperforms the state-of-the-art by more than 2.3% in the best case. Furthermore, through the implementation of specialized predictor classifiers, we discovered that utilizing 90% of the specialized predictor for a specific dataset yielded comparable performance to the corresponding specialized predictor.
引用
收藏
页数:20
相关论文
共 36 条
[1]  
Aleksander I., 1984, Sensor Review, V4, P120, DOI 10.1108/eb007637
[2]  
Aleksander I., 2009, Esann, P299
[3]   Low-Power, High-Performance Analog Neural Branch Prediction [J].
Amant, Renee St. ;
Jimenez, Daniel A. ;
Burger, Doug .
2008 PROCEEDINGS OF THE 41ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE: MICRO-41, 2008, :447-+
[4]  
Bledsoe W.W., 1959, P E JOINT COMP C, P225, DOI [10.1145/1460299.1460326, DOI 10.1145/1460299.1460326]
[5]  
Penney DD, 2019, Arxiv, DOI arXiv:1909.12373
[6]  
Filho L.A.L., 2019, ESANN, P6
[7]   Evolution of the Samsung Exynos CPU Microarchitecture Industrial Product [J].
Grayson, Brian ;
Rupley, Jeff ;
Zuraski, Gerald, Jr. ;
Quinnell, Eric ;
Jimenez, Daniel A. ;
Nakra, Tarun ;
Kitchin, Paul ;
Hensley, Ryan ;
Brekelbaum, Edward ;
Sinha, Vikas ;
Ghiya, Ankit .
2020 ACM/IEEE 47TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2020), 2020, :40-51
[8]   Producing pattern examples from "mental" images [J].
Grieco, Bruno P. A. ;
Lima, Priscila M. V. ;
De Gregorio, Massimo ;
Franca, Felipe M. G. .
NEUROCOMPUTING, 2010, 73 (7-9) :1057-1064
[9]  
Harris S., 2021, Digital Design and Computer Architecture: RISC-V Edition
[10]  
Hennessy John L., 2011, Computer Architecture: A Quantitative Approach