Towards Out-of-core Neural Networks on Microcontrollers

被引:0
|
作者
Miao, Hongyu [1 ]
Lin, Felix Xiaozhu [2 ]
机构
[1] Purdue ECE, W Lafayette, IN 47907 USA
[2] Univ Virginia, Charlottesville, VA 22903 USA
来源
2022 IEEE/ACM 7TH SYMPOSIUM ON EDGE COMPUTING (SEC 2022) | 2022年
基金
美国国家科学基金会;
关键词
tinyML; Edge Computing; On-device Machine; Learning;
D O I
10.1109/SEC54971.2022.00008
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To run neural networks (NNs) on microcontroller units (MCUs), memory size is the major constraint. While algorithm-level techniques exist to reduce NN memory footprints, the resultant losses in NN accuracy and generality disqualify MCUs for many important use cases. To address the constraint, we investigate out-of-core execution of NNs on MCUs: dynamically swapping NN data tiles between an MCU's small SRAM and its large, low-cost external flash. Accordingly, we present a scheduler design that automatically schedules compute tasks and swapping IO tasks in order to minimize the IO overhead in swapping. Out-of-core NNs on MCUs raise multiple concerns: execution slowdown, storage wear out, energy consumption, and data security. Our empirical study shows that none of these concerns is a showstopper; the key benefit - MCUs being able to run large NNs with full accuracy/generality - trumps the overheads. Our findings suggest that MCUs can play a much greater role in edge intelligence.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] Towards Out-of-Core Simulators for Quantum Computing
    Trummer, Immanuel
    PROCEEDINGS OF THE 1ST WORKSHOP ON QUANTUM COMPUTING AND QUANTUM-INSPIRED TECHNOLOGY FOR DATA-INTENSIVE SYSTEMS AND APPLICATIONS, Q-DATA, CO-LOCATED WITH ACM INTERNATIONAL CONFERENCE ON DATA MANAGEMENT, SIGMOD, 2024, : 1 - 1
  • [2] POSTER: Profiling based out-of-core Hybrid method for large neural networks
    Ito, Yuki
    Imai, Haruki
    Tung Le Duc
    Negishi, Yasushi
    Kawachiya, Kiyokuni
    Matsumiya, Ryo
    Endo, Toshio
    PROCEEDINGS OF THE 24TH SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '19), 2019, : 399 - 400
  • [3] MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks
    Waleffe, Roger
    Mohoney, Jason
    Rekatsinas, Theodoros
    Venkataraman, Shivaram
    PROCEEDINGS OF THE EIGHTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, EUROSYS 2023, 2023, : 144 - 161
  • [4] GAMER with out-of-core computation
    Schive, Hsi-Yu
    Tsai, Yu-Chih
    Chiueh, Tzihong
    COMPUTATIONAL STAR FORMATION, 2011, (270): : 401 - 405
  • [5] A Cholesky out-of-core factorization
    Castellanos, J. A.
    Larrazabal, G.
    MATHEMATICAL AND COMPUTER MODELLING, 2013, 57 (9-10) : 2207 - 2222
  • [6] Out-of-core mls reconstruction
    Fiorin, Valentino
    Cignoni, Paolo
    Scopigno, Roberto
    PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS AND IMAGING, 2007, : 27 - 34
  • [7] Out-of-core segmentation by deformable models
    Giraldi, G
    Schaefer, L
    Farias, R
    Silva, R
    FUZZY LOGIC AND APPLICATIONS, 2006, 2955 : 216 - 223
  • [8] Amy files for out-of-core computations
    Zhang, Y
    Apon, A
    Pulay, P
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 191 - 197
  • [9] Large out-of-core tetrahedral meshing
    Alleaume, Aurelien
    Francez, Laurent
    Loriot, Alark
    Maman, Nathan
    PROCEEDINGS OF THE 16TH INTERNATIONAL MESHING ROUNDTABLE, 2008, : 461 - +
  • [10] Interactive out-of-core isosurface extraction
    Chiang, YJ
    Silva, CT
    Schroeder, WJ
    VISUALIZATION '98, PROCEEDINGS, 1998, : 167 - +