In-Place Activated BatchNorm for Memory-Optimized Training of DNNs

Cited by: 182
Authors
Bulo, Samuel Rota [1]
Porzi, Lorenzo [1]
Kontschieder, Peter [1]
Affiliations
[1] Mapillary Res, Graz, Austria
DOI
10.1109/CVPR.2018.00591
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work we present In-Place Activated Batch Normalization (INPLACE-ABN) - a novel approach to drastically reduce the training memory footprint of modern deep neural networks in a computationally efficient way. Our solution substitutes the conventionally used succession of BatchNorm + Activation layers with a single plugin layer, hence avoiding invasive framework surgery while providing straightforward applicability for existing deep learning frameworks. We obtain memory savings of up to 50% by dropping intermediate results and by recovering required information during the backward pass through the inversion of stored forward results, with only a minor increase (0.8-2%) in computation time. Also, we demonstrate how frequently used checkpointing approaches can be made computationally as efficient as INPLACE-ABN. In our experiments on image classification, we demonstrate on-par results on ImageNet-1k with state-of-the-art approaches. On the memory-demanding task of semantic segmentation, we report competitive results for COCO-Stuff and set new state-of-the-art results for Cityscapes and Mapillary Vistas. Code can be found at https://github.com/mapillary/inplace_abn.
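The recovery-by-inversion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the activation is a leaky ReLU with a nonzero slope and that the BatchNorm scale `gamma` is nonzero, so that both steps are invertible. The function names `abn_forward` and `recover_normalized` are hypothetical and exist only for this illustration.

```python
import math


def abn_forward(x, gamma, beta, slope=0.01, eps=1e-5):
    """Normalize a 1-D batch, apply scale/shift, then leaky ReLU.

    Only the final output z is returned; the normalized values are
    deliberately not kept, mimicking the dropped intermediate results.
    """
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    z = []
    for v in x:
        x_hat = (v - mean) * inv_std        # normalized activation
        y = gamma * x_hat + beta            # affine scale and shift
        z.append(y if y >= 0 else slope * y)  # leaky ReLU
    return z


def recover_normalized(z, gamma, beta, slope=0.01):
    """Rebuild the normalized activations from the stored output z.

    Assumes slope > 0 and gamma != 0 (illustrative assumptions), so the
    leaky ReLU and the affine step can both be inverted.
    """
    if gamma == 0 or slope <= 0:
        raise ValueError("inversion needs gamma != 0 and slope > 0")
    x_hat = []
    for v in z:
        y = v if v >= 0 else v / slope      # invert leaky ReLU
        x_hat.append((y - beta) / gamma)    # invert scale and shift
    return x_hat


if __name__ == "__main__":
    batch = [0.5, -1.0, 2.0, 3.5]
    out = abn_forward(batch, gamma=1.5, beta=-0.2)
    print(recover_normalized(out, gamma=1.5, beta=-0.2))
```

In this sketch the backward pass would need only `z`, `gamma`, and `beta` to rebuild the normalized activations, which is why the intermediate buffers can be discarded at the cost of a little extra arithmetic.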
Pages: 5639-5647
Page count: 9