Infer-HiRes: Accelerating Inference for High-Resolution Images with Quantization and Distributed Deep Learning

被引:0
|
作者
Gulhane, Radha [1 ]
Anthony, Quentin [1 ]
Shafi, Aamir [1 ]
Subramoni, Hari [1 ]
Panda, Dhabaleswar K. [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
来源
PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2024, PEARC 2024 | 2024年
关键词
Inference; Quantization; High-Resolution Images; Distributed Deep Learning;
D O I
10.1145/3626203.3670548
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-Resolution Images are being used in various applications, including Medical Imaging, Satellite Imagery, and Surveillance. Due to the evolution of Deep Learning (DL) and its widespread usage, it has also become a prominent choice for high-resolution image applications. But, large image sizes and denser convolutional neural networks pose limitations over computation and memory requirements. To overcome these challenges, several studies have discussed efficient approaches to accelerate training, but the inference of high-resolution images with deep learning and quantization techniques remains unexplored. In this paper, we propose accelerated and memory efficient inference techniques leveraging quantization techniques to reduce the memory and computation requirements while maintaining accuracy. Furthermore, we utilize different parallelism for Distributed DL to enable inference for high-resolution images on out-of-core models. We demonstrate an average 6.5x speedup and 4.55x memory reduction with a single GPU using INT8 quantization. By utilizing Distributed DL, we enabled inference for scaled images, achieving an average 1.58x speedup and 1.57x memory reduction using half-precision. To the best of our knowledge, this paper is the first in the literature to focus on highresolution image inference using quantization with the support of Distributed DL.
引用
收藏
页数:9
相关论文
共 6 条
  • [1] ACCELERATING DISTRIBUTED DEEP LEARNING BY ADAPTIVE GRADIENT QUANTIZATION
    Guo, Jinrong
    Liu, Wantao
    Wang, Wang
    Han, Jizhong
    Li, Ruixuan
    Lu, Yijun
    Hu, Songlin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1603 - 1607
  • [2] Human emotion recognition based on facial expressions via deep learning on high-resolution images
    Yahia Said
    Mohammad Barr
    Multimedia Tools and Applications, 2021, 80 : 25241 - 25253
  • [3] Human emotion recognition based on facial expressions via deep learning on high-resolution images
    Said, Yahia
    Barr, Mohammad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 25241 - 25253
  • [4] Learning to Semantically Segment High-Resolution Remote Sensing Images
    Nogueira, Keiller
    Dalla Mura, Mauro
    Chanussot, Jocelyn
    Schwartz, William Robson
    dos Santos, Jefersson A.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3566 - 3571
  • [5] SUPER-RESOLUTION OF LARGE VOLUMES OF SENTINEL-2 IMAGES WITH HIGH PERFORMANCE DISTRIBUTED DEEP LEARNING
    Zhang, Run
    Cavallaro, Gabriele
    Jitsev, Jenia
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 617 - 620
  • [6] Machine learning approach to extract building footprint from high-resolution images: the case study of Makkah, Saudi Arabia
    Faisal, Kamil
    Imam, Ayman
    Majrashi, Abdulrahman
    Hegazy, Ibrahim
    INTERNATIONAL JOURNAL OF LOW-CARBON TECHNOLOGIES, 2021, 16 (02) : 655 - 663