Evaluating the Power Efficiency of Deep Learning Inference on Embedded GPU Systems

被引:0
作者
Rungsuptaweekoon, Kanokwan [1 ]
Visoottiviseth, Vasaka [1 ]
Takano, Ryousei [2 ]
机构
[1] Mahidol Univ, Fac Informat & Commun Technol, Bangkok, Thailand
[2] Natl Inst Adv Ind Sci & Technol, Informat Technol Res Inst, Tsukuba, Ibaraki, Japan
来源
2017 2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (INCIT) | 2017年
关键词
low-power image recognition; object detection; power efficiency; embedded GPU system; deep learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning inference on embedded systems requires not only high throughput but also low power consumption. To address this challenge, this paper evaluates the power efficiency of image recognition with YOLO, a real-time object detection algorithm, on the latest NVIDIA embedded GPU systems: Jetson TX1 and TX2. For this evaluation, we deployed the Low-Power Image Recognition Challenge (LPIRC) system and integrated YOLO, a power meter, and target hardware into the system. The experimental results show that Jetson TX2 with Max-N mode has the highest throughput; Jetson TX2 with Max Q mode has the highest power efficiency. These findings indicate it is possible to adjust the trade-off relationship of throughput and power efficiency in Jetson TX2. Therefore, Jetson TX2 has advantages for image recognition on embedded systems more than Jetson TM and a PC server with NVIDIA Tesla P40.
引用
收藏
页码:230 / 234
页数:5
相关论文
共 10 条
[1]  
[Anonymous], Yolo: Real-time object detection
[2]  
[Anonymous], 2016, NVIDIA TESL ACC DEEP
[3]  
[Anonymous], 2017, Nvidia Jetson TX2 Delivers Twice the Intelligence to the Edge
[4]  
Gauen K, 2017, ASIA S PACIF DES AUT, P99, DOI 10.1109/ASPDAC.2017.7858303
[5]  
Joseph RK, 2016, CRIT POL ECON S ASIA, P1
[6]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[7]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[8]  
Redmon J., 2016, Proceedings of the IEEE conference on computer vision and pattern recognition, P779, DOI [DOI 10.1109/CVPR.2016.91, 10.1109/CVPR.2016.91]
[9]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[10]  
Rosebrock A., 2016, Intersection over union (iou) for object detection