共 9 条
Tinker: A Middleware for Deploying Multiple NN-Based Applications on a Single Machine
被引:2
|作者:
Wang, Chao
[1
]
Jin, Lihui
[1
]
Gong, Lei
[1
]
Xu, Chongchong
[1
]
Hu, Yahui
[1
]
Tan, Luchao
[1
]
Zhou, Xuehai
[1
]
机构:
[1] Univ Sci & Technol China, Sch Comp Sci, Hefei 230026, Peoples R China
来源:
基金:
美国国家科学基金会;
关键词:
Task analysis;
Graphics processing units;
Kernel;
System performance;
Runtime;
Machine learning;
Middleware;
Deep learning;
GPU;
inference;
middleware;
GPU;
D O I:
10.1109/TCAD.2020.3019981
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
Currently, deep learning technology is widely used in various fields, such as face recognition, object recognition, and image classification. Multiple application instances sharing GPU resources can deploy more applications with limited GPU resources. However, this will lead to resource competition problems, resulting in application switching, timeouts, and other phenomena. Therefore, how to deploy these applications to a single machine with limited resources and properly schedule tasks while maximizing system performance is a new challenge. In this article, we propose Tinker, a middleware, to solve multiple CNN-based application deployment problems on a single machine. Tinker has two phases: 1) offline analysis and 2) runtime scheduling. Offline analysis generates the best application deployment configuration information of the current system. Runtime scheduling can properly schedule tasks to ensure that they are completed normally and efficiently utilize resources to improve system performance. Our experiment proved that Tinker boosts system performance and ensures that most tasks are completed efficiently.
引用
收藏
页码:1495 / 1499
页数:5
相关论文