Automated end-to-end management of the modeling lifecycle in deep learning

被引：21

作者：

Gharibi, Gharib ^{[1
]}

Walunj, Vijay ^{[1
]}

Nekadi, Raju ^{[1
]}

Marri, Raj ^{[1
]}

Lee, Yugyung ^{[1
]}

机构：

[1] Univ Missouri, Sch Comp & Engn, 5000 Holmes St, Kansas City, MO 64110 USA

来源：

EMPIRICAL SOFTWARE ENGINEERING | 2021年 / 26卷 / 02期

关键词：

Data management; Deep learning; Software automation;

D O I：

10.1007/s10664-020-09894-9

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Deep learning has improved the state-of-the-art results in an ever-growing number of domains. This success heavily relies on the development and training of deep learning models-an experimental, iterative process that produces tens to hundreds of models before arriving at a satisfactory result. While there has been a surge in the number of tools and frameworks that aim at facilitating deep learning, the process of managing the models and their artifacts is still surprisingly challenging and time-consuming. Existing model-management solutions are either tailored for commercial platforms or require significant code changes. Moreover, most of the existing solutions address a single phase of the modeling lifecycle, such as experiment monitoring, while ignoring other essential tasks, such as model deployment. In this paper, we present a software system to facilitate and accelerate the deep learning lifecycle, named ModelKB. ModelKB can automatically manage the modeling lifecycle end-to-end, including (1) monitoring and tracking experiments; (2) visualizing, searching for, and comparing models and experiments; (3) deploying models locally and on the cloud; and (4) sharing and publishing trained models. Moreover, our system provides a stepping-stone for enhanced reproducibility. ModelKB currently supports TensorFlow 2.0, Keras, and PyTorch, and it can be extended to other deep learning frameworks easily.

引用

页数：33

共 68 条

[21]

Grinberg M, 2018, FLASK WEB DEV DEVELO

[22]

Hall M.A., 1999, P 17 INT C MACHINE L, P359

[23]

Hannun Awni, 2014, CoRR, P4

[24] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[25]

Hellerstein J.M., 2017, CIDR

[26] ModelDB: A database to support computational neuroscience [J].

Hines, ML ;

Morse, T ;

Migliore, M ;

Carnevale, NT ;

Shepherd, GM .

JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2004, 17 (01) :7-11

[27] Caffe: Convolutional Architecture for Fast Feature Embedding [J].

Jia, Yangqing ;

Shelhamer, Evan ;

Donahue, Jeff ;

Karayev, Sergey ;

Long, Jonathan ;

Girshick, Ross ;

Guadarrama, Sergio ;

Darrell, Trevor .

PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678

[28]

Jinja, 2019, PYTHON TEMPLATE LANG

[29] Large-scale Video Classification with Convolutional Neural Networks [J].

Karpathy, Andrej ;

Toderici, George ;

Shetty, Sanketh ;

Leung, Thomas ;

Sukthankar, Rahul ;

Fei-Fei, Li .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1725-1732

[30] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

← 1 2 3 4 5 6 7 →