Automated end-to-end management of the modeling lifecycle in deep learning

被引:21
作者
Gharibi, Gharib [1 ]
Walunj, Vijay [1 ]
Nekadi, Raju [1 ]
Marri, Raj [1 ]
Lee, Yugyung [1 ]
机构
[1] Univ Missouri, Sch Comp & Engn, 5000 Holmes St, Kansas City, MO 64110 USA
关键词
Data management; Deep learning; Software automation;
D O I
10.1007/s10664-020-09894-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning has improved the state-of-the-art results in an ever-growing number of domains. This success heavily relies on the development and training of deep learning models-an experimental, iterative process that produces tens to hundreds of models before arriving at a satisfactory result. While there has been a surge in the number of tools and frameworks that aim at facilitating deep learning, the process of managing the models and their artifacts is still surprisingly challenging and time-consuming. Existing model-management solutions are either tailored for commercial platforms or require significant code changes. Moreover, most of the existing solutions address a single phase of the modeling lifecycle, such as experiment monitoring, while ignoring other essential tasks, such as model deployment. In this paper, we present a software system to facilitate and accelerate the deep learning lifecycle, named ModelKB. ModelKB can automatically manage the modeling lifecycle end-to-end, including (1) monitoring and tracking experiments; (2) visualizing, searching for, and comparing models and experiments; (3) deploying models locally and on the cloud; and (4) sharing and publishing trained models. Moreover, our system provides a stepping-stone for enhanced reproducibility. ModelKB currently supports TensorFlow 2.0, Keras, and PyTorch, and it can be extended to other deep learning frameworks easily.
引用
收藏
页数:33
相关论文
共 68 条
[21]  
Grinberg M, 2018, FLASK WEB DEV DEVELO
[22]  
Hall M.A., 1999, P 17 INT C MACHINE L, P359
[23]  
Hannun Awni, 2014, CoRR, P4
[24]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[25]  
Hellerstein J.M., 2017, CIDR
[26]   ModelDB: A database to support computational neuroscience [J].
Hines, ML ;
Morse, T ;
Migliore, M ;
Carnevale, NT ;
Shepherd, GM .
JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2004, 17 (01) :7-11
[27]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[28]  
Jinja, 2019, PYTHON TEMPLATE LANG
[29]   Large-scale Video Classification with Convolutional Neural Networks [J].
Karpathy, Andrej ;
Toderici, George ;
Shetty, Sanketh ;
Leung, Thomas ;
Sukthankar, Rahul ;
Fei-Fei, Li .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1725-1732
[30]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90