MLife: a lite framework for machine learning lifecycle initialization

被引:6
作者
Yang, Cong [1 ]
Wang, Wenfeng [1 ]
Zhang, Yunhui [1 ]
Zhang, Zhikai [1 ]
Shen, Lina [1 ]
Li, Yipeng [2 ]
See, John [3 ]
机构
[1] Horizon Robot, Nanjing, Peoples R China
[2] Clobot, Seattle, WA 98195 USA
[3] Heriot Watt Univ Malaysia, Putrajaya, Malaysia
关键词
Machine learning; Machine learning lifecycle; Machine learning system; Deep learning; Data flow;
D O I
10.1007/s10994-021-06052-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning (ML) lifecycle is a cyclic process to build an efficient ML system. Though a lot of commercial and community (non-commercial) frameworks have been proposed to streamline the major stages in the ML lifecycle, they are normally overqualified and insufficient for an ML system in its nascent phase. Driven by real-world experience in building and maintaining ML systems, we find that it is more efficient to initialize the major stages of ML lifecycle first for trial and error, followed by the extension of specific stages to acclimatize towards more complex scenarios. For this, we introduce a simple yet flexible framework, MLife, for fast ML lifecycle initialization. This is built on the fact that data flow in MLife is in a closed loop driven by bad cases, especially those which impact ML model performance the most but also provide the most value for further ML model development-a key factor towards enabling enterprises to fast track their ML capabilities. Better yet, MLife is also flexible enough to be easily extensible to more complex scenarios for future maintenance. For this, we introduce two real-world use cases to demonstrate that MLife is particularly suitable for ML systems in their early phases.
引用
收藏
页码:2993 / 3013
页数:21
相关论文
共 32 条
[1]  
Amazon, 2020, AM MACH LEARN DEV GU, P72
[2]  
[Anonymous], 2018, IEEE DATA ENG B
[3]  
Ashmore R., 2019, ARXIV PREPRINT ARXIV
[4]  
Aslam F A, 2015, Int. J. Adv. Res. Comput. Sci, V6, P54
[5]   TFX: A TensorFlow-Based Production-Scale Machine Learning Platform [J].
Baylor, Denis ;
Breck, Eric ;
Cheng, Heng-Tze ;
Fiedel, Noah ;
Foo, Chuan Yu ;
Haque, Zakaria ;
Haykal, Salem ;
Ispir, Mustafa ;
Jain, Vihan ;
Koc, Levent ;
Koo, Chiu Yuen ;
Lew, Lukasz ;
Mewald, Clemens ;
Modi, Akshay Naresh ;
Polyzotis, Neoklis ;
Ramesh, Sukriti ;
Roy, Sudip ;
Whang, Steven Euijong ;
Wicke, Martin ;
Wilkiewicz, Jarek ;
Zhang, Xin ;
Zinkevich, Martin .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1387-1395
[6]   Sharing Representations for Long Tail Computer Vision Problems [J].
Bengio, Samy .
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, :1-1
[7]  
Bhosale S., 2015, Em: Int. J. Comput. Sci. Mob. Comput, V44, P882
[8]  
Chen C., 2018, IEEE Data Eng. Bull., V41, P10
[9]  
Crankshaw D, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P613
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848