Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

被引:103
作者
Lwakatare, Lucy Ellen [1 ]
Raj, Aiswarya [1 ]
Crnkovic, Ivica [1 ]
Bosch, Jan [1 ]
Olsson, Helena Holmstrom [2 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Horselgagen 11, S-41296 Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Nordenskioldsgatan 1, S-21119 Malmo, Sweden
关键词
Machine learning systems; Software engineering; Industrial settings; Challenges; Solutions; SLR;
D O I
10.1016/j.infsof.2020.106368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background : Developing and maintaining large scale machine learning (ML) based software systems in an in-dustrial setting is challenging. There are no well-established development guidelines, but the literature contains reports on how companies develop and maintain deployed ML-based software systems. Objective : This study aims to survey the literature related to development and maintenance of large scale ML -based systems in industrial settings in order to provide a synthesis of the challenges that practitioners face. In addition, we identify solutions used to address some of these challenges. Method : A systematic literature review was conducted and we identified 72 papers related to development and maintenance of large scale ML-based software systems in industrial settings. The selected articles were qualita-tively analyzed by extracting challenges and solutions. The challenges and solutions were thematically synthe-sized into four quality attributes: adaptability, scalability, safety and privacy. The analysis was done in relation to ML workflow, i.e. data acquisition, training, evaluation, and deployment. Results : We identified a total of 23 challenges and 8 solutions related to development and maintenance of large scale ML-based software systems in industrial settings including six different domains. Challenges were most often reported in relation to adaptability and scalability. Safety and privacy challenges had the least reported solutions. Conclusion : The development and maintenance on large-scale ML-based systems in industrial settings introduce new challenges specific for ML, and for the known challenges characteristic for these types of systems, require new methods in overcoming the challenges. The identified challenges highlight important concerns in ML system development practice and the lack of solutions point to directions for future research.
引用
收藏
页数:17
相关论文
共 52 条
[1]   DeepDIVA: A Highly-Functional Python']Python Framework for Reproducible Experiments [J].
Alberti, Michele ;
Pondenkandath, Vinaychandran ;
Wursch, Marcel ;
Ingold, Rolf ;
Liwicki, Marcus .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :423-428
[2]   Software Engineering for Machine Learning: A Case Study [J].
Amershi, Saleema ;
Begel, Andrew ;
Bird, Christian ;
DeLine, Robert ;
Gall, Harald ;
Kamar, Ece ;
Nagappan, Nachiappan ;
Nushi, Besmira ;
Zimmermann, Thomas .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2019), 2019, :291-300
[3]  
[Anonymous], 2004, PROCEDURES PERFORMIN
[4]   Machine learning in complex systems [J].
Axtell, Travis W. ;
Overbey, Lucas A. ;
Woerner, Lisa .
GROUND/AIR MULTISENSOR INTEROPERABILITY, INTEGRATION, AND NETWORKING FOR PERSISTENT ISR IX, 2018, 10635
[5]   TFX: A TensorFlow-Based Production-Scale Machine Learning Platform [J].
Baylor, Denis ;
Breck, Eric ;
Cheng, Heng-Tze ;
Fiedel, Noah ;
Foo, Chuan Yu ;
Haque, Zakaria ;
Haykal, Salem ;
Ispir, Mustafa ;
Jain, Vihan ;
Koc, Levent ;
Koo, Chiu Yuen ;
Lew, Lukasz ;
Mewald, Clemens ;
Modi, Akshay Naresh ;
Polyzotis, Neoklis ;
Ramesh, Sukriti ;
Roy, Sudip ;
Whang, Steven Euijong ;
Wicke, Martin ;
Wilkiewicz, Jarek ;
Zhang, Xin ;
Zinkevich, Martin .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1387-1395
[6]  
Breck E., 2016, REL MACH LEARN WILD
[7]   Novel Applications of Machine Learning in Software Testing [J].
Briand, Lionel C. .
QSIC 2008: PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, 2008, :3-10
[8]   A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms [J].
Cai, Zhuhua ;
Gao, Zekai J. ;
Luo, Shangyu ;
Perez, Luis L. ;
Vagena, Zografoula ;
Jermaine, Christopher .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :1371-1382
[9]  
Cheng Heng-Tze, 2016, P 1 WORKSH DEEP LEAR, P7
[10]   Recommended Steps for Thematic Synthesis in Software Engineering [J].
Cruzes, Daniela S. ;
Dyba, Tore .
2011 FIFTH INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2011), 2011, :275-284