The Open-Closed Principle of Modern Machine Learning Frameworks

被引:24
作者
Ben Braiek, Houssem [1 ]
Khomh, Foutse [1 ]
Adams, Bram [2 ]
机构
[1] Polytech Montreal, SWAT Lab, Montreal, PQ, Canada
[2] Polytech Montreal, MCIS, Montreal, PQ, Canada
来源
2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR) | 2018年
基金
加拿大自然科学与工程研究理事会;
关键词
Machine Learning; Open Source; Framework; Technology adoption;
D O I
10.1145/3196398.3196445
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in computing technologies and the availability of huge volumes of data have sparked a new machine learning (ML) revolution, where almost every day a new headline touts the demise of human experts by ML models on some task. Open source software development is rumoured to play a significant role in this revolution, with both academics and large corporations such as Google and Microsoft releasing their ML frameworks under an open source license. This paper takes a step back to examine and understand the role of open source development in modern ML, by examining the growth of the open source ML ecosystem on GitHub, its actors, and the adoption of frameworks over time. By mining LinkedIn and Google Scholar profiles, we also examine driving factors behind this growth (paid vs. voluntary contributors), as well as the major players who promote its democratization (companies vs. communities), and the composition of ML development teams (engineers vs. scientists). According to the technology adoption lifecycle, we find that ML is in between the stages of early adoption and early majority. Furthermore, companies are the main drivers behind open source ML, while the majority of development teams are hybrid teams comprising both engineers and professional scientists. The latter correspond to scientists employed by a company, and by far represent the most active profiles in the development of ML applications, which reflects the importance of a scientific background for the development of ML frameworks to complement coding skills. The large influence of cloud computing companies on the development of open source ML frameworks raises the risk of vendor lock-in. These frameworks, while open source, could be optimized for specific commercial cloud offerings.
引用
收藏
页码:353 / 363
页数:11
相关论文
共 20 条
[1]  
Abadi Abadi M M, Tensorflow: Large-scale machine learning on heterogeneous distributed systems Tensorflow: Large-scale machine learning on heterogeneous distributed systems
[2]  
[Anonymous], 2010, Introduction to Machine Learning
[3]  
[Anonymous], 2011, INNOVATION ADOPTION
[4]  
[Anonymous], 2002, Applied Social Research Methods Series
[5]  
Bettenburg N, 2015, EMPIR SOFTW ENG, V20, P252, DOI 10.1007/s10664-013-9284-6
[6]  
Bird Christian, 2012, 2012 9th IEEE Working Conference on Mining Software Repositories (MSR 2012), P237, DOI 10.1109/MSR.2012.6224286
[7]  
Bohlen J.M., 1957, Special Report, Agriculture Extension Service, V18, P56
[8]   Between Organization and Community: Investigating Turnover Intention Factors of Firm-Sponsored Open Source Software Developers [J].
Homscheid, Dirk ;
Schaarschmidt, Mario .
PROCEEDINGS OF THE 2016 ACM WEB SCIENCE CONFERENCE (WEBSCI'16), 2016, :336-337
[9]  
International Data Corporation, 2016, DOUBL DIG GROWTH FOR
[10]   In-Datacenter Performance Analysis of a Tensor Processing Unit [J].
Jouppi, Norman P. ;
Young, Cliff ;
Patil, Nishant ;
Patterson, David ;
Agrawal, Gaurav ;
Bajwa, Raminder ;
Bates, Sarah ;
Bhatia, Suresh ;
Boden, Nan ;
Borchers, Al ;
Boyle, Rick ;
Cantin, Pierre-luc ;
Chao, Clifford ;
Clark, Chris ;
Coriell, Jeremy ;
Daley, Mike ;
Dau, Matt ;
Dean, Jeffrey ;
Gelb, Ben ;
Ghaemmaghami, Tara Vazir ;
Gottipati, Rajendra ;
Gulland, William ;
Hagmann, Robert ;
Ho, C. Richard ;
Hogberg, Doug ;
Hu, John ;
Hundt, Robert ;
Hurt, Dan ;
Ibarz, Julian ;
Jaffey, Aaron ;
Jaworski, Alek ;
Kaplan, Alexander ;
Khaitan, Harshit ;
Killebrew, Daniel ;
Koch, Andy ;
Kumar, Naveen ;
Lacy, Steve ;
Laudon, James ;
Law, James ;
Le, Diemthu ;
Leary, Chris ;
Liu, Zhuyuan ;
Lucke, Kyle ;
Lundin, Alan ;
MacKean, Gordon ;
Maggiore, Adriana ;
Mahony, Maire ;
Miller, Kieran ;
Nagarajan, Rahul ;
Narayanaswami, Ravi .
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, :1-12