Helping Data Science Students Develop Task Modularity

Cited by: 0
Authors
Saltz, Jeffrey S. [1]
Heckman, Robert [1]
Crowston, Kevin [1]
You, Sangseok [2]
Hedge, Yatish [1]
Affiliations
[1] Syracuse Univ, Syracuse, NY 13244 USA
[2] HEC Paris, Paris, France
Source
PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES | 2019
Keywords
SYSTEMS;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
This paper explores the skills needed to be a data scientist. Specifically, we report on a mixed-methods study of a project-based data science class, in which we evaluated how effectively students divided a project into appropriately sized modular tasks, a skill we term task modularity. Our results suggest that while data science students appreciate the value of task modularity, they struggle to achieve it effectively. As a first step, based on our study, we identified six task decomposition best practices. However, these best practices do not fully address the gap of how to enable data science students to use task modularity effectively. We note that while computer science / information system programs typically teach modularity (e.g., the decomposition process and abstraction), there remains a need to identify a model, corresponding to the one used for computer science / information system students, for teaching modularity to data science students.
Pages: 1095-1104
Number of pages: 10