Analyze This! 145 Questions for Data Scientists in Software Engineering

被引:122
作者
Begel, Andrew [1 ]
Zimmermann, Thomas [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
来源
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014) | 2014年
关键词
Data Science; Software Engineering; Analytics;
D O I
10.1145/2568225.2568233
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we present the results from two surveys related to data science applied to software engineering. The first survey solicited questions that software engineers would like data scientists to investigate about software, about software processes and practices, and about software engineers. Our analyses resulted in a list of 145 questions grouped into 12 categories. The second survey asked a different pool of software engineers to rate these 145 questions and identify the most important ones to work on first. Respondents favored questions that focus on how customers typically use their applications. We also saw opposition to questions that assess the performance of individual employees or compare them with one another. Our categorization and catalog of 145 questions can help researchers, practitioners, and educators to more easily focus their efforts on topics that are important to the software industry.
引用
收藏
页码:12 / 23
页数:12
相关论文
共 51 条
[1]  
Allamanis M, 2013, IEEE WORK CONF MIN S, P207, DOI 10.1109/MSR.2013.6624029
[2]  
[Anonymous], 1985, Structure and Interpretation of Computer Programs
[3]  
[Anonymous], 2012, HARVARD BUSINESS REV
[4]  
[Anonymous], 2003, Moneyball: The Art of Winning an Unfair Game
[5]  
[Anonymous], 2011, BIG DATA NEXT FRONTI
[6]  
[Anonymous], 2010, P 32 ACM IEEE INT C, DOI DOI 10.1145/1806799.1806842
[7]   Building knowledge through families of experiments [J].
Basili, VR ;
Lanubile, F .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (04) :456-473
[8]  
Begel A., 2013, TECHNICAL REPORT
[9]  
Begel Andrew., 2010, P 32 ACMIEEE INT C S, P125
[10]  
Beget Andrew, 2008, P 4 INT WORKSHOP COM, P3