A randomized controlled trial on the nomenclature of scientific computing

Cited by: 0
Authors
Kluthe, Timothy [1]
Stabler, Hannah [1]
McNamara, Amelia [2]
Stefik, Andreas [1]
Affiliations
[1] Univ Nevada, Comp Sci, Las Vegas, NV 89154 USA
[2] Univ St Thomas, Comp & Informat Sci, St Paul, MN USA
Funding
National Science Foundation (USA)
Keywords
Programming language usability; scientific computing; data science; statistics
DOI
10.1080/08993408.2024.2403971
Chinese Library Classification number
G40 [Education]
Discipline classification codes
040101; 120403
Abstract
Background and Context: Data science and statistics are used across a broad spectrum of professions, experience levels and programming languages. The popular scientific computing languages, such as Matlab, Python and R, were organized without using empirical methods to show evidence for or against their design choices, so their designs can feel eclectic or esoteric.
Objective: To meaningfully organize scientific computing based on evidence gathered through user feedback, build a statistical package based on the findings and provide a replication packet to run similar studies on people with different backgrounds.
Method: A randomized controlled trial using a weighted, ranked choice survey (n = 118) with a between-subjects design having two independent variables: Language Group (Matlab, Python and R) and Method Name options. Our dependent variable was a normalized preference rating.
Findings: There was a very small interaction between Language Group and Method Name. Language Group did not have a statistically significant effect, but Method Name did (F(4, 27037) = 2211.23, p < .001, $\eta_p^2 = .247$). Finally, many names in Matlab, Python and R were ranked so poorly that they were not statistically significantly different from a random word in 63.0%, 62.2% and 30.4% of concepts respectively.
Implications: We found organized and structured names were ranked higher by a large margin, suggesting statistical programming today likely needs considerable improvement. Finally, we outline a statistical package built using these principles, provide comparison scripts and describe some of the challenges of going from simple surveys to in-practice libraries.
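As a quick consistency check on the reported effect size, partial eta squared can be recovered from an F statistic and its degrees of freedom with the standard identity eta_p^2 = (F * df_effect) / (F * df_effect + df_error). The sketch below applies that identity to the values quoted in the Findings. The function name `partial_eta_squared` is hypothetical and is not taken from the paper or its statistical package.

```python
def partial_eta_squared(f_stat: float, df_effect: int, df_error: int) -> float:
    """Recover partial eta squared from an F statistic and its degrees of freedom.

    Hypothetical helper for illustration; uses the standard identity
    eta_p^2 = (F * df_effect) / (F * df_effect + df_error).
    """
    numerator = f_stat * df_effect
    return numerator / (numerator + df_error)


# Values quoted in the abstract for the Method Name effect.
eta_p2 = partial_eta_squared(f_stat=2211.23, df_effect=4, df_error=27037)
print(round(eta_p2, 3))  # 0.247
```

The result agrees with the reported value of .247, so the F statistic, degrees of freedom and effect size in the abstract are mutually consistent.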
Pages: 29