REFORMS: Consensus-based Recommendations for Machine-learning-based Science

被引:18
作者
Kapoor, Sayash [1 ,2 ]
Cantrell, Emily M. [3 ,4 ]
Peng, Kenny [5 ]
Pham, Thanh Hien [1 ,2 ]
Bail, Christopher A. [6 ,7 ,8 ]
Gundersen, Odd Erik [9 ,10 ]
Hofman, Jake M. [11 ]
Hullman, Jessica [12 ]
Lones, Michael A. [13 ]
Malik, Momin M. [14 ,15 ,16 ]
Nanayakkara, Priyanka [12 ,17 ]
Poldrack, Russell A. [18 ]
Raji, Inioluwa Deborah [19 ]
Roberts, Michael [20 ,21 ]
Salganik, Matthew J. [2 ,3 ,22 ]
Serra-Garcia, Marta [23 ]
Stewart, Brandon M. [2 ,3 ,22 ,24 ]
Vandewiele, Gilles [25 ]
Narayanan, Arvind [1 ,2 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Princeton Univ, Ctr Informat Technol Policy, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Sociol, Princeton, NJ 08544 USA
[4] Princeton Univ, Sch Publ & Int Affairs, Princeton, NJ 08544 USA
[5] Cornell Univ, Dept Comp Sci, Ithaca, NY 14850 USA
[6] Duke Univ, Dept Sociol, Durham, NC 27708 USA
[7] Duke Univ, Dept Polit Sci, Durham, NC 27708 USA
[8] Duke Univ, Sanford Sch Publ Policy, Durham, NC 27708 USA
[9] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
[10] Aneo AS, Trondheim, Norway
[11] Microsoft Res, New York, NY 10012 USA
[12] Northwestern Univ, Dept Comp Sci, Evanston, IL 60208 USA
[13] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh, Scotland
[14] Mayo Clin, Ctr Digital Hlth, Rochester, MN 55905 USA
[15] Univ Penn, Sch Social Policy & Practice, Philadelphia, PA 19104 USA
[16] Johns Hopkins Univ, Inst Crit Quantitat Computat & Mixed Methodol, Baltimore, MD 21218 USA
[17] Northwestern Univ, Dept Commun Studies, Evanston, IL 60208 USA
[18] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[19] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[20] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge, England
[21] Univ Cambridge, Dept Med, Cambridge, England
[22] Princeton Univ, Off Populat Res, Princeton, NJ 08544 USA
[23] Univ Calif San Diego, Rady Sch Management, La Jolla, CA 92093 USA
[24] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
[25] Univ Ghent, Dept Informat Technol, Ghent, Belgium
基金
英国工程与自然科学研究理事会;
关键词
CROSS-VALIDATION; MISSING DATA; MODEL SELECTION; PREDICTION; QUALITY; HEALTH; REPRODUCIBILITY; PERFORMANCE; PSYCHOLOGY; TRIALS;
D O I
10.1126/sciadv.adk3452
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways across disciplines. Motivated by this observation, our goal is to provide clear recommendations for conducting and reporting ML-based science. Drawing from an extensive review of past literature, we present the REFORMS checklist (recommendations for machine-learning-based science). It consists of 32 questions and a paired set of guidelines. REFORMS was developed on the basis of a consensus of 19 researchers across computer science, data science, mathematics, social sciences, and biomedical sciences. REFORMS can serve as a resource for researchers when designing and implementing a study, for referees when reviewing papers, and for journals when enforcing standards for transparency and reproducibility. We provide a checklist to improve reporting practices in ML-based science based on a review of best practices and common errors.
引用
收藏
页数:17
相关论文
共 196 条
[1]  
AAAI, AAAI reproducibility checklist
[2]   Retire statistical significance [J].
Amrhein, Valentin ;
Greenland, Sander ;
McShane, Blake .
NATURE, 2019, 567 (7748) :305-307
[3]   Prediction-powered inference [J].
Angelopoulos, Anastasios N. ;
Bates, Stephen ;
Fannjiang, Clara ;
Jordan, Michael I. ;
Zrnic, Tijana .
SCIENCE, 2023, 382 (6671) :669-674
[4]   Reviewing computational methods [J].
不详 .
NATURE METHODS, 2015, 12 (12) :1099-1099
[5]  
[Anonymous], 2023, Brain imaging data structure
[6]  
[Anonymous], 2015, Principles and guidelines for reporting preclinical research
[7]  
[Anonymous], 2007, MISSING DATA GENTLE
[8]  
[Anonymous], REPORTING GUIDELINES
[9]  
[Anonymous], 2017, Nature research code and software submission checklist
[10]   Machine Learning Methods That Economists Should Know About [J].
Athey, Susan ;
Imbens, Guido W. .
ANNUAL REVIEW OF ECONOMICS, VOL 11, 2019, 2019, 11 :685-725