Check It Before You Wreck It: A Guide to STAR-ML for Screening Machine Learning Reporting in Research

被引:2
作者
Koh, Ryan G. L. [1 ]
Khan, Md Asif [2 ]
Rashidiani, Sajjad [2 ]
Hassan, Samah [3 ]
Tucci, Victoria [4 ]
Liu, Theodore [2 ]
Nesovic, Karlo [1 ]
Kumbhare, Dinesh [1 ]
Doyle, Thomas E. [2 ,5 ,6 ]
机构
[1] Univ Hlth Network UHN, KITE Res Inst, Toronto Rehabil Inst, Toronto, ON M5G 2A2, Canada
[2] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L8, Canada
[3] UHN, Inst Educ Res TIER, Toronto, ON M5T 1V4, Canada
[4] McMaster Univ, Fac Hlth Sci, Hamilton, ON L8S 4L8, Canada
[5] McMaster Univ, Sch Biomed Engn, Hamilton, ON L8S 4L8, Canada
[6] Vector Inst Artificial Intelligence, Toronto, ON M5G 1M1, Canada
关键词
Checklist; literature review; machine learning; quality scoring; reporting assessment; research methodology; screening tool; CONVOLUTIONAL NEURAL-NETWORKS; CLASSIFICATION; STATEMENT; REVIEWS; TRENDS;
D O I
10.1109/ACCESS.2023.3316019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) is a technique that learns to detect patterns and trends in data. However, the quality of reporting ML in research is often suboptimal, leading to inaccurate conclusions and hindering progress in the field, especially if disseminated in literature reviews that provide researchers with an overview of a field, current knowledge gaps, and future directions. While various tools are available to assess the quality and risk-of-bias of studies, there is currently no generalized tool for assessing the reporting quality of ML in the literature. To address this, this study presents a new screening tool called STAR-ML (Screening Tool for Assessing Reporting of Machine Learning), accompanied by a guide to using it. A pilot scoping review looking at ML in chronic pain was used to investigate the tool. The time it took to screen papers and how the selection of the threshold affected the papers included were explored. The tool provides researchers with a reliable and systematic way to evaluate the quality of reporting of ML studies and to make informed decisions about the inclusion of studies in scoping or systematic reviews. In addition, this study provides recommendations for authors on how to choose the threshold for inclusion and use the tool proficiently. Lastly, the STAR-ML tool can serve as a checklist for researchers seeking to develop or implement ML techniques effectively.
引用
收藏
页码:101567 / 101579
页数:13
相关论文
共 95 条
  • [1] Algorithmic bias in machine learning-based marketing models
    Akter, Shahriar
    Dwivedi, Yogesh K.
    Sajib, Shahriar
    Biswas, Kumar
    Bandara, Ruwan J.
    Michael, Katina
    [J]. JOURNAL OF BUSINESS RESEARCH, 2022, 144 : 201 - 216
  • [2] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [3] andWeiyuWang Keng Siau, 2018, Cutter business technology journal, V31, P47
  • [4] [Anonymous], 2022, Hands-on Machine Learning with Scikit-Learn, Keras, and Hands-On Machine Learning TensorFlow
  • [5] A survey of cross-validation procedures for model selection
    Arlot, Sylvain
    Celisse, Alain
    [J]. STATISTICS SURVEYS, 2010, 4 : 40 - 79
  • [6] Batista G.E., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [10.1145/1007730.1007735, 10.1145/1007730.1007735.2, DOI 10.1145/1007730.1007735]
  • [7] GPS driving: a digital biomarker for preclinical Alzheimer disease
    Bayat, Sayeh
    Babulal, Ganesh M.
    Schindler, Suzanne E.
    Fagan, Anne M.
    Morris, John C.
    Mihailidis, Alex
    Roe, Catherine M.
    [J]. ALZHEIMERS RESEARCH & THERAPY, 2021, 13 (01)
  • [8] Bossuyt P M., Ann. Internal Med., V138, pW1
  • [9] Bracke P., 2019, STAFF WORKING PAPER, DOI [10.2139/ssrn.3435104, DOI 10.2139/SSRN.3435104]
  • [10] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)