Data Science for Starters: How to Train and Be Trained

被引:0
作者
Zupan, Blaz [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Vecna Pot 113, Ljubljana, Slovenia
来源
ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2022 | 2022年 / 13263卷
关键词
Data science; Machine learning; Hands-on training;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the right tool, it may take only a few hours to familiarize outsiders with data science. Data science, machine learning, and artificial intelligence are drivers of change in all fields of science, including biomedicine. The computational approaches that can sip through vast collections of data, extract interesting patterns, and devise predictive models are becoming omnipresent. But only a few professionals understand the essential concepts behind data science, and even fewer engage in building models using their data. We here report on the contents of the tutorial at AIME-2022, where we aim to explain how anybody who can spare a few hours can learn the essential mechanics behind data science and machine learning. With the training we have designed, the professionals can gain enough intuition about data science to recognize opportunities that this field can offer and actively engage in data science projects. Besides good mentors and an encouraging working environment, the right tool is critical for such training. We advocate the workflow-based construction of analytical pipelines with interactive visualizations and show that they can be the key to the simplicity of the interface and flexibility to adopt analytics to any data type and problem domain.
引用
收藏
页码:450 / 454
页数:5
相关论文
共 7 条
[1]   Microarray data mining with visual programming [J].
Curk, T ;
Demsar, J ;
Xu, QK ;
Leban, G ;
Petrovic, U ;
Bratko, I ;
Shaulsky, G ;
Zupan, B .
BIOINFORMATICS, 2005, 21 (03) :396-398
[2]   Hands-on training about overfitting [J].
Demsar, Janez ;
Zupan, Blaz .
PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (03)
[3]  
Demsar J, 2013, J MACH LEARN RES, V14, P2349
[4]   Building the biomedical data science workforce [J].
Dunn, Michelle C. ;
Bourne, Philip E. .
PLOS BIOLOGY, 2017, 15 (07)
[5]   Data literacy training needs of biomedical researchers [J].
Federer, Lisa M. ;
Lu, Ya-Ling ;
Joubert, Douglas J. .
JOURNAL OF THE MEDICAL LIBRARY ASSOCIATION, 2016, 104 (01) :52-57
[6]   Democratized image analytics by visual programming through integration of deep models and small-scale machine learning [J].
Godec, Primoz ;
Pancur, Matja ;
Ilenic, Nejc ;
Copar, Andrej ;
Strazar, Martin ;
Erjavec, Ales ;
Pretnar, Ajda ;
Demsar, Janez ;
Staric, Anze ;
Toplak, Marko ;
Zagar, Lan ;
Hartman, Jan ;
Wang, Hamilton ;
Bellazzi, Riccardo ;
Petrovic, Uros ;
Garagna, Silvia ;
Zuccotti, Maurizio ;
Park, Dongsu ;
Shaulsky, Gad ;
Zupan, Blaz .
NATURE COMMUNICATIONS, 2019, 10 (1)
[7]   scOrange-a tool for hands-on training of concepts from single-cell data analytics [J].
Strazar, Martin ;
Zagar, Lan ;
Kokosar, Jaka ;
Tanko, Vesna ;
Erjavec, Ales ;
Policar, Pavlin G. ;
Staric, Anze ;
Demsar, Janez ;
Shaulsky, Gad ;
Menon, Vilas ;
Lemire, Andrew ;
Parikh, Anup ;
Zupan, Blaz .
BIOINFORMATICS, 2019, 35 (14) :I4-I12