The Lean Data Scientist: Recent Advances toward Overcoming the Data Bottleneck

被引:8
作者
Shani, Chen [1 ]
Zarecki, Jonathan [2 ]
Shahaf, Dafna [3 ]
机构
[1] Hebrew Univ Jerusalem, Jerusalem, Israel
[2] Israeli Mil Intelligence, Tel Aviv, Israel
[3] Hebrew Univ Jerusalem, Data Sci, Jerusalem, Israel
基金
欧洲研究理事会;
关键词
D O I
10.1145/3551635
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A taxonomy of the methods used to obtain quality datasets enhances existing resources.
引用
收藏
页码:92 / 102
页数:11
相关论文
共 60 条
[11]  
Duong L, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, P845
[12]  
Finn C, 2017, PR MACH LEARN RES, V70
[13]   A Winnow-based approach to context-sensitive spelling correction [J].
Golding, AR ;
Roth, D .
MACHINE LEARNING, 1999, 34 (1-3) :107-130
[14]  
Gurevich N, 2006, P NAT C ART INT, P362
[15]  
Gururangan S, 2018, Arxiv, DOI arXiv:1803.02324
[16]  
Hacohen G, 2022, Arxiv, DOI [arXiv:2202.02794, DOI 10.48550/ARXIV.2202.02794]
[17]  
Hacohen G, 2019, Arxiv, DOI arXiv:1904.03626
[18]  
Hope T., 2016, JOINT EUROPEAN C MAC, P299
[19]  
Jiang L, 2015, AAAI CONF ARTIF INTE, P2694
[20]   Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes [J].
Kottur, Satwik ;
Vedantam, Ramakrishna ;
Moura, Jose M. F. ;
Parikh, Devi .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4985-4994