Data Management Challenges for Deep Learning

被引:51
作者
Raj, Aiswarya [1 ]
Bosch, Jan [1 ]
Olsson, Helena Holmstrom [2 ]
Arpteg, Anders [3 ]
Brinne, Bjorn [3 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Malmo, Sweden
[3] Peltarion AB, Stockholm, Sweden
来源
2019 45TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2019) | 2019年
关键词
Deep learning; Data Management; Machine learning; Artificial intelligence; Deep Neural Networks;
D O I
10.1109/SEAA.2019.00030
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning is one of the most exciting and fast-growing techniques in Artificial Intelligence. The unique capacity of deep learning models to automatically learn patterns from the data differentiates it from other machine learning techniques. Deep learning is responsible for a significant number of recent breakthroughs in AI. However, deep learning models are highly dependent on the underlying data. So, consistency, accuracy, and completeness of data is essential for a deep learning model. Thus, data management principles and practices need to be adopted throughout the development process of deep learning models. The objective of this study is to identify and categorise data management challenges faced by practitioners in different stages of end-to-end development. In this paper, a case study approach is employed to explore the data management issues faced by practitioners across various domains when they use real-world data for training and deploying deep learning models. Our case study is intended to provide valuable insights to the deep learning community as well as for data scientists to guide discussion and future research in applied deep learning with real-world data.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 27 条
[1]  
[Anonymous], 2011, LANGUAGE UNDERSTANDI
[2]  
Arpteg A., 2017, SEAA C
[3]  
Baxter P, 2008, QUAL REP, V13, P544
[4]   Big Data Deep Learning: Challenges and Perspectives [J].
Chen, Xue-Wen ;
Lin, Xiaotong .
IEEE ACCESS, 2014, 2 :514-525
[5]   Online particle detection with Neural Networks based on topological calorimetry information [J].
Ciodaro, T. ;
Deva, D. ;
de Seixas, J. M. ;
Damazio, D. .
14TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH (ACAT 2011), 2012, 368
[6]  
Efrati A., 2013, DEEP LEARNING WORKS
[7]  
He K., 2016, CVPR, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
[8]   Connectomic reconstruction of the inner plexiform layer in the mouse retina [J].
Helmstaedter, Moritz ;
Briggman, Kevin L. ;
Turaga, Srinivas C. ;
Jain, Viren ;
Seung, H. Sebastian ;
Denk, Winfried .
NATURE, 2013, 500 (7461) :168-+
[9]   Deep Neural Networks for Acoustic Modeling in Speech Recognition [J].
Hinton, Geoffrey ;
Deng, Li ;
Yu, Dong ;
Dahl, George E. ;
Mohamed, Abdel-rahman ;
Jaitly, Navdeep ;
Senior, Andrew ;
Vanhoucke, Vincent ;
Patrick Nguyen ;
Sainath, Tara N. ;
Kingsbury, Brian .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :82-97
[10]   THE LEARNING MACHINES [J].
Jones, Nicola .
NATURE, 2014, 505 (7482) :146-148