A qualitative study of Machine Learning practices and engineering challenges in Earth Observation

被引:3
作者
Jentzsch, Sophie [1 ]
Hochgeschwender, Nico [1 ]
机构
[1] DLR Inst Software Technol, Cologne, Germany
来源
IT-INFORMATION TECHNOLOGY | 2021年 / 63卷 / 04期
关键词
Machine Learning; Artificial Intelligence; Earth Observation; Process Models;
D O I
10.1515/itit-2020-0045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine Learning (ML) is ubiquitously on the advance. Like many domains, Earth Observation (EO) also increasingly relies on ML applications, where ML methods are applied to process vast amounts of heterogeneous and continuous data streams to answer socially and environmentally relevant questions. However, developing such ML- based EO systems remains challenging: Development processes and employed workflows are often barely structured and poorly reported. The application of ML methods and techniques is considered to be opaque and the lack of transparency is contradictory to the responsible development of ML-based EO applications. To improve this situation a better understanding of the current practices and engineering-related challenges in developing ML-based EO applications is required. In this paper, we report observations from an exploratory study where five experts shared their view on ML engineering in semi-structured interviews. We analysed these interviews with coding techniques as often applied in the domain of empirical software engineering. The interviews provide informative insights into the practical development of ML applications and reveal several engineering challenges. In addition, interviewees participated in a novel workflow sketching task, which provided a tangible reflection of implicit processes. Overall, the results confirm a gap between theoretical conceptions and real practices in ML development even though workflows were sketched abstractly as textbook-like. The results pave the way for a large-scale investigation on requirements for ML engineering in EO.
引用
收藏
页码:235 / 247
页数:13
相关论文
共 28 条
[1]   Software Engineering for Machine Learning: A Case Study [J].
Amershi, Saleema ;
Begel, Andrew ;
Bird, Christian ;
DeLine, Robert ;
Gall, Harald ;
Kamar, Ece ;
Nagappan, Nachiappan ;
Nushi, Besmira ;
Zimmermann, Thomas .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2019), 2019, :291-300
[2]   The telephone medium and semi-structured interviews: a complementary fit [J].
Cachia, Moira ;
Millward, Lynne .
QUALITATIVE RESEARCH IN ORGANIZATIONS AND MANAGEMENT, 2011, 6 (03) :265-277
[3]  
de Souza Nascimento E., 2019 ACM IEEE INT S, P1
[4]  
Fischer G., 2019, THESIS ETH ZURICH
[5]   Data Mining and Analytics in the Process Industry: The Role of Machine Learning [J].
Ge, Zhiqiang ;
Song, Zhihuan ;
Deng, Steven X. ;
Huang, Biao .
IEEE ACCESS, 2017, 5 :20590-20616
[6]   Rohingya Refugee Crisis and Forest Cover Change in Teknaf, Bangladesh [J].
Hassan, Mohammad Mehedy ;
Smith, Audrey Culver ;
Walker, Katherine ;
Rahman, Munshi Khaledur ;
Southworth, Jane .
REMOTE SENSING, 2018, 10 (05)
[7]  
Hill C, 2016, S VIS LANG HUM CEN C, P162, DOI 10.1109/VLHCC.2016.7739680
[8]   Don't Forget Your Roots! Using Provenance Data for Transparent and Explainable Development of Machine Learning Models [J].
Jentzsch, Sophie F. ;
Hochgeschwender, Nico .
2019 34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING WORKSHOPS (ASEW 2019), 2019, :37-40
[9]   Data Scientists in Software Teams: State of the Art and Challenges [J].
Kim, Miryung ;
Zimmermann, Thomas ;
DeLine, Robert ;
Begel, Andrew .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (11) :1024-1038
[10]   The Emerging Role of Data Scientists on Software Development Teams [J].
Kim, Miryung ;
Zimmermann, Thomas ;
DeLine, Robert ;
Begel, Andrew .
2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, :96-107