Challenges and practices of deep learning model reengineering: A case study on computer vision

被引:4
作者
Jiang, Wenxin [1 ]
Banna, Vishnu [1 ]
Vivek, Naveen [1 ]
Goel, Abhinav [1 ]
Synovic, Nicholas [2 ]
Thiruvathukal, George K. [2 ]
Davis, James C. [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Loyola Univ Chicago, Chicago, IL USA
关键词
Empirical software engineering; Machine learning; Deep learning; Deep neural networks; Computer vision; Software reliability; Failure analysis; Bug study; Mixed methods; Case study; SOFTWARE; TOOL; AI;
D O I
10.1007/s10664-024-10521-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
ContextMany engineering organizations are reimplementing and extending deep neural networks from the research community. We describe this process as deep learning model reengineering. Deep learning model reengineering - reusing, replicating, adapting, and enhancing state-of-the-art deep learning approaches - is challenging for reasons including under-documented reference models, changing requirements, and the cost of implementation and testing.ObjectivePrior work has characterized the challenges of deep learning model development, but as yet we know little about the deep learning model reengineering process and its common challenges. Prior work has examined DL systems from a "product" view, examining defects from projects regardless of the engineers' purpose. Our study is focused on reengineering activities from a "process" view, and focuses on engineers specifically engaged in the reengineering process.MethodOur goal is to understand the characteristics and challenges of deep learning model reengineering. We conducted a mixed-methods case study of this phenomenon, focusing on the context of computer vision. Our results draw from two data sources: defects reported in open-source reeengineering projects, and interviews conducted with practitioners and the leaders of a reengineering team. From the defect data source, we analyzed 348 defects from 27 open-source deep learning projects. Meanwhile, our reengineering team replicated 7 deep learning models over two years; we interviewed 2 open-source contributors, 4 practitioners, and 6 reengineering team leaders to understand their experiences.ResultsOur results describe how deep learning-based computer vision techniques are reengineered, quantitatively analyze the distribution of defects in this process, and qualitatively discuss challenges and practices. We found that most defects (58%) are reported by re-users, and that reproducibility-related defects tend to be discovered during training (68% of them are). Our analysis shows that most environment defects (88%) are interface defects, and most environment defects (46%) are caused by API defects. We found that training defects have diverse symptoms and root causes. We identified four main challenges in the DL reengineering process: model operationalization, performance debugging, portability of DL operations, and customized data pipeline. Integrating our quantitative and qualitative data, we propose a novel reengineering workflow.ConclusionsOur findings inform several conclusion, including: standardizing model reengineering practices, developing validation tools to support model reengineering, automated support beyond manual model reengineering, and measuring additional unknown aspects of model reengineering.
引用
收藏
页数:61
相关论文
共 201 条
[1]   DISPARITY ANALYSIS BETWEEN THE ASSEMBLY AND BYTE MALWARE SAMPLES WITH DEEP AUTOENCODERS [J].
Abdullah, Muhammed Amin ;
Yu, Yongbin ;
Cai, Jingye ;
Imrana, Yakubu ;
Tettey, Nartey Obed ;
Addo, Daniel ;
Sarpong, Kwabena ;
Agbley, Bless Lord Y. ;
Appiah, Benjamin .
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
[2]   HFCommunity: A Tool to Analyze the Hugging Face Hub Community [J].
Ait, Adem ;
Canovas Izquierdo, Javier Luis ;
Cabot, Jordi .
2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, :728-732
[3]   Challenges for the Repeatability of Deep Learning Models [J].
Alahmari, Saeed S. ;
Goldgof, Dmitry B. ;
Mouton, Peter R. ;
Hall, Lawrence O. .
IEEE ACCESS, 2020, 8 :211860-211868
[4]  
AlDanial, 2022, cloc
[5]  
Ali Q, 2024, Arxiv, DOI arXiv:2405.19976
[6]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[7]   Software Engineering for Machine Learning: A Case Study [J].
Amershi, Saleema ;
Begel, Andrew ;
Bird, Christian ;
DeLine, Robert ;
Gall, Harald ;
Kamar, Ece ;
Nagappan, Nachiappan ;
Nushi, Besmira ;
Zimmermann, Thomas .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2019), 2019, :291-300
[8]  
Amusuo P, 2022, ACM JOINT EUR SOFTW
[9]   Reflecting on Recurring Failures in IoT Development [J].
Anandayuvaraj, Dharun ;
Davis, James C. .
PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022, 2022,
[10]  
[Anonymous], 2020, Managing labels