A Test Architecture for Machine Learning Product

Cited by: 12
Authors
Nishi, Yasuharu [1]
Masuda, Satoshi [2]
Ogawa, Hideto [3]
Uetsuki, Keiji [4]
Affiliations
[1] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo, Tokyo, Japan
[2] IBM Res, Tokyo Res Lab, Tokyo, Japan
[3] Hitachi Ltd, Ctr Technol Innovat Syst Engn, Yokohama, Kanagawa, Japan
[4] Assoc Software Testing Engn Japan, Tokyo, Japan
Source
2018 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW) | 2018
Keywords
test architecture; test level; test type; test design; artificial intelligence; machine learning; quality assurance; functional safety;
DOI
10.1109/ICSTW.2018.00060
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
As machine learning (ML) technology continues to spread through rapid evolution, systems and services that use ML technology, called ML products, have a large impact on our lives, society and economy. Meanwhile, Quality Assurance (QA) for ML products is considerably more difficult than for hardware, non-ML software and services, because the better performance of ML technology over non-ML technology comes in exchange for characteristics of ML products such as low explainability. We must sustain rapid evolution and reduce the quality risk of ML products simultaneously. In this paper, we present a Quality Assurance Framework for Machine Learning products. The scope of QA in this paper is limited to product evaluation. First, a policy of QA for ML products is proposed. General principles of product evaluation are introduced and applied to ML product evaluation as one part of the policy. They are composed of A-ARAI: Allowability, Achievability, Robustness, Avoidability and Improvability. A strategy of ML product evaluation is constructed as another part of the policy. A Quality Integrity Level for ML products is also modelled. Second, we propose a test architecture for ML product testing. It consists of test levels and fundamental test types of ML product testing, including snapshot testing, learning testing and confrontation testing. Finally, we define QA activity levels for ML products.
Pages: 273-278
Page count: 6