HOW ACCURATE IS SCIENTIFIC SOFTWARE

被引:65
作者
HATTON, L
ROBERTS, A
机构
[1] ENTERPRISE OIL PLC,LONDON WC2N 5EJ,ENGLAND
[2] UNIV LONDON KINGS COLL,CAMBRIDGE,ENGLAND
[3] UNIV MANCHESTER,MANCHESTER,LANCS,ENGLAND
关键词
ACCURACY; N-VERSION; FAULT VISUALIZATION;
D O I
10.1109/32.328993
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper describes some results of what, to the authors' knowledge, is the largest N-version programming experiment ever performed. The object of this ongoing four-year study is to attempt to determine just how consistent the results of scientific computation really are, and, from this, to estimate accuracy. The experiment is being carried out in a branch of the earth sciences known as seismic data processing, where 15 or so independently developed large commercial packages that implement mathematical algorithms from the same or similar published specifications in the same programming language (Fortran) have been developed over the last 20 years. The results of processing the same input dataset, using the same user-specified parameters, for nine of these packages is reported in this paper. Finally, feedback of obvious flaws was attempted to reduce the overall disagreement. The results are deeply disturbing. Whereas scientists like to think that their code is accurate to the precision of the arithmetic used, in this study, numerical disagreement grows at around the rate of 1% in average absolute difference per 4000 lines of implemented code, and, even worse, the nature of the disagreement is nonrandom. Furthermore, the seismic data processing industry has better than average quality standards for its software development with both identifiable quality assurance functions and substantial test datasets. Comparing the results reported here with other work by Hatton showing broadly similar statically detectable fault rates in software from different disciplines gives strong indications that the software realisations of work in other scientific fields may be a great deal less accurate than many would believe. Against this backdrop, the authors believe that little progress will be made in some sciences until the problem is reduced, particularly in remote sensing, where the answer is generally inaccessible to direct measurement. To this end, the feedback experiments that formed part of the study proved valuable, resulting in significant reductions in disagreement.
引用
收藏
页码:785 / 797
页数:13
相关论文
共 17 条
[1]   RECOMMENDED STANDARDS FOR DIGITAL TAPE FORMATS [J].
BARRY, KM ;
CAVERS, DA ;
KNEALE, CW .
GEOPHYSICS, 1975, 40 (02) :344-352
[2]  
Claerbout J. F., 1985, IMAGING EARTHS INTER, P398
[3]  
HALE ID, 1990, DIP MOVEOUT PROCESSI, V4
[4]   THE SEISMIC KERNEL SYSTEM - A LARGE-SCALE EXERCISE IN FORTRAN-77 PORTABILITY [J].
HATTON, L ;
WRIGHT, A ;
SMITH, S ;
PARKES, G ;
BENNETT, P ;
LAWS, R .
SOFTWARE-PRACTICE & EXPERIENCE, 1988, 18 (04) :301-329
[5]  
HATTON L, 1994, DEV HIGH INTEGRITY S
[6]  
HATTON L, 1986, SEISMIC DATA PROCESS, P177
[7]  
HATTON L, 1993, QUALITY RELIABILITY
[8]  
HATTON L, 1992, ANAL AGREEMENT SEISM
[9]  
KELLER TW, 1993, 2 INT SOFTW TEST AN
[10]  
KOENING A, 1988, C TRAPS PITFALLS, P146