Application of spectra cross-correlation for Type II outliers screening during multivariate near-infrared spectroscopic analysis of whole blood

被引：5

作者：

Abookasis, David ^{[1
]}

Workman, Jerome J. ^{[2
,3
]}

机构：

[1] Ariel Univ Ctr Samaria, Dept Elect & Elect Engn, IL-44837 Ariel, Israel

[2] Liberty Univ, Lynchburg, VA 24502 USA

[3] US Natl Univ, La Jolla, CA 92037 USA

来源：

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS | 2011年 / 107卷 / 02期

关键词：

Type II outlier detection; Fourier transform infrared spectroscopy; Blood glucose measurements; Partial least squares (PLS); False negative sample; CALIBRATION; GLUCOSE; SQUARES; STATISTICS; REGRESSION; TUTORIAL;

D O I：

10.1016/j.chemolab.2011.04.015

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this study, a simple screening algorithm was developed to prevent the occurrence of Type II errors or samples with high prediction error that are not detected as outliers. The method is used to determine "good" and "bad" spectra and to prevent a false negative condition where poorly predicted samples appear to be within the calibration space, yet have inordinately large residual or prediction errors. The detection and elimination of this type of sample, which is a true outlier but not easily detected, is extremely important in medical decisions, since such erroneous data can lead to considerable mistakes in clinical analysis and medical diagnosis. The algorithm is based on a cross-correlation comparison between samples spectra measured over the region of 4160-4880 cm(-1). The correlation values are converted using the Fisher's z-transform, while a z-test of the transformed values is performed to screen out the outlier spectra. This approach allows the use of a tuning parameter used to decrease the percentage of samples with high analytical (residual) errors. The algorithm was tested using a dataset with known reference values to determine the number of false negative and false positive samples. The cross-correlation algorithm performance was tested on several hundred blood samples prepared at different hematocrit (24 to 48%) and glucose (30 to 500 mg/dL) levels using blood component materials from thirteen healthy human volunteers. Experimental results illustrate the effectiveness of the proposed algorithm in finding and screening out Type II outliers in terms of sensitivity and specificity, and the ability to predict or estimate future or validation datasets ensuring lower error of prediction. To our knowledge this is the first paper to introduce a statistically useful screening method based on spectra cross-correlation to detect the occurrence of Type II outliers (false negative samples) for routine analysis in a clinically relevant application for medical diagnosis. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：303 / 311

页数：9

共 47 条

[1] STATISTICS NOTES - DIAGNOSTIC-TESTS-1 - SENSITIVITY AND SPECIFICITY .3.
ALTMAN, DG
BLAND, JM
[J]. BRITISH MEDICAL JOURNAL, 1994, 308 (6943) : 1552 - 1552
[2] ANDRIES E, 2010, CHEMOM, V24
[3] [Anonymous], 1997, Data Handling in Science and Technology 20a: Handbook of Chemometrics and Qualimentrics: Part A
[4] [Anonymous], 2004, Applied linear regression models
[5] Barnett V., 1995, OUTLIERS STAT DATA, V3th
[6] Beebe K.R., 1998, CHEMOMETRICS PRACTIC
[7] Belsley D.A., 2005, REGRESSION DIAGNOSTI
[8] BOND CF, 2006, PSYHCOM, V69, P291
[9] Burns D. A., 2007, HDB NEAR INFRARED AN, P673
[10] Chrominski K., 2010, Journal of Medical Informatics Technologies, V16, P89

← 1 2 3 4 5 →