OPLS methodology for analysis of pre-processing effects on spectroscopic data

被引:46
|
作者
Gabrielsson, Jon [1 ]
Jonsson, Hans
Airiau, Christian
Schmidt, Bernd
Escott, Richard
Trygg, Johan
机构
[1] Umea Univ, Res Grp Chemometr, SE-90187 Umea, Sweden
[2] GlaxoSmithKline Inc, Tonbridge, Kent, England
关键词
multi-block strategies; pre-processing; UV-data; OPLS; O2PLS; batch process;
D O I
10.1016/j.chemolab.2006.03.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pre-processing of spectroscopic data is commonly applied to remove unwanted systematic variation. Possible loss of information and ambiguity regarding discarded variation are issues that complicate pre-treatment of data. In this paper, OPLS methodology is applied to evaluate different techniques for pre-processing of spectroscopic data gathered from a batch process. The objective is to present a rational scheme for analysis of preprocessing in order to understand the influence and effect of pre-treatment. O2PLS uses linear regression to divide the systematic variation in X and Y into three parts; one part with joint X-Y covariation, i.e. related to both X and Y, one part of X with Y-orthogonal variation and one part of Y with X-orthogonal variation. All of the investigated pre-treatment methods removed an additive baseline as expected. In the analysis of raw and differentiated data variation associated with the baseline was found in the Y-orthogonal part of X. Orthogonal information was also found in Y, which suggests that this preprocessing procedure not only removed variation. This would have been more difficult to detect without the O2PLS model since both raw and differentiated data must be analysed simultaneously. Development of a knowledge based strategy with OPLS methodology is an important step towards eliminating trial and error approaches to pre-processing. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:153 / 158
页数:6
相关论文
共 50 条
  • [21] PRE-PROCESSING OF DATA FOR CHARACTER RECOGNITION
    ALCORN, TM
    HOGGAR, CW
    MARCONI REVIEW, 1969, 32 (172): : 61 - &
  • [22] Pre-processing Agilent microarray data
    Zahurak, Marianna
    Parmigiani, Giovanni
    Yu, Wayne
    Scharpf, Robert B.
    Berman, David
    Schaeffer, Edward
    Shabbeer, Shabana
    Cope, Leslie
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [23] Pre-processing Agilent microarray data
    Marianna Zahurak
    Giovanni Parmigiani
    Wayne Yu
    Robert B Scharpf
    David Berman
    Edward Schaeffer
    Shabana Shabbeer
    Leslie Cope
    BMC Bioinformatics, 8
  • [24] PRESISTANT: Data Pre-processing Assistant
    Bilalli, Besim
    Abello, Alberto
    Aluja-Banet, Tomas
    Munir, Rana Faisal
    Wrembel, Robert
    INFORMATION SYSTEMS IN THE BIG DATA ERA, 2018, 317 : 57 - 65
  • [25] Testing the effects of pre-processing on voxel based morphometry analysis
    Chaitanya, C., V
    Koirala, N.
    Mideksa, K. G.
    Anwar, A. R.
    Schmidt, G.
    Deuschl, G.
    Groppa, S.
    Muthuraman, M.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 4302 - 4305
  • [26] Analysis of Document Pre-Processing Effects in Text and Opinion Mining
    Eler, Danilo Medeiros
    Grosa, Denilson
    Pola, Ives
    Garcia, Rogerio
    Correia, Ronaldo
    Teixeira, Jaqueline
    INFORMATION, 2018, 9 (04)
  • [27] Assessing effects of pre-processing mass spectrometry data on classification performance
    Ozcift, Akin
    Gulten, Arif
    EUROPEAN JOURNAL OF MASS SPECTROMETRY, 2008, 14 (05) : 267 - 273
  • [28] stagg: A data pre-processing R package for climate impacts analysis
    Liddell, Tyler
    Boser, Anna S.
    Orofino, Sara
    Mangin, Tracey
    Carleton, Tamma
    ENVIRONMENTAL MODELLING & SOFTWARE, 2025, 183
  • [29] Online calibration and pre-processing of TAMA data
    Tatsumi, D
    Tsunesada, Y
    CLASSICAL AND QUANTUM GRAVITY, 2004, 21 (05) : S451 - S456
  • [30] Data pre-processing pipeline generation for AutoETL
    Giovanelli, Joseph
    Bilalli, Besim
    Abello, Alberto
    INFORMATION SYSTEMS, 2022, 108