Drift in a popular metal oxide sensor dataset reveals limitations for gas classification benchmarks

被引:37
作者
Dennler, Nik [1 ,2 ]
Rastogi, Shavika [1 ,2 ]
Fonollosa, Jordi [3 ,4 ]
Van Schaik, Andre [2 ]
Schmuker, Michael [1 ]
机构
[1] Univ Hertfordshire, Ctr Comp Sci & Informat Res, UH Biocomputat Res Grp, Hatfield, England
[2] Western Sydney Univ, Int Ctr Neuromorph Syst, Sydney, Australia
[3] Univ Politecn Cataluna, Dept Enginyeria Sistemes Automat Informat Ind, Barcelona, Spain
[4] Inst Recerca Sant Joan de Deu, Esplugas de Llobregat, Spain
基金
欧盟地平线“2020”;
关键词
Metal oxide gas sensors; Wind tunnel dataset; Sensor drift; Gas recognition; ARRAYS; DISCRIMINATION; SYSTEMS;
D O I
10.1016/j.snb.2022.131668
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Metal oxide (MOx) gas sensors are a popular choice for many applications, due to their tunable sensitivity, space efficiency and low cost. Publicly available sensor datasets are particularly valuable for the research community as they accelerate the development and evaluation of novel algorithms for gas sensor data analysis. A dataset published in 2013 by Vergara and colleagues contains recordings from MOx gas sensor arrays in a wind tunnel. It has since become a standard benchmark in the field. Here we report a latent property of this dataset that limits its suitability for gas classification studies. Measurement timestamps show that gases were recorded in separate, temporally clustered batches. Sensor baseline response before gas exposure were strongly correlated with the recording batch, to the extent that baseline response was largely sufficient to infer the gas used in a given trial. Zero-offset baseline compensation did not resolve the issue, since residual short-term drift still contained enough information for gas/trial identification using a machine learning classifier. A subset of the data recorded within a short period of time was minimally affected by drift and suitable for gas classification benchmarking after offset compensation, but with much reduced classification performance compared to the full dataset. We found 18 publications where this dataset was used without precautions against the circumstances we describe, thus potentially overestimating the accuracy of gas classification algorithms. These observations highlight potential pitfalls in using previously recorded gas sensor data, which may have distorted widely reported results.
引用
收藏
页数:8
相关论文
共 47 条
[1]   Breath Acetone Sensors as Non-Invasive Health Monitoring Systems: A Review [J].
Alizadeh, Naader ;
Jamalabadi, Hoda ;
Tavoli, Farnaz .
IEEE SENSORS JOURNAL, 2020, 20 (01) :5-31
[2]  
Araujo I., 2019, AN 16 ENC NAC INT AR, P844, DOI [10.5753/eniac.2019.9339, DOI 10.5753/ENIAC.2019.9339]
[3]   Time-dependent gas distribution modelling [J].
Asadi, Sahar ;
Fan, Han ;
Bennetts, Victor Hernandez ;
Lilienthal, Achim J. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 96 :157-170
[4]   A PRACTICAL RANDOMIZED CP TENSOR DECOMPOSITION [J].
Battaglino, Casey ;
Ballard, Grey ;
Kolda, Tamara G. .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2018, 39 (02) :876-901
[5]  
Burgues J, 2019, SIGNAL PROCESSING MA
[6]   Feature Extraction for Transient Chemical Sensor Signals in Response to Turbulent Plumes: Application to Chemical Source Distance Prediction [J].
Burgues, Javier ;
Marco, Santiago .
SENSORS AND ACTUATORS B-CHEMICAL, 2020, 320
[7]   Wind-Independent Estimation of Gas Source Distance From Transient Features of Metal Oxide Sensor Signals [J].
Burgues, Javier ;
Marco, Santiago .
IEEE ACCESS, 2019, 7 :140460-140469
[8]   Estimation of the limit of detection in semiconductor gas sensors through linearized calibration models [J].
Burgues, Javier ;
Manuel Jimenez-Soto, Juan ;
Marco, Santiago .
ANALYTICA CHIMICA ACTA, 2018, 1013 :13-25
[9]   Iterative complex network approach for chemical gas sensor array characterisation [J].
Cardellicchio, Angelo ;
Lombardi, Angela ;
Guaragnella, Cataldo .
JOURNAL OF ENGINEERING-JOE, 2019, (06) :4612-4616
[10]  
Chang I.-S., 2020, PROC KOREAN SOC BROA, P21