Data analysis of electronic nose technology in lung cancer: generating prediction models by means of Aethena

被引:44
作者
Kort, Sharina [1 ]
Brusse-Keizer, Marjolein [2 ]
Gerritsen, Jan-Willem [3 ]
van der Palen, Job [2 ,4 ]
机构
[1] Med Spectrum Twente, Dept Pulm Med, Enschede, Netherlands
[2] Med Spectrum Twente, Med Sch Twente, Enschede, Netherlands
[3] eNose Co, Zutphen, Netherlands
[4] Univ Twente, Dept Res Methodol Measurement & Data Anal, Enschede, Netherlands
关键词
lung cancer; electronic nose; exhaled breath; aeonose; prediction models; data analysis; VOLATILE ORGANIC-COMPOUNDS; COMPUTED-TOMOGRAPHY; COST-EFFECTIVENESS; BREATH; VALIDATION; MORTALITY; DECISION;
D O I
10.1088/1752-7163/aa6b08
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Introduction. Only 15% of lung cancer cases present with potentially curable disease. Therefore, there is much. interest in a fast, non-invasive tool to detect lung cancer earlier. Exhaled breath analysis using. electronic nose technology measures volatile organic compounds (VOCs) in exhaled breath that. are associated with lung cancer. Methods. The diagnostic accuracy of the Aeonose (TM) is currently being studied in a multi-centre, prospective study in 210 subjects suspected for lung cancer, where approximately half will have a confirmed diagnosis and the other half will have a rejected diagnosis of lung cancer. We will also include 100-150 healthy control subjects. The eNose Company (provider of the Aeonose (TM)) uses a software program, called Aethena, comprising pre-processing, data compression and neural networks to handle big data analyses. Each individual exhaled breath measurement comprises a data matrix with thousands of conductivity values. This is followed by data compression using a Tucker3-like algorithm, resulting in a vector. Subsequently, model selection takes place after entering vectors with different presets in an artificial neural network to train and evaluate the results. Next, a 'judge model' is formed, which is a combination of models for optimizing performance. Finally, two types of cross-validation, being 'leave-10%-out' cross-validation and 'bagging', are used when recalculating the judge models. These judge models are subsequently used to classify new, blind measurements. Discussion. Data analysis in eNose technology is principally based on generating prediction models that. need to be validated internally and externally for eventual use in clinical practice. This paper describes the analysis of big data,. captured by eNose technology in lung cancer. This is done by means of generating prediction models with Aethena, a data analysis program specifically developed for analysing VOC data.
引用
收藏
页数:10
相关论文
共 50 条
[21]   Carotta: Revealing Hidden Confounder Markers in Metabolic Breath Profiles [J].
Hauschild, Anne-Christin ;
Frisch, Tobias ;
Baumbach, Joerg Ingo ;
Baumbach, Jan .
METABOLITES, 2015, 5 (02) :344-363
[22]  
Henschke CI, 2006, NEW ENGL J MED, V355, P1763, DOI 10.1056/NEJMoa060476
[23]   What Teachers Should Know About the Bootstrap: Resampling in the Undergraduate Statistics Curriculum [J].
Hesterberg, Tim C. .
AMERICAN STATISTICIAN, 2015, 69 (04) :371-386
[24]   Combined sputum hypermethylation and eNose analysis for lung cancer diagnosis [J].
Hubers, A. Jasmijn ;
Brinkman, Paul ;
Boksem, Remco J. ;
Rhodius, Robert J. ;
Witte, Birgit I. ;
Zwinderman, Aeilko H. ;
Heideman, Danielle A. M. ;
Duin, Sylvia ;
Koning, Remco ;
Steenbergen, Renske D. M. ;
Snijders, Peter J. F. ;
Smit, Egbert F. ;
Sterk, Peter J. ;
Thunnissen, Erik .
JOURNAL OF CLINICAL PATHOLOGY, 2014, 67 (08) :707-711
[25]   ZeitZeiger: supervised learning for high-dimensional data from an oscillatory system [J].
Hughey, Jacob J. ;
Hastie, Trevor ;
Butte, Atul J. .
NUCLEIC ACIDS RESEARCH, 2016, 44 (08) :e80
[26]   Gene set bagging for estimating the probability a statistically significant result will replicate [J].
Jaffe, Andrew E. ;
Storey, John D. ;
Ji, Hongkai ;
Leek, Jeffrey T. .
BMC BIOINFORMATICS, 2013, 14
[27]  
Jemal A., CA-CANCER J CLIN, V61, P69
[28]   Comparison of classification methods in breath analysis by electronic nose [J].
Leopold, Jan Hendrik ;
Bos, Lieuwe D. J. ;
Sterk, Peter J. ;
Schultz, Marcus J. ;
Fens, Niki ;
Horvath, Ildiko ;
Bikov, Andras ;
Montuschi, Paolo ;
Di Natale, Corrado ;
Yates, Deborah H. ;
Abu-Hanna, Ameen .
JOURNAL OF BREATH RESEARCH, 2015, 9 (04)
[29]   Detection of lung cancer by sensor array analyses of exhaled breath [J].
Machado, RF ;
Laskowski, D ;
Deffenderfer, O ;
Burch, T ;
Zheng, S ;
Mazzone, PJ ;
Mekhail, T ;
Jennings, C ;
Stoller, JK ;
Pyle, J ;
Duncan, J ;
Dweik, RA ;
Erzurum, SC .
AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2005, 171 (11) :1286-1291
[30]   Lung, cancer screening with helical computed tomography in older adult smokers - A decision and cost-effectiveness analysis [J].
Mahadevia, PJ ;
Fleisher, LA ;
Fric, KD ;
Eng, J ;
Goodman, SN ;
Powe, NR .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2003, 289 (03) :313-322