Linear Regression Modelling on Epigallocatechin-3-gallate Sensor Data for Green Tea

Cited by: 0
Authors
Modak, Angiras [1]
Chatterjee, Trisita Nandy [1]
Nag, Sangita [1]
Roy, Runu Banerjee [1]
Tudu, Bipan [1]
Bandyopadhyay, Rajib [1]
Affiliations
[1] Jadavpur Univ, Dept Instrumentat & Elect Engn, Kolkata, India
Source
2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN) | 2018
Keywords
electronic tongue; machine learning; linear regression; regularization techniques; MOLECULARLY IMPRINTED POLYMERS; VARIABLE SELECTION; CLASSIFICATION; POLYPHENOLS; STATISTICS; PYTHON; HEALTH
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, linear regression techniques from machine learning are applied to determine the quality of green tea samples. The data set is obtained by applying Differential Pulse Voltammetry (DPV) to green tea samples using an Epigallocatechin-3-gallate (EGCG) specific sensor based on the Molecularly Imprinted Polymer (MIP) technique. Multiple linear regression models are developed on this data set; they give further insight into the data and help identify the importance of each input feature. Regularization techniques, namely Ridge regression (L2 penalty), Lasso regression (L1 penalty) and ElasticNet regression (a combination of the L1 and L2 penalties), are applied to reduce overfitting and improve prediction. The variation of the cross-validation score with the regularization parameter is examined for each regularized technique, and the best value of the regularization parameter is selected to build a model with high prediction accuracy. The model metrics show that Lasso regression performs better than Ridge regression on this data set and eliminates the less important features; such sparsity can be useful in practice for high-dimensional data sets in which many features are not effective for modelling. The ElasticNet regression model is also highlighted, showing how the L1 and L2 penalties work together to give predictions with high accuracy.
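The comparison described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the record does not include the DPV data set or name the toolkit, so the sketch uses scikit-learn on synthetic data, and the sample size, alpha grid, `l1_ratio=0.5`, 5-fold split and R² scoring are all hypothetical choices.

```python
import numpy as np
from sklearn.linear_model import ElasticNet, Lasso, Ridge
from sklearn.model_selection import cross_val_score

# Synthetic stand-in for the sensor data (assumption): 200 samples,
# 20 features, of which only the first three actually drive the target.
rng = np.random.default_rng(0)
n_samples, n_features = 200, 20
X = rng.normal(size=(n_samples, n_features))
true_coef = np.zeros(n_features)
true_coef[:3] = [2.0, -1.5, 1.0]
y = X @ true_coef + 0.1 * rng.normal(size=n_samples)

# Hypothetical grid of regularization strengths to scan.
alphas = np.logspace(-3, 1, 9)

# Factories so that a fresh estimator is built for every alpha.
models = {
    "ridge": lambda a: Ridge(alpha=a),
    "lasso": lambda a: Lasso(alpha=a, max_iter=10000),
    "elasticnet": lambda a: ElasticNet(alpha=a, l1_ratio=0.5, max_iter=10000),
}

# Cross-validation score versus regularization parameter for each model;
# the alpha with the highest mean 5-fold R^2 is kept.
results = {}
for name, make in models.items():
    scores = [
        cross_val_score(make(a), X, y, cv=5, scoring="r2").mean()
        for a in alphas
    ]
    best = int(np.argmax(scores))
    results[name] = {"alpha": float(alphas[best]), "cv_r2": float(scores[best])}

# Sparsity check at one fixed alpha: the L1 penalty sets coefficients of
# uninformative features exactly to zero, the L2 penalty only shrinks them.
ridge_demo = Ridge(alpha=0.1).fit(X, y)
lasso_demo = Lasso(alpha=0.1, max_iter=10000).fit(X, y)
ridge_zero = int(np.sum(ridge_demo.coef_ == 0.0))
lasso_zero = int(np.sum(lasso_demo.coef_ == 0.0))

for name, res in results.items():
    print(f"{name}: best alpha={res['alpha']:.4g}, mean CV R^2={res['cv_r2']:.4f}")
print(f"zero coefficients at alpha=0.1: ridge={ridge_zero}, lasso={lasso_zero}")
```

Plotting each model's mean score against `alphas` would give the kind of score-versus-parameter curve the abstract refers to. Which model scores best depends on the data, so the ranking reported in the abstract should not be expected to carry over to this synthetic example.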
Pages: 112-117
Page count: 6