Clustering honey samples with unsupervised machine learning methods using FTIR data

被引:1
作者
Avcu, Fatih M. [1 ]
机构
[1] Inonu Univ, Dept Informat, TR-44280 Malatya, Turkiye
来源
ANAIS DA ACADEMIA BRASILEIRA DE CIENCIAS | 2024年 / 96卷 / 01期
关键词
Fouirer transform infrared spectrophotometer; hierarchical clustering analysis; machine learning; deep Learning; MULTIVARIATE; ORIGIN;
D O I
10.1590/0001-3765202420230409
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study utilizes Fourier transform infrared (FTIR) data from honey samples to cluster and categorize them based on their spectral characteristics. The aim is to group similar samples together, revealing patterns and aiding in classification. The process begins by determining the number of clusters using the elbow method, resulting in five distinct clusters. Principal Component Analysis (PCA) is then applied to reduce the dataset's dimensionality by capturing its significant variances. Hierarchical Cluster Analysis (HCA) further refines the sample clusters. 20% of the data, representing identified clusters, is randomly selected for testing, while the remainder serves as training data for a deep learning algorithm employing a multilayer perceptron (MLP). Following training, the test data are evaluated, revealing an impressive 96.15% accuracy. Accuracy measures the machine learning model's ability to predict class labels for new data accurately. This approach offers reliable honey sample clustering without necessitating extensive preprocessing. Moreover, its swiftness and cost-effectiveness enhance its practicality. Ultimately, by leveraging FTIR spectral data, this method successfully identifies similarities among honey samples, enabling efficient categorization and demonstrating promise in the field of spectral analysis in food science.
引用
收藏
页数:11
相关论文
共 50 条
[31]   A Review of Machine Learning Methods Recently Applied to FTIR Spectroscopy Data for the Analysis of Human Blood Cells [J].
Fadlelmoula, Ahmed ;
Catarino, Susana O. ;
Minas, Graca ;
Carvalho, Vitor .
MICROMACHINES, 2023, 14 (06)
[32]   Comparative Analysis of Machine Learning Clustering Methods for Electroretinogram [J].
Zhdanov, Aleksei ;
Bulev, Daniil ;
Dolganov, Anton ;
Kulyabin, Mikhail .
ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 :385-392
[33]   Clustering and Data Aggregation in Wireless Sensor Networks Using Machine Learning Algorithms [J].
Shahina, K. ;
Vaidehi, V. .
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING (ICRTAC-CPS 2018), 2018, :109-115
[34]   Using Information about Influencing Factors to Split Data Samples in Machine Learning Methods for the Purposes of Assessing Information Security [J].
I. S. Lebedev ;
M. E. Sukhoparov .
Automatic Control and Computer Sciences, 2022, 56 :981-987
[35]   Using Information about Influencing Factors to Split Data Samples in Machine Learning Methods for the Purposes of Assessing Information Security [J].
Lebedev, I. S. ;
Sukhoparov, M. E. .
AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (08) :981-987
[36]   Assessment of the regeneration of landslides areas using unsupervised and supervised methods and explainable machine learning models [J].
Arrogante-Funes, Patricia ;
Bruzon, Adrian G. ;
Alvarez-Ripado, Ariadna ;
Arrogante-Funes, Fatima ;
Martin-Gonzalez, Fidel ;
Novillo, Carlos J. .
LANDSLIDES, 2024, 21 (02) :275-290
[37]   Assessment of the regeneration of landslides areas using unsupervised and supervised methods and explainable machine learning models [J].
Patricia Arrogante-Funes ;
Adrián G. Bruzón ;
Ariadna Álvarez-Ripado ;
Fátima Arrogante-Funes ;
Fidel Martín-González ;
Carlos J. Novillo .
Landslides, 2024, 21 (2) :275-290
[38]   Unsupervised Machine Learning-Based Clustering of Nanosized Fluorescent Extracellular Vesicles [J].
Kuypers, Soren ;
Smisdom, Nick ;
Pintelon, Isabel ;
Timmermans, Jean-Pierre ;
Ameloot, Marcel ;
Michiels, Luc ;
Hendrix, Jelle ;
Hosseinkhani, Baharak .
SMALL, 2021, 17 (05)
[39]   Communicating exploratory unsupervised machine learning analysis in age clustering for paediatric disease [J].
Spear, Joshua William ;
Pissaridou, Eleni ;
Bowyer, Stuart ;
Bryant, William A. ;
Key, Daniel ;
Booth, John ;
Spiridou, Anastasia ;
Denaxas, Spiros ;
Pope, Rebecca ;
Taylor, Andrew M. ;
Hemingway, Harry ;
Sebire, Neil J. .
BMJ HEALTH & CARE INFORMATICS, 2024, 31 (01) :1-7
[40]   Galaxy Clustering and Classification using Machine Learning Algorithms and XAI [J].
Elvitigala, Amasha ;
Navaratne, Udani ;
Rathnayake, Samadhi ;
Dissanayaka, Dr Kapila .
2024 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY RESEARCH, ICITR, 2024,