Identification of millet origin using terahertz spectroscopy combined with ensemble learning

Cited by: 0
Authors
Yin, Xianhua
Tian, Hao
Zhang, Fuqiang
Xu, Chuanpei [1]
Tang, Linkai
Wei, Yongbing
Affiliations
[1] Guilin Univ Elect Technol, Sch Elect Engn & Automat, Guilin 541004, Guangxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Terahertz time-domain spectroscopy; Millet; Geographical origin; Machine learning; Ensemble learning; Stacking; Topsis; LIQUID-CHROMATOGRAPHY; HEALTH-BENEFITS; DISCRIMINATION; PRODUCTS; TOPSIS; AUTHENTICITY;
DOI
10.1016/j.infrared.2024.105547
Chinese Library Classification
TH7 [Instruments and meters];
Discipline classification codes
0804 ; 080401 ; 081102 ;
Abstract
Accurately tracing the origin of millet matters to both producers and consumers, because price and taste differ significantly between millets from different origins. Traditional methods of identifying millet origin are time-consuming, laborious, complex, and destructive. This study develops a fast, non-destructive method for differentiating millet origins by combining terahertz time-domain spectroscopy with ensemble learning. First, three machine learning algorithms, namely support vector machine (SVM), random forest (RF), and kernel extreme learning machine (KELM), were used to build discriminative models, and the effect of six preprocessing methods on the models' classification performance was compared. Models using Savitzky-Golay preprocessing were clearly superior at determining the millet's geographical origin. Building on these findings, the study introduces an ensemble learning strategy that combines TOPSIS and stacking to exploit the collective strengths of the three algorithms. This approach distinguishes millets from five locations without any parameter fine-tuning. Accuracy, F1 score, and Kappa on the prediction set are all 100%, significantly outperforming the single models, the traditional voting method, and the stacking method. The results suggest that terahertz time-domain spectroscopy combined with TOPSIS-Stacking ensemble learning is a promising method for rapid, non-destructive, high-accuracy discrimination of millet geographical origins.
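The abstract does not say how TOPSIS is coupled to the stacking step, so the following is only a minimal sketch of the generic TOPSIS closeness score, under the assumption that it is used to rank candidate models by several evaluation metrics. The function name `topsis`, the equal weights, and the metric values are all hypothetical illustrations and are not taken from the paper.

```python
import math

def topsis(matrix, weights, benefit):
    """Return the TOPSIS closeness score (0..1) for each row of `matrix`.

    matrix  : rows are alternatives (e.g. models), columns are criteria
    weights : one weight per criterion
    benefit : True where larger is better, False where smaller is better
    """
    n_crit = len(weights)
    # Vector-normalise each column; fall back to 1.0 for an all-zero column.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) or 1.0
             for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
                for row in matrix]
    columns = list(zip(*weighted))
    # Ideal-best and ideal-worst points, criterion by criterion.
    best = [max(c) if benefit[j] else min(c) for j, c in enumerate(columns)]
    worst = [min(c) if benefit[j] else max(c) for j, c in enumerate(columns)]
    scores = []
    for row in weighted:
        d_best = math.dist(row, best)
        d_worst = math.dist(row, worst)
        total = d_best + d_worst
        # If all alternatives are identical, no ranking is possible.
        scores.append(d_worst / total if total > 0 else 0.5)
    return scores

# Hypothetical validation metrics (accuracy, F1, Kappa) for three
# candidate models; made-up numbers for illustration only.
metrics = [
    [0.95, 0.94, 0.93],
    [0.97, 0.97, 0.96],
    [0.90, 0.89, 0.88],
]
scores = topsis(metrics, weights=[1 / 3] * 3, benefit=[True] * 3)
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy input the second model is best on every criterion, so it coincides with the ideal-best point and ranks first. How such scores would feed into a stacking ensemble (for example, as base-learner weights or as a selection rule) is left open here because the abstract does not specify it.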
Pages: 11