Rapid discrimination modeling of common wine and food residues in archaeology based on machine learning and infrared spectroscopy

被引:0
作者
Xiao, Nan [1 ,2 ]
Qi, Xuekai [1 ]
Wang, Bin [3 ]
Huang, He [1 ]
Wen, Rui [1 ]
机构
[1] Northwest Univ, Sch Cultural Heritage, Xian 710127, Peoples R China
[2] Univ Chinese Acad Sci, Dept Archaeol & Anthropol, Beijing 100049, Peoples R China
[3] Northwest Univ, Sch Informat Sci & Technol, Xian 710069, Peoples R China
关键词
Machine learning; Infrared spectroscopy; Simulation experiments; Residues; Archaeology; CHROMATOGRAPHY; OPTIMIZATION; CONSERVATION; EXTRACTION; MARKERS; KERNEL;
D O I
10.1016/j.culher.2025.03.005
中图分类号
K85 [文物考古];
学科分类号
0601 ;
摘要
The study of wine and food residues in archaeology offers crucial insights into ancient diets and brewing techniques. Traditional detection methods, however, are often complex and time-consuming. Bridging the gap between excavation sites and laboratories is vital for enhancing real-time analysis and artifact preservation. This paper presents a non-targeted spectral fingerprinting method that integrates simulated experiments, Fourier Transform Infrared Spectroscopy (FTIR), and machine learning algorithms for the rapid identification of food and wine residues in archaeological excavations. Infrared spectral data were collected from 23 modern food and liquor samples subjected to simulated aging. A comprehensive preprocessing protocol was developed, including smoothing, baseline correction, and normalization, to reduce unwanted variability and enhance data quality. Eight spectral preprocessing methods were assessed, including standard normal variate (SNV), multiple scatter correction (MSC), and various derivative techniques. The final model, which employed SNV preprocessing, demonstrated superior prediction accuracy and robustness. Six common machine learning algorithms-linear discriminant analysis (LDA), decision tree classification (DTC), support vector machine (SVM), random forest (RF), k-nearest neighbor (KNN), and backpropagation neural network (BPNN)-were utilized for modeling and comparison. Results indicated that the RF, KNN, and BPNN models were particularly effective, achieving prediction accuracies near 100 %. In external validation with real archaeological samples and those simulated to be aged for nearly ten years, the BPNN model achieved a confidence estimate of 99 % for validating archaeological wine residue samples, while other models provided confidence estimates above 70 %. However, due to the significant loss of characteristic substances from prolonged aging, the current model has difficulty distinguishing specific wine or food types. Future research should focus on improving model portability for on-site screening and expanding the database of simulated aged residues through multi-platform collaboration. (c) 2025 Elsevier Masson SAS. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:195 / 205
页数:11
相关论文
共 60 条
  • [1] Beef muscle discrimination based on two-trace two-dimensional correlation spectroscopy (2T2D COS) combined with snapshot visible-near infrared multispectral imaging
    Ait-Kaddour, Abderrahmane
    Loudiyi, Mohammed
    Boukria, Oumayma
    Safarov, Jasur
    Sultanova, Shaxnoza
    Andueza, Donato
    Listrat, Anne
    Cahyana, Yana
    [J]. MEAT SCIENCE, 2024, 214
  • [2] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [3] Andrea V., 2012, Application of Infrared Spectroscopy for the Prediction of Color Components of Red Wines
  • [4] Feasibility study on the use of ATR-FTIR spectroscopy as a tool for the estimation of wine polysaccharides
    Baca-Bocanegra, Berta
    Martinez-Lapuente, Leticia
    Nogales-Bueno, Julio
    Hernandez-Hierro, Jose Miguel
    Ferrer-Gallego, Raill
    [J]. CARBOHYDRATE POLYMERS, 2022, 287
  • [5] Spectroscopic and sensory characterization of Brazilian Coffea canephora terroir and botanical varieties produced in the Amazon and Espirito Santo implementing multi-block approaches
    Baqueta, Michel Rocha
    Marini, Federico
    Teixeira, Alexsandro Lara
    Goulart, Bruno Henrique Fermino
    Pilau, Eduardo Jorge
    Valderrama, Patricia
    Pallone, Juliana Azevedo Lima
    [J]. JOURNAL OF FOOD COMPOSITION AND ANALYSIS, 2024, 133
  • [6] Barker A, 2010, ETHNOBIOL LETT, V1, P58
  • [7] STANDARD NORMAL VARIATE TRANSFORMATION AND DE-TRENDING OF NEAR-INFRARED DIFFUSE REFLECTANCE SPECTRA
    BARNES, RJ
    DHANOA, MS
    LISTER, SJ
    [J]. APPLIED SPECTROSCOPY, 1989, 43 (05) : 772 - 777
  • [8] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [9] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [10] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32