A comparative evaluation of machine learning algorithms for predicting syngas fermentation outcomes

Cited by: 9
Authors
Roell, Garrett W. [1]
Sathish, Ashik [2, 3]
Wan, Ni [1]
Cheng, Qianshun [4]
Wen, Zhiyou [2, 3]
Tang, Yinjie J. [1]
Bao, Forrest Sheng [5]
Affiliations
[1] Washington Univ St Louis, DOE Environm & Chem Engn, St Louis, MO 63130 USA
[2] Iowa State Univ, Dept Agr & Biosyst Engn, Ames, IA 50011 USA
[3] Iowa State Univ, Dept Food Sci & Human Nutr, Ames, IA 50011 USA
[4] Univ Illinois, Dept Math Stat & Comp Sci, Chicago, IL 60607 USA
[5] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
Funding
National Science Foundation (USA)
Keywords
Clostridium carboxidivorans; Neural network; Random forest; Support vector machine; Data transformation; Model predictive control; HOLLOW-FIBER MEMBRANE; MASS-TRANSFER; BUTANOL; P7;
DOI
10.1016/j.bej.2022.108578
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Clostridium carboxidivorans can use syngas to produce acids and alcohols. However, simulating gas fermentation dynamics remains challenging. This study employed data transformation and machine learning (ML) approaches to predict syngas fermentation behavior. Syngas composition and fermentative metabolite concentrations (features) were paired with the production rates (prediction targets) of acetate, ethanol, butyrate, and butanol at each time point. This transformation avoided the use of time as a feature. Data augmentation by polynomial smoothing of experimental measurements was used to create a database for supervised learning of 836 rate instances from 10 gas compositions. Seven families of ML algorithms were compared, including neural networks, support vector machines, random forests, elastic nets, lasso regressors, k-nearest neighbors, and Bayesian ridge regressors. These algorithms predicted production rates for training data with Pearson correlation coefficients (R² > 0.9), but they showed poorer performance for predicting unseen test data. Among the algorithms, random forests and support vector machines produced the most accurate predictions for the test data, which could regenerate product concentration curves (R² ≈ 0.85). In contrast, neural networks had a higher risk of over-fitting. Additionally, ML-based feature importance analysis highlighted the significant impacts of CO and H₂ on alcohol production, which offers guidance for model predictive control. Together, these findings can help direct future applications of ML algorithms to complex bioprocesses with limited data.
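The data transformation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `rate_instances`, the polynomial degree, and the number of resampled points are assumptions chosen for illustration, and the abstract does not give the actual smoothing settings. The sketch fits a polynomial to each metabolite's concentration curve, resamples it on a denser time grid (the augmentation step), and differentiates it to obtain production rates, so that each row pairs gas composition and concentrations with rates and time itself never appears as a feature.

```python
import numpy as np

def rate_instances(t, conc, gas, degree=3, n_points=20):
    """Turn one fermentation run into (features, rates) rows.

    t        : sampling times for the run
    conc     : dict mapping metabolite name -> measured concentrations
    gas      : dict mapping gas name -> fraction (assumed constant over the run)
    degree   : polynomial degree used for smoothing (assumed value)
    n_points : number of resampled time points (assumed value)
    """
    t = np.asarray(t, dtype=float)
    grid = np.linspace(t.min(), t.max(), n_points)
    metabolites = sorted(conc)
    gases = sorted(gas)

    smoothed, rates = [], []
    for name in metabolites:
        # Least-squares polynomial fit of concentration versus time.
        poly = np.polynomial.Polynomial.fit(t, conc[name], degree)
        smoothed.append(poly(grid))        # smoothed concentration (feature)
        rates.append(poly.deriv()(grid))   # d(concentration)/dt (target)

    gas_cols = [np.full(n_points, gas[g]) for g in gases]
    X = np.column_stack(gas_cols + smoothed)  # features: gas + concentrations
    y = np.column_stack(rates)                # targets: production rates
    return X, y, gases + metabolites
```

Rows produced this way from several runs could be stacked and passed to any regressor, such as a random forest. Integrating the predicted rates forward in time would then regenerate concentration curves like those the abstract evaluates.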
Pages: 10