Research on predicting the productivity of cutter suction dredgers based on data mining with model stacked generalization

被引:33
|
作者
Wang, Bin [1 ]
Fan, Shidong [1 ]
Jiang, Pan [1 ]
Xing, Ting [1 ]
Fang, Zhenlong [1 ]
Wen, Quan [2 ]
机构
[1] Wuhan Univ Technol, Sch Energy & Power Engn, Wuhan 430063, Peoples R China
[2] Changjiang Sea Route Planning Design Res Inst, Wuhan 430010, Peoples R China
基金
中国国家自然科学基金;
关键词
Cutter suction dredger; Data mining; Machine learning; Productivity prediction; ALGORITHMS;
D O I
10.1016/j.oceaneng.2020.108001
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
To solve the problem that dredging prediction systems provide inaccurate productivity predictions and rely heavily on mud concentration data. This paper presents a data mining method to accurately predict dredger productivity by using model-stacked generalization in the absence of mud concentration data. First, eliminate abnormal construction data, and l(2) norm normalization and log smooth transformation are then performed on the data. Second, Spearman's rank correlation coefficient method is used to extract features. Five machine learning models, namely, Lasso, Elastic net (ENet), Gradient-boosting decision tree (GBDT), extreme gradient boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM), were introduced to predict dredger productivity. Based on these five models, a stacked generalization model was applied. The results show that the goodness of fit R-2 of the stacked generalization model for productivity prediction is 0.9281, which is higher than the accuracy of the other algorithms investigated, and the optimization effect is obvious.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Research on patent information analyzing and predicting system based on data mining
    Liu, Yang
    International Journal of Hybrid Information Technology, 2015, 8 (05): : 207 - 214
  • [12] Research on data mining model based on rough sets
    Li, Longshu
    Yang, Weimin
    Li, Xuejun
    Xu, Yi
    2006 1ST INTERNATIONAL SYMPOSIUM ON PERVASIVE COMPUTING AND APPLICATIONS, PROCEEDINGS, 2006, : 851 - +
  • [13] Predicting COVID-19 Infected Cases: Exploring Stacked Generalization with Japanese Data
    Khan, M. Fahim Ferdous
    Dung, Mai Duy
    Sakamura, Ken
    Lecture Notes in Networks and Systems, 2023, 761 LNNS : 59 - 68
  • [14] A Model Proposal for Predicting Students' Academic Performances Based on Data Mining
    ALTUN, Murat
    KAYIKCI, Kemal
    IRMAK, Sezgin
    HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2022, 37 (03): : 1080 - 1098
  • [15] Stacked Machine Learning Model for Predicting Alzheimer's Disease Based on Genetic Data
    Alatrany, Abbas Saad
    Hussain, Abir
    Jamila, Mustafina
    Al-Jumeiy, Dhiya
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 594 - 598
  • [16] Impact of Digital Innovation on Corporate Productivity: A Predictive Model Based on Data Mining
    Tang, Xun
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 886 - 891
  • [17] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Ajitha Antony
    Ramanathan Karuppasamy
    Paddy and Water Environment, 2023, 21 : 231 - 242
  • [18] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Antony, Ajitha
    Karuppasamy, Ramanathan
    PADDY AND WATER ENVIRONMENT, 2023, 21 (02) : 231 - 242
  • [19] Offline Data Driven Evolutionary Optimization Based on Pruning Stacked Generalization
    Liang Z.-P.
    Huang X.-J.
    Li S.-T.
    Wang X.-Y.
    Zhu Z.-X.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (06): : 1306 - 1325
  • [20] Research on the prediction model of material cost based on data mining
    Shenyang, Liu
    Qi, Gao
    Zhen, Li
    Si, Li
    Zhiwei, Li
    Open Mechanical Engineering Journal, 2015, 9 (01): : 1062 - 1066