Research on predicting the productivity of cutter suction dredgers based on data mining with model stacked generalization

Cited by: 33
Authors
Wang, Bin [1]
Fan, Shidong [1]
Jiang, Pan [1]
Xing, Ting [1]
Fang, Zhenlong [1]
Wen, Quan [2]
Affiliations
[1] Wuhan Univ Technol, Sch Energy & Power Engn, Wuhan 430063, Peoples R China
[2] Changjiang Sea Route Planning Design Res Inst, Wuhan 430010, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cutter suction dredger; Data mining; Machine learning; Productivity prediction; ALGORITHMS;
DOI
10.1016/j.oceaneng.2020.108001
Chinese Library Classification number
U6 [Waterway Transportation]; P75 [Ocean Engineering];
Discipline classification code
0814 ; 081505 ; 0824 ; 082401 ;
Abstract
To address the problem that dredging prediction systems provide inaccurate productivity predictions and rely heavily on mud concentration data, this paper presents a data mining method that uses model stacked generalization to accurately predict dredger productivity in the absence of mud concentration data. First, abnormal construction data are eliminated, and l2-norm normalization and a log smoothing transformation are then applied to the data. Second, Spearman's rank correlation coefficient is used to extract features. Five machine learning models, namely Lasso, Elastic Net (ENet), gradient-boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), are introduced to predict dredger productivity. Based on these five models, a stacked generalization model is applied. The results show that the goodness of fit R² of the stacked generalization model for productivity prediction is 0.9281, which is higher than that of the other algorithms investigated, and the improvement from stacking is clear.
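The preprocessing and feature-screening steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the use of log(1 + x) as the smoothing transform, the 0.5 correlation threshold, and all function names are assumptions made for illustration, and the sketch uses only the Python standard library.

```python
import math


def log_smooth(values):
    """Apply log(1 + x), one common smoothing transform (assumed form)."""
    return [math.log1p(v) for v in values]


def l2_normalize(values):
    """Scale a vector to unit Euclidean (l2) norm; zero vectors are returned unchanged."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values] if norm > 0 else list(values)


def average_ranks(values):
    """Rank values from 1 to n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def select_features(features, target, threshold=0.5):
    """Keep feature names whose |Spearman rho| with the target meets the
    threshold (the threshold value is a hypothetical choice)."""
    return [
        name
        for name, column in features.items()
        if abs(spearman(column, target)) >= threshold
    ]
```

Because Spearman's coefficient depends only on ranks, a monotonic transform such as the log smoothing above leaves the screening result unchanged; the transform matters for the downstream regression models rather than for the feature selection itself.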
Pages: 12