Research on predicting the productivity of cutter suction dredgers based on data mining with model stacked generalization

被引:33
作者
Wang, Bin [1 ]
Fan, Shidong [1 ]
Jiang, Pan [1 ]
Xing, Ting [1 ]
Fang, Zhenlong [1 ]
Wen, Quan [2 ]
机构
[1] Wuhan Univ Technol, Sch Energy & Power Engn, Wuhan 430063, Peoples R China
[2] Changjiang Sea Route Planning Design Res Inst, Wuhan 430010, Peoples R China
基金
中国国家自然科学基金;
关键词
Cutter suction dredger; Data mining; Machine learning; Productivity prediction; ALGORITHMS;
D O I
10.1016/j.oceaneng.2020.108001
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
To solve the problem that dredging prediction systems provide inaccurate productivity predictions and rely heavily on mud concentration data. This paper presents a data mining method to accurately predict dredger productivity by using model-stacked generalization in the absence of mud concentration data. First, eliminate abnormal construction data, and l(2) norm normalization and log smooth transformation are then performed on the data. Second, Spearman's rank correlation coefficient method is used to extract features. Five machine learning models, namely, Lasso, Elastic net (ENet), Gradient-boosting decision tree (GBDT), extreme gradient boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM), were introduced to predict dredger productivity. Based on these five models, a stacked generalization model was applied. The results show that the goodness of fit R-2 of the stacked generalization model for productivity prediction is 0.9281, which is higher than the accuracy of the other algorithms investigated, and the optimization effect is obvious.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] The Model and Empirical Research of Application Scoring Based on Data Mining Methods
    Lai Hui
    Shuai Li
    Zhou Zongfang
    FIRST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2013, 17 : 911 - 918
  • [22] Research on the Identification Model of Customer Knowledge Source Based on Data Mining
    Wei Hongmei
    Ju Xiaofeng
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INNOVATION AND MANAGEMENT, VOLS I AND II, 2009, : 1561 - 1566
  • [23] Research on Electricity Consumption Model of Library Building Based on Data Mining
    Dou J.
    Ma H.
    Guo R.
    Energy Engineering: Journal of the Association of Energy Engineering, 2022, 119 (06): : 2407 - 2429
  • [24] Research on Data Mining Model Based on Context-Aware System
    Yang Wen-yi
    Ye Dan
    Xiao Bo
    Lin Zhi-qing
    Lu Yue-ming
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3789 - 3793
  • [25] Research on Data Mining Model of Intelligent Transportation Based on Granular Computing
    Xie, Xiao-Lan
    Gu, Xiao-Feng
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (07): : 281 - 286
  • [26] Research On Construction Of Educational Management Model Based On Data Mining Technology
    Zhao, He
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2023, 26 (05): : 613 - 621
  • [27] Research on Improved Model of Electronic Commerce Data Mining Based on Big Data Technology
    Xu, Hongsheng
    Fan, Ganglong
    Li, Ke
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON SOCIAL NETWORK, COMMUNICATION AND EDUCATION (SNCE 2017), 2017, 82 : 48 - 52
  • [28] A new unsupervised data mining method based on the stacked autoencoder for chemical process fault diagnosis
    Zheng, Shaodong
    Zhao, Jinsong
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 135
  • [29] Generalization-based data mining in object-oriented databases using an object cube model
    Han, J
    Nishio, S
    Kawano, H
    Wang, W
    DATA & KNOWLEDGE ENGINEERING, 1998, 25 (1-2) : 55 - 97
  • [30] English Research Based on Big Data and Data Mining
    Chen, Dafeng
    Han, Bingqing
    MATERIAL SCIENCE, CIVIL ENGINEERING AND ARCHITECTURE SCIENCE, MECHANICAL ENGINEERING AND MANUFACTURING TECHNOLOGY II, 2014, 651-653 : 2462 - 2465