Real-time prediction of rock mass classification based on TBM operation big data and stacking technique of ensemble learning

被引:128
作者
Hou, Shaokang [1 ]
Liu, Yaoru [1 ]
Yang, Qiang [1 ]
机构
[1] Tsinghua Univ, State Key Lab Hydrosci & Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Tunnel boring machine (TBM) operation data; Rock mass classification; Stacking ensemble learning; Sample imbalance; Synthetic minority oversampling technique (SMOTE); PERFORMANCE PREDICTION; GEOLOGICAL CONDITIONS; TUNNEL; FRAMEWORK; SELECTION; AGREEMENT; MODEL;
D O I
10.1016/j.jrmge.2021.05.004
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Real-time prediction of the rock mass class in front of the tunnel face is essential for the adaptive adjustment of tunnel boring machines (TBMs). During the TBM tunnelling process, a large number of operation data are generated, reflecting the interaction between the TBM system and surrounding rock, and these data can be used to evaluate the rock mass quality. This study proposed a stacking ensemble classifier for the real-time prediction of the rock mass classification using TBM operation data. Based on the Songhua River water conveyance project, a total of 7538 TB M tunnelling cycles and the corresponding rock mass classes are obtained after data preprocessing. Then, through the tree-based feature selection method, 10 key TBM operation parameters are selected, and the mean values of the 10 selected features in the stable phase after removing outliers are calculated as the inputs of classifiers. The preprocessed data are randomly divided into the training set (90%) and test set (10%) using simple random sampling. Besides stacking ensemble classifier, seven individual classifiers are established as the comparison. These classifiers include support vector machine (SVM), k-nearest neighbors (KNN), random forest (RF), gradient boosting decision tree (GBDT), decision tree (DT), logistic regression (LR) and multilayer perceptron (MLP), where the hyper-parameters of each classifier are optimised using the grid search method. The prediction results show that the stacking ensemble classifier has a better performance than individual classifiers, and it shows a more powerful learning and generalisation ability for small and imbalanced samples. Additionally, a relative balance training set is obtained by the synthetic minority oversampling technique (SMOTE), and the influence of sample imbalance on the prediction performance is discussed. (C) 2022 Institute of Rock and Soil Mechanics, Chinese Academy of Sciences. Production and hosting by Elsevier B.V.
引用
收藏
页码:123 / 143
页数:21
相关论文
共 84 条
  • [1] Prediction of geological hazardous zones in front of a tunnel face using TSP-203 and artificial neural networks
    Alimoradi, Andisheh
    Moradzadeh, Ali
    Naderi, Reza
    Salehi, Mojtaba Zad
    Etemadi, Afshin
    [J]. TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2008, 23 (06) : 711 - 717
  • [2] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [3] [Anonymous], 2014, GBT502182014
  • [4] [Anonymous], 2008, Gb50487-2008
  • [5] Some new Q-value correlations to assist in site characterisation and tunnel design
    Barton, N
    [J]. INTERNATIONAL JOURNAL OF ROCK MECHANICS AND MINING SCIENCES, 2002, 39 (02) : 185 - 216
  • [6] Bieniawski Z.T., 1973, CIV ENG SIVIELE INGE, V15, P335, DOI 10.1016/0148-906290924-3
  • [7] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [8] Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
  • [9] Breiman L., 2001, Machine Learning, V45, P5
  • [10] A framework for dynamic classifier selection oriented by the classification problem difficulty
    Brun, Andre L.
    Britto, Alceu S., Jr.
    Oliveira, Luiz S.
    Enernbreck, Fabricio
    Sabourin, Robert
    [J]. PATTERN RECOGNITION, 2018, 76 : 175 - 190