Optimizing Mean Fragment Size Prediction in Rock Blasting: A Synergistic Approach Combining Clustering, Hyperparameter Tuning, and Data Augmentation

被引:1
|
作者
Krop, Ian [1 ,2 ]
Sasaoka, Takashi [1 ]
Shimada, Hideki [1 ]
Hamanaka, Akihiro [1 ]
机构
[1] Kyushu Univ, Dept Earth Resources Engn, Fukuoka 8190395, Japan
[2] Jomo Kenyatta Univ Agr & Technol, Dept Min Mat & Petr Engn, POB 62000-00200, Nairobi, Kenya
来源
ENG | 2024年 / 5卷 / 03期
关键词
machine learning; regression; clustering; data augmentation; XGBoost; hyperparameter tuning; GROUND VIBRATION; MODEL;
D O I
10.3390/eng5030102
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Accurate estimation of the mean fragment size is crucial for optimizing open-pit mining operations. This study presents an approach that combines clustering, hyperparameter optimization, and data augmentation to enhance prediction accuracy using the Xtreme Gradient Boosting (XGBoost) regression model. A dataset of 110 blasts was divided into 97 blasts for training and testing, whereas a separate set of 13 new, unseen blasts was used to evaluate the robustness and generalization of the model. Hierarchical Agglomerative (HA) and K-means clustering algorithms were used, with HA clustering providing a higher cluster quality. To address class imbalance and improve model generalization, a synthetic minority oversampling technique for regression with Gaussian noise (SMOGN) was employed. Hyperparameter tuning was conducted using HyperOpt by comparing Random Search (RS) with the Advanced Tree-structured Parzen Estimator (ATPE). The combination of ATPE with HA clustering and SMOGN in an expanded search space produced the best results, achieving superior prediction accuracy and reliability. The proposed HAC1-SMOGN model, which integrates HA clustering, ATPE tuning, and SMOGN augmentation, achieved a mean squared error (MSE) of 0.0002 and an R2 of 0.98 on the test set. This study highlights the synergistic benefits of clustering, hyperparameter optimization, and data augmentation in enhancing machine learning models for regression tasks, particularly in scenarios with class imbalance or limited data.
引用
收藏
页码:1905 / 1936
页数:32
相关论文
empty
未找到相关数据