The linear random forest algorithm and its advantages in machine learning assisted logging regression modeling

被引:169
作者
Ao, Yile [1 ]
Li, Hongqi [1 ]
Zhu, Liping [1 ]
Ali, Sikandar [1 ]
Yang, Zhongguo [2 ]
机构
[1] China Univ Petr, Beijing, Peoples R China
[2] North China Univ Technol, Beijing, Peoples R China
关键词
Machine learning; Logging interpretation; Logging regression modeling; Linear random forest; Algorithm comparison; SUPPORT-VECTOR-REGRESSION; PETROLEUM RESERVOIR CHARACTERIZATION; ARTIFICIAL NEURAL-NETWORKS; PERMEABILITY PREDICTION; GENETIC-ALGORITHM; WATER SATURATION; WIRELINE LOGS; FUZZY-LOGIC; POROSITY; INDUCTION;
D O I
10.1016/j.petrol.2018.11.067
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Direct measurements of formation properties such as the shale volume, porosity, permeability, and fluid saturation are often accompanied by expensive cost and are time-consuming too. Well logging inversion provides an alternative way for the determination of formation properties. Compared to traditional theoretical models or formalized empirical fitted models, machine learning assisted logging regression modeling is more accurate and objective. Several machine learning regression algorithms such as neural networks, support vector regression, fuzzy logic, k nearest neighbors regression, multivariate adaptive regression spline, and random forest have already been applied. In this article, we present the Linear Random Forest algorithm and investigate its application in logging regression modeling. By systematic comparison with 8 other algorithms including least squared linear regression, neural networks, epsilon support vector regression, k nearest neighbors regression, regression tree, regression random forest, gradient descent boosted trees, and linear decision tree, the advantage of linear random forest in performance is confirmed by 24 real-world tasks from 7 different areas. Deeper discussions reveal that the advantages of linear random forest source from its strong learning ability, robustness, and feasibility of the hypothesis space. Through our study, the superiority of linear random forest for logging regression modeling is substantiated, which provides a more reasonable way for the further practices of logging regression modeling.
引用
收藏
页码:776 / 789
页数:14
相关论文
共 87 条