Conventionally, many researchers have used both regression and black box techniques to estimate the unconfined compressive strength (UCS) of different rocks. The advantage of the regression approach is that it can be used to render a functional relationship between the predictive rock indices and its UCS. The advantage of the black box techniques is in rendering more accurate predictions. Gene expression programming (GEP) is proposed, in this study, as a robust mathematical alternative for predicting the UCS of carbonate rocks. The two parameters of total porosity and P-wave speed were selected as predictive indices. The proposed GEP model had the advantage of the both traditionally used approaches by proposing a mathematical model, similar to a regression, while keeping the prediction errors as low as the black box methods. The GEP outperformed both artificial neural networks and support vector machines in terms of yielding more accurate estimates of UCS. Both the porosity and the P-wave velocity were sufficient predictive indices for estimating the UCS of the carbonate rocks in this study. Nearly, 95% of the observed variation in the UCS values was explained by these two parameters (i.e., R2 =95%).
The unconfined compressive strength (UCS) is a rock strength index that is widely used in geothechnical, mining, and construction projects. For estimating/measuring the UCS, two methods can be used: (i) direct UCS measurement, and (ii) indirect UCS estimation. Both the ASTM [1] and the ISRM [2] have specific laboratory and in situ testing procedures for the direct measurement of UCS. Sophisticated representative samples must however be prepared before direct tests can be conducted. These samples are, at times, difficult to obtain. For example, the preparation of standard specimens in highly weathered/fractured rocks is both a challenging and time/money consuming task. Thus, a considerable amount of research has been devoted to the indirect UCS estimation from a range of mechanical, physical, and petrological indices in an effort to address these limitations. The indirect UCS estimation methods can be categorized into two classes: black box [3][4][5][6] and regression approaches [7][8][9][10]. In the black box approach, that is mainly applied by soft computing techniques such as artificial neural networks (ANNs), a kernel function maps a vector of input (independent) variables into a vector of response (dependent) variables. After learning from a portion of the available datasets (training dataset), the model will be able to predict values of the response vector for the other portion (testing dataset). These techniques, however, do not render a functional relationship between the input and output vectors. Thus, they are referred to as black box techniques. In contrast, the regression methods (e.g., multiple variable and support vector regression) propose a functional relationship between the variables. This results in a better understanding of both the underlying process and interactions between variables. A number of studies have found that the soft computing methods perform better than the regression approach performs in terms of accurate predictions [11][12][13]. Thus, the advantage of the black box approach is in proposing better estimates while the advantage of the regression is in proposing a functional relationship between the variables. The genetic programming (GP) technique was adapted in this study that combines the individual advantages of other methods into a single framework. The GP simultaneously reduces the prediction errors (comparable to the black box techniques) and renders a functional relationship (comparable to the regression approach).
Genetic programming is among the recent advances made in mathematical modeling, which are based on evolutionary principles. GP is a development of the genetic algorithm (GA) that generates thousands of computer programs to solve regression and classification problems. Unlike the conventional GA technique that seeks optimal values for some predefined parameters, GP manipulates many programs, based on the Darwinian evolution theory, to find both the best models and the best parameters for a given set of variables. Introduced by Ferreira [14], gene expression programming (GEP) was developed from the original GP. The entities in GEP (unlike those in GP) are in the form of the same length strings [15].
Although the technique emerged only recently, numerous applications have been discussed. GP has been applied to a range of geological, environmental, and civil problems [16][17][18][19][20]. The GEP method has not, however, been used to predict the UCS of carbonate rocks. Unlike other soft computing techniques, such as ANNs, GEP renders a (none) linear relationship between the independent variables and the response. Hence, this technique helps with providing a better understanding of the phenomena. In order to demonstrate the superiority of the GP in predicting the UCS, the technique is applied in the case of carbonate rocks. Moreover, the results are compared with two techniques of SVR (regression approach) and ANNs (black box approach).
The GEP was used in this study to construct a functional relationship between P-wave velocity and total porosity (independent variables) and UCS (dependent variables) of 117 samples of carbonate rocks. The results were compared with support vector machine (SVR) and artificial neural networks (ANNs) in terms of modeling capability and prediction accuracy.
Density [21], impact strength [22], porosity [23][24], and fabric [25] are among indices most widely used to predict UCS. We used in this study: P-wave velocity and total porosity to estimate UCS [4,26]. A carbonate formation of sedimentary limestones in a dam site located in southern Iran was investigated. A total of 117 representative core specimens were prepared for the measurement of UCS (per ASTM) [1]. The height-to-diameter ratio was approximately 2.7, and the 2 ends were flattened. The net total porosity was calculated as in Eq. ( 1):
In Eq. (1): n, , and are porosity, dry density, and density of the solid part, respectively.
Furthermore, the P-wave velocity within
This content is AI-processed based on open access ArXiv data.