Abstract
Permeability modelling is considered a complex task in reservoir characterization and a key component of reservoir simulation. A common method for permeability modelling involves performing static rock typing (SRT) using routine core analysis data and developing simple fitting-based mathematical relations that link permeability to reservoir rock porosity. In the case of carbonate reservoirs, which are associated with high heterogeneities, fitting-based approaches may fail due to porosity–permeability data scattering. Accurate modelling of permeability using petrophysical well log data seems more promising since they comprise a vast array of information about the intrinsic properties of the geological formations. Furthermore, well log data exhibit continuity throughout the entire reservoir interval, whereas core data are discrete and limited in availability and coverage. In this research work, porosity, permeability and log data of two oil wells from a tight carbonate reservoir were used to predict permeability at un-cored intervals. Machine learning (ML) and fitting models were used to develop predictive models. Then, the developed ML models were compared to exponential and statistical fitting modelling approaches. The integrated ML permeability model based on Random Forest method performed significantly superior to exponential and statistical fitting-based methods. Accordingly, for horizontal and vertical permeability of test samples, the Root Mean Squared Error (RMSE) values were 3.7 and 4.5 for well 2, and 1.7 and 0.86 for well 4, respectively. Hence, using log data, permeability modelling was improved as it incorporates more comprehensive reservoir rock physics. The outcomes of this reach work can be used to improve the distribution of both horizontal and vertical permeability in the 3D model for future dynamic reservoir simulations in such a complex and heterogeneous reservoir system.
Similar content being viewed by others
Introduction
Permeability of reservoir rocks is typically measured using a limited number of core samples taken from wells. Advances in technology such as Nuclear Magnetic Resonance (NMR) log have made it possible to estimate permeability with a high accuracy using in situ measurements in an interval, continuously. However, permeability distributions between wells remains a significant challenge in formation evaluation. Various algorithms and mathematical methods are available for prediction of permeability in un-cored intervals, known as permeability modelling. This is crucial for preparing flow simulation models to evaluate reservoir performance. 3D dynamic reservoir models are essential for field development plans, reservoir performance prediction, well-based operations, and enhanced oil recovery (EOR) screening. Prediction of permeability throughout un-cored intervals is a complex process and researchers have attempted to develop a universal correlation covering these complications1,2,3,4. Machine learning (ML) is a powerful tool with a wide range of rapidly growing applications, which can be trained with well-based and reservoir-based data for permeability modelling purposes. Several techniques are available for permeability measurement including core scale, reservoir scale, testing, and NMR logs5. In routine core analysis (RCAL), permeability is measured on core scale; while, in transient well testing it is measured on a large volume of reservoir rock, reducing the uncertainty. Formation testing reports permeability of a specific spot in the well column, resulting in limited heterogeneity ranges compared to transient testing6. However, different length scales and heterogeneity scales result in different permeability values.
Permeability modelling techniques can be classified into three general groups: (1) Mathematical modelling based on measurable rock properties (e.g., porosity, grain size, and tortuosity) such as the exponential Kozeny–Carman relation7,8, correlations based on irreducible saturation9,10, and permeability model by Pittman and Winland based on porosity and pore diameter11,12, (2) Statistical permeability models such as Pearson-Tukey and Swanson averaging13,14, and (3) intelligent techniques such as ML and Artificial Intelligence (AI) modelling15,16. Empirical permeability models have experienced a significant progress in recent years thanks to availability of various methods to assess gas permeability and diffusivity in tight reservoirs. The tools used for this purposed include numerical models, NMR measurements, and fractal theory-based approaches to better understand gas transport properties in complex geological formations17,18,19,20. Correlations developed for specific reservoir and petrology conditions, even in similar geological and petrological conditions, are rarely successful in different reservoirs. Hence, some parameters related to rock physics are necessary to determine the governing rules between measured/available data and rock permeability. Log data are the most potential suitable candidates for this application. They have attracted growing attention because of: (i) their abundance and availability, (ii) good areal distribution and vertical continuity through geological structure of the reservoir, and (iii) each log parameter represents specific physics of the reservoir rock (such as sonic characteristics, natural or radioactivity, and electric features). Implementing log data (such as sonic, gamma, resistivity, neutron, density, and NMR) instead of macro/microscopic features of reservoir rock (e.g., porosity, specific surface area, tortuosity, and pore size) in permeability modelling has attracted much attention recently. Among the log parameters, electrical resistivity and sound speed are the most correlated with rock permeability since they are affected by rock permeability21. Furthermore, permeability of carbonate reservoirs can be estimated using NMR logs, accurately22. However, NMR data are not always available compared to routine full-set log parameters. Thus, some alternative methods should be developed for permeability modelling based on conventional log data. Because the measurement of pore size, grain radius, tortuosity, and irreducible water saturation is challenging in routine permeability modelling methods, some researches assessed these shortcomings5. Other researchers used log data with multi-parameter regression techniques5,9. However, they did not perform well in permeability modelling of heterogeneous sandstones and carbonate reservoirs15. Hence, researchers recently used artificial neural networks to obtain more accurate and acceptable results15,23.
Intelligent methods and, more specifically, ML methods have been used frequently for various engineering purposes. When the objective variable is a constant value (not distinct classes or categories), regression ML methods must be used. Hence, the problem of permeability modelling is categorized as a regression problem. Since there is a remarkable (vertical and areal) variety in correlation between permeability (k) and porosity (Φ), a single correlation cannot lead to accurate permeability predictions in all cells of a reservoir model. ML methods are able to resolve this position dependency of k-Φ relations24. Several pieces of research used ML methods in the permeability modelling of petroleum reservoirs25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51. A summary of application of some AI models for reservoir properties prediction is presented in Table 1.
According to the literature, several research work have been performed on intelligent permeability modelling. However, few researches addressed the application and comprehensive comparison between fitting-based and learning-based permeability modelling methods. In the petroleum industry, permeability is usually predicted using fitting-based approaches. The prediction of permeability can be improved by applying learning-based techniques; particularly when more log data from the reservoir is included following well drilling. These techniques are more economical since they exhibit higher levels of completeness and experience fewer errors. While, in this research work, we practically demonstrated the ability of ML methods to effectively handle the disparities and uncertainties inherent in the input data, thereby yielding more reliable and promising predictions of permeability. Moreover, this research work is performed on a challenging database that belongs to a highly heterogeneous, low porosity/permeability, and tight oil reservoir with average porosity and permeability of 0.05 and 2 mD, respectively.
In this paper, a brief description of the basics of exponential, statistical, and ML permeability modelling methods is provided (“Permeability modelling methods” section) followed by description geological setting, core and log data of an Iranian tight carbonate oil reservoir. Then, the fitting-based and learning-based permeability modelling methods were applied to the database. The efficiency of permeability prediction is then assessed and compared for all models. In the final stage, permeability modelling is performed and its reliability is verified via comparison with the average values of core permeability in all of the wells.
Permeability modelling methods
Geological setting and database
This research work is performed on one of the Iranian south oil reservoirs trapped in a structural anticline. The study area contains three formations: Ilam, Surgah and Sarvak, and the main reservoir in the field lies in the Sarvak formation, with primary lithology of calcite and dolomite and thin layers of shale. The reservoir can be categorized as a tight carbonate reservoir with an average porosity of 0.021 and water saturation of 0.64. Moreover, according to facies analysis, the main reservoir is composed of low to medium quality carbonate rocks and only about 10% of the reservoir rock can be characterized as potential reservoir rocks. The main oil-bearing layers are zones 4 and 6 (part of the Sarvak formation), compact carbonate formations with low porosity/permeability characteristics. A schematic of relative well locations, geological zoning, and the crest and lateral wells are presented in Fig. 1.
Five wells were drilled and completed in this reservoir including four production wells and one observation well. Due to the limited number of wells, close distance, and low distribution of wells through the reservoir geological structure, the characterization of reservoir rock, and permeability modelling are quite challenging. Petrophysical log data are available for all five wells; however, laboratory porosity/permeability core data are only available for wells 2 and 4. The lack of experimental data and the fact that the wells number 2 (northwest) and 4 (southeast) are located in the two furthest locations of the reservoir (about 10 km distance) are two factors that add to the level of uncertainty and challenge of the present research work.
Here, we first provided a statistical view of the experimental data because statistical parameters represent a comprehensive view of the dataset and the reservoir characteristics. For example, average values indicate the range of expected values from permeability modelling, and standard deviation reveals the dispersion and distribution of the data around the mean value. A detailed statistical description of the porosity/permeability data of wells 2 and 4 is presented in Table 2. The average permeability of the wells 2 and 4 is 1.86 and 1.15 mD for horizontal, and 2.29 and 0.88 mD for vertical samples. Additionally, most of the core samples in these two wells belong to the zones 4 and 6, the potential oil-bearing layers. Hence, the final permeability model is expected to show reliable performances in the oil-bearing zones.
Log and core porosity data of the zones 4, 5, and 6 in the wells 2 and 4 are presented in Fig. 2. There is an acceptable match between the laboratory and log porosity data in the well 2. However, in some intervals of the well 4, there are discrepancies originating from previous data acquisition processes. In the upper parts of the zone 4, log porosity is half of the value reported by core samples, and in the zone 6, log porosity is about 2/3 of the core porosity values. The porosity data does not have the same quality in all the wells and zones, so the proper modelling method should mitigate this inconsistency, reducing the level of uncertainty in permeability prediction. Furthermore, because of the poor quality of the porosity data and the tight and heterogeneous nature of the reservoir rock, conventional fitting-based methods are not deemed viable options as they typically rely on porosity data for permeability predictions. Hence, use of other available data (i.e., log data) in permeability prediction alleviates the negative impacts of data discrepancy and helps to manage the high uncertainty present in the dataset.
A complete set of log data is available for all of the five wells. The log dataset includes various data columns such as bit size (BS), Caliper (CALI), total conductivity (CT), sonic (DT), neutron porosity (NPHI), density (RHOB), total resistivity (RT), spectral gamma ray (SGR), calcite volume (VOL_CALCITE), dolomite volume (VOL_DOLOM), oil volume (VOL_UOIL) and water volume (VOL_UWATER). The availability of log data columns in each well is presented in Table 3. The available data are denoted in green and the missing data in red. A general permeability model must be developed based on the data columns available in all of the wells. Hence, eight data columns (known in all of the five wells) having physical relations with the pore geometry of the reservoir rock were chosen as input variables for the ML modelling conducted in this research work.
Reservoir rock typing
Rock typing can be defined as dividing reservoir rock into distinct categories, each having a specific condition of geological deposition and diagenetic features52. Successful rock typing leads to a reliable permeability prediction, saturation height modelling, dynamic simulations, and performance forecasting53. Several rock typing methods have been introduced in the literature, each of which is developed for specific lithology, rock quality, and geological conditions52,53,54. In this research work, we used the most popular methods (Rock Quality Index (RQI), Flow Zone Indicator (FZI), Discrete Rock Typing (DRT), and Winland methods) on the vertical and horizontal core k-Φ data as a part of the fitting-based methods. FZI method showed the best performance in classifying the data into distinct rock types. Since the main objective of this research work was to compare fitting-based and learning-based permeability modelling approaches, and to avoid prolongation of the paper, only the most widely used rock typing method (the FZI method) are presented here. To apply the FZI method, the RQI is first calculated as:
where k is the sample permeability in mD, \({\Phi }_{e}\) is the effective porosity, and RQI is in μm. FZI is defined as the ratio of RQI to normalized porosity (\({\Phi }_{z}\)) as:
where FZI is in \(\mu m\). Distinct rock types can be identified by: (i) taking core samples with similar FZI values as a distinct rock type or (ii) plotting RQI against \({\Phi }_{z}\) in log–log scale, then each unit slope line represents a rock type.
Statistical correlations
Delfiner21 performed a statistical investigation on k-Φ relations and pointed out that rock typing methods based on exponential k-Φ relations lead to underestimation of permeability. These methods result in arithmetic permeability averages lower than laboratory average values. Hence, these method would lead to underestimated values in the 3D upscaled model of a reservoir21. As Male and Duncan showed, exponential relations may lead to underestimation of core permeability by up to 3 times or more, particularly in heterogeneous carbonates55. To prevent this modelling bias, a statistical averaging method based on Swanson averaging was applied to core data to develop a k-Φ relation having a lower prediction bias and closer arithmetic average21. Firstly, the total porosity interval is divided into some sub-intervals, then 10%, 50%, and 90% quintiles of each sub-interval are calculated, and finally, the Swanson average of each is calculated as follows:
where \({X}_{10}\), \({X}_{50}\), and \({X}_{90}\) are 10%, 50%, and 90% quintiles, respectively. After \({X}_{mean}\) values are calculated, and a correlation is fitted for \({X}_{mean}\) against Φ, and it is used for permeability modelling instead of exponential k-Φ relations.
Machine learning permeability modelling
In recent years, ML methods have been widely used for reservoir property modelling. There are several commonly used ML methods such as ANN, SVM, Gradient Boosting Regressor (GBR), Lasso Regressor, K-nearest Neighbors (KNN), Decision Tree (DT), and Random Forest (RF) algorithms. Since model selection is an essential step in ML predictive modelling, multiple algorithms were trained on the same dataset to compare their performances in both the training and validation steps. For the present dataset, the RF Regressor was found as the most suitable ML model (see “Learning-based methods” section). The model selection process should be repeated for each database indicating the random forest model is not a universal method that performs best for all databases. As several models are used to maximize the performance, it is not feasible to describe in detail the design of each model within the scope of this research work. Here, we focused on explaining the Random Forest structure, a key model in our work. Hence, this algorithm is briefly discussed here. The RF is a robust integrated ML algorithm developed by Breiman56. The primary aim of developing the RF algorithm was to solve unsupervised regression and classification problems. However, it has been successfully applied to supervised regression problems such as porosity prediction from wireline log data57. In other words, the main concept behind this technique was building several independent decision trees also known as ensemble of trees and training them on the desired dataset and finally to make predictions. This algorithm utilizes bootstrap resampling technique to avoid overfitting, a resampling approach which works via replacement. Bootstrap sets are then formed from initial data where several samples are replaced with other repeating samples. Each tree is then built on an individual bootstrap set in the RF algorithm. Hence, all the trees would be different as they were built on diverse datasets leading to unalike predictions. In the next stage, all the trees are aggregated together, and the final prediction is obtained by averaging the predictions of individual trees (in the case of regression)56,57,58. The RF method can provide the importance degree of each feature and pairwise proximity between samples. Conversely, the RF algorithm favours smaller groups over larger groups if the input data contains groups of correlated features of similar relevance. The workflow of the random forest algorithm is presented in Fig. 3.
Results and discussion
Fitting-based methods
Rock typing permeability modelling
In this section, the FZI rock typing and permeability modelling is described as it performed the best in classifying the data into distinct rock types (RTs) among various methods used. As shown in Fig. 4, horizontal permeability was classified into four RTs (distinct FZI intervals). Exponential k-Φ correlations were obtained for the RT1 to RT4 with relatively good determination coefficients of 0.68, 0.89, 0.84, and 0.61. The RT1 and RT4 showed weaker correlations because they included the data points with scattered FZI values. This reveals that as the values of FZI and average permeability increases, the average porosity and storage capacity of reservoir rock decreases by a factor of 5.7, from the RT1 to RT4. This procedure was also repeated for vertical data resulting in four RTs with the same FZI intervals as in the horizontal samples. The data points and exponentially fitted correlations are presented in Fig. 5. The RT1 to RT3 were fitted with proper correlation coefficients of 0.782, 0.908, and 0.902, respectively. According to the statistical analysis, as the average vertical permeability increases, the average porosity decreases from the RT1 to RT3. The RT4 showed a very weak correlation coefficient because (i) its data were highly scattered, and (ii) while the average permeability of the RT4 is roughly 1/3 of the RT3, the lower porosity of RT4 resulted in higher FZI values. This demonstrates the incompetency of this permeability modelling method.
The permeability model obtained by rock typing is not applicable in un-cored reservoir depths because porosity and fluid saturations are the only available data in these locations, and they are not enough to identify the RT of each un-cored location. A simple and imprecise method is to take all permeability samples as a single rock type to achieve a general k-Φ relation, which can be used for permeability modelling of un-cored regions.
Statistical permeability modelling
A statistical permeability modelling method introduced by Delfiner21 was also applied to the data to compare the permeability modelling methods, comprehensively. This approach is expected to reduce the pessimistic effects of the exponential fitting on average predicted permeability values which can led to underestimation of the core permeability values by a factor of 355. In this section, statistical permeability modelling for horizontal permeability data (Fig. 6) and vertical permeability data (Fig. 7) are presented and discussed. For a detailed description of the applied procedure, one can refer to the paper by Delfiner21.
The values of \({X}_{10}\), \({X}_{50}\) and \({X}_{90}\) were calculated and plotted for each 0.02 porosity interval. Afterward, the Swanson averaging method (Eq. 4) was used to calculate the average values (\({X}_{mean}\)). Then, power correlations were fitted to the average values to obtain the final permeability modelling correlations, which showed acceptable coefficients of determination (R2) of 0.87 and 0.77 for horizontal and vertical permeability models. Since the R2 does not represent the efficiency of permeability modelling, comprehensively, the proficiency of this method is evaluated and compared with other methods in Sect. 5.
Learning-based methods
The choice of machine learning (ML) model within a pool of models should align with the characteristics of the dataset. It is important to acknowledge that no model is flawless. Owing to both intrinsic limitations within the dataset and inherent imperfections in all models, each carrying its own set of limitations and degrees of error. The effectiveness of a model is determined by how well it meets the predefined criteria. Upon identification of the most suitable model, a subsequent step involves optimization and evaluation of the model to enhance its efficiency when applied to a given dataset.
In this research work, the following AI algorithms were examined: SVM, RF, Lasso regression, KNN, and DT. Then, four separate groups of training/testing data with a ratio of 4:1 (train: test) were created and each algorithm was applied to all of the four datasets. The performance of the models then was assessed using different statistical quality measures including the R2 (Eq. 5), mean absolute error (MAE) (Eq. 6), Mean Squared Error (MSE) (Eq. 7), and Root mean squared error (RMSE) (Eq. 8) values, which are calculated and reported for train, test, and overall data sets (see Fig. 8) to compare and select the most effective algorithm.
where, N is the total number of observations.
As shown in Fig. 8, the Lasso regression and SVM algorithms demonstrated the weakest performance on the present dataset. The KNN model has shown a weak to intermediate prediction such that the training and testing R2 were 0.38 and 0.18. The critical point is that the decision tree model is mostly over-fitted, evidenced by the high training R2 of 0.92 and the low test R2 of 0.22. As a result, the RF model retained a relatively reasonable accuracy in permeability prediction from log parameters with an average R2 values of 0.84 (train) and 0.62 (test), MSE values of 4.0879 (train) and 4.989 (test), RMSE values of 1.819 (train) and 2.01 (test), and MAE values of 1.19 (train) and 1.34 (test) for the respective datasets.
Although the accuracy of the ML permeability modelling is not excellent because of the scattered and highly heterogeneous nature of the present dataset, this method is more precise and comprehensive than fitting-based methods, which only fit a few equations to all the data points, even in high heterogeneity situations. Besides, the following advantages can be listed for ML permeability modelling in comparison to fitting-based methods:
-
i.
The ML models are more comprehensive than fitting-based models because they predict rock permeability based on multiple (log) parameters rather than just using porosity and the fact that the log data are more abundant and available than core and laboratory data.
-
ii.
The ML models operate based on learning the rules between the input and objective parameters; while, the fitting-based methods use only some porosity/permeability correlations.
-
iii.
The ML models sufficiently account for the areal and vertical heterogeneity of the reservoir because log data is usually available both areally and vertically.
-
iv.
Each log parameter represents specific physics from the porous media (sonic, electrical, neutron, and radioactive), which strengthens the physical basis of this approach instead of only using porosity values.
After the best model is selected, it must be trained on the present dataset using an appropriate strategy. In this step, the main question is how to feed the input data into the models for the training process? Considering the available permeability (training) data, the following potential structures are suggested for the training process. (i) Well-based model: train the models and perform predictions for four groups: horizontal data of well 2, vertical data of well 2, horizontal data of well 4, and vertical data of well 2, separately. (ii) Integrated model: Develop two general models by integrating all horizontal samples for one model and all vertical samples for another.
Depending on the expected results and applications of the permeability modelling, one of the mentioned approaches should be followed for training and prediction. For instance, when there are suitable areal and vertical distributions of wells with available core permeability and log data, the well-based approach is more accurate but time-consuming. On the other hand, when there are good distributions of wells and log data, but core permeability data only exist for a few wells, the integrated approach is the best choice. In the following sections, we presented the results of both methods to provide a comprehensive comparison. In this research work, the integrated approach is preferable because:
-
i.
As mentioned in “Results and discussion” section, log data are available from all the wells (wells 1 to 5). However, core data is only accessible from wells 2 and 4.
-
ii.
Integrated modelling approach provides two general models (horizontal and vertical), which can model rock permeability wherever log data are available. This method results in 10 permeability logs for wells 1 to 5 (five horizontal and five vertical), increasing the availability of areal and vertical permeability data and consequently enhancing the efficiency of geo-statistical property distribution in the 3D reservoir model.
-
iii.
From the perspective of ML modelling, as the database size increases, the obtained model shows a higher accuracy and reliability in predictions. Herein, the integrated ML models are trained with a database much more extensive than in well-based methods. Hence, the results of the integrated ML modelling are more reliable.
In this section, we applied the well-based approach to the dataset, and four RF models were trained and tested for permeability predictions, separately. The tuned and optimized hyper-parameters for each model are presented in Table 4. The models were trained and tested with relatively good quality as evidenced by a R2 of 0.80. Then, the vertical and horizontal permeability of the wells 2 and 4 were predicted. The performance of permeability prediction using this approach is examined in detail and compared with other methods in Sect. 5.
To apply the integrated approach, two distinct databases were established: one for total horizontal samples and another for total vertical core data of the wells 2 and 4. Then, the hyper-parameters of RF models were tunned, and the models were trained and tested. The test and train determination coefficients of 0.84 and 0.76 for horizontal, and 0.85 and 0.82 for vertical samples (see Table 4) demonstrate the high accuracy of the integrated models. Although a high determination coefficients is a good characteristic for a regression ML model, but it is not comprehensive and does not guarantee successful predictions. Hence, a detailed and comparative performance analysis of all the applied methods is presented in Fig. 8.
After constructing the fitting-based and learning-based models, in this section the modelling results are compared with laboratory core permeability data to evaluate the relative performance of the models. Firstly, the core data were fed into the models to reproduce the core permeability data. The model results were plotted against core permeability values in a log–log plot (to present a complete view of the range of the permeability data). The horizontal and vertical permeability predictions for well 2 and well 4 are illustrated in Figs. 9 and 10, respectively. The more closely the data is aligned with the unit slope line (\(y=x\)), the more accurate the permeability prediction is.
The data of well 2 in Fig. 9 reveals that the integrated ML method has considerably improved the permeability prediction compared with other modelling approaches. The results of the integrated ML model align with the unit slope line for both horizontal and vertical samples of the well 2. However, the well-based ML predictions using both statistical and exponential fitting methods are relatively scattered around the unit slope line and do not demonstrate a considerable progress in permeability modelling. Although the integrated ML model is working better than the other methods, it should be pointed out here that it showed a weak performance in predicting samples with a permeability of less than 0.8 mD and was mainly overestimating the permeability in this interval.
The quality of permeability predictions in the well 4 is generally lower than in the well 2 because of high uncertainty in the well 4 porosity data as described in “Results and discussion” section. Considering Fig. 10, the integrated ML delivers better predictions from the both horizontal and vertical core permeability data. Moreover, these observations demonstrate that the integrated ML alleviated the destructive effects of the low quality porosity data on vertical permeability modelling; which, led to considerable improvements in permeability prediction. The learning-based methods intelligently take advantage of the physical characteristics of the porous media by using the log parameters reducing the effect of any possible low quality data. This is the most prominent feature of applying ML models compared to the fitting-based methods that only use porosity for permeability predictions.
As shown in Figs. 9 and 10, if the deviation is above the unit slope line, the corresponding model overestimates the permeability. If it deviates under the unit slope line, then the model underestimates the permeability of the core. A comparison between the statistical and exponential models revealed that the statistical methods substantially decrease the permeability underestimation of the exponential models. According to Delfiner21, the permeability underestimation by exponential fitting reduces the average permeability value in 3D reservoir models.
Since the graphical representations (Figs. 9 and 10) do not sharply clarify the prominence of the integrated ML, especially for the well 4 and vertical samples, the error values were calculated by using standard metrics to present a comprehensive comparison. Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) are the three widely used error evaluation metrics in ML and statistical analysis. They were calculated and reported in Table 5 for each modelling method by zone, well number, and permeability direction. The most efficient models (with the lowest error) were highlighted in bold in Table 5. Moreover, it should be noted that since the oil-bearing layers of the reservoir are the zones 4 and 6, the permeability modelling is only performed for these layers (for future reservoir simulations).
In the well 2, the integrated ML shows the best performance in permeability estimation. It predicts both horizontal and vertical permeability with considerably lower error values such that the MSE of integrated ML is lower than other methods by a factor of 2–3 in all zones and directions. For samples of the well 4, the integrated ML provided the minimum error for horizontal samples in the zone 4. While, the statistical model predicts the vertical samples better in this zone. In the zone 6, both horizontal and vertical samples are well predicted by the integrated ML. Overall evaluation reveals that the integrated ML method is the most accurate, robust, and reliable permeability estimator.
Another indication of the modelling performance is the ability of a modelling approach to reproduce average core permeability in each zone. This feature can guarantee that the 3D reservoir permeability model does not deviate far from the average core permeability in the zones 4 and 6. In Figs. 11 and 12, the average horizontal and vertical permeability of core samples and their corresponding model predictions are plotted for each zone of the wells 2 and 4, respectively. The error percentage (indicated above each bar) indicates the deviation of the average modelling permeability from the average core permeability (Error =\(({k}_{core}^{avg}- {k}_{model}^{avg})/{k}_{core}^{avg}\) × 100).
In the well 2 (see Fig. 11), the integrated ML efficiently reproduced the average permeability of core samples with error values of less than 10%. For the horizontal samples of the well 4 (Fig. 12), the integrated ML method presented a suitable prediction in the zone 4. While, the well-based ML was more accurate in the zone 6. In the vertical samples of the well 4, both of the well-based and the integrated ML methods showed high efficiencies in reproducing the average permeability of core samples in the zones 4 and 6. However, overall evaluations again confirmed that the integrated ML method is the most promising predictor for average permeability prediction.
Eventually, the most efficient and promising model is the one that indicates the highest performance in four aspects: (i) having the lowest deviation from the unit slope line in Figs. 9 and 10, (ii) having the least MSE and MAE error values in Table 5, (iii) reducing the unfavorable effects of corrupt porosity data of the well 4 on permeability predictions, (iv) the minimum difference between the average core permeability and the average modelling permeability (denoted in Figs. 11 and 12). These indicators generally guarantee that the integrated ML provides the most accurate and reliable permeability predictions for this reservoir.
Since this research work is part of an extensive process of 3D reservoir model preparation for later dynamic reservoir simulations, the selected permeability modelling approach must be used to generate a 3D permeability model. The integrated ML method presented here works based on log parameters, which are available for all of the wells (1 to 5). Hence, this model provides a suitable areal and vertical data distribution throughout the 3D structure of the reservoir. After the petrophysical log data of the wells 1 to 5 were imported into the permeability model, the permeability logs were constructed for each well (Figs. 13 and 14). It can be observed that the modelling data suitably matched the core permeability data, as demonstrated in previous discussions, both graphically and numerically. The resultant permeability logs will be used to distribute the horizontal and vertical permeability in the 3D model for future dynamic reservoir simulations.
Conclusions
In this research work, we reported one of the insignificant applications of ML methods in the field of reservoir property characterization which is vital for building 3D dynamic model of reservoirs. In the cases that enough routine and specific core data are available, it is recommended to perform adopt the following workflow; (i) to perform rock typing to identify rock types, (ii) to apply classification ML methods to predict rock type (and permeability correlation) of each point in the wells having log data. The following main points were concluded from this research work:
-
A comparison between results of statistical and exponential (fitting-based) models revealed that the permeability underestimations of exponential methods were mainly alleviated using statistical methods.
-
In highly heterogeneous, low porosity/permeability, and tight oil reservoirs (which is the case in this research work), the standard fitting-based methods failed to model reservoir rock permeability efficiently, precisely. Hence, the ML permeability modelling approach demonstrated to be more robust and accurate in such cases.
-
Considering the limited data and the low areal distribution of the wells in this reservoir, the permeability modelling using log data (instead of core data) included a better vertical and areal data distribution of the reservoir structure into the permeability modelling process.
-
Using the log data improved the reservoir permeability model because it used more physics of the reservoir rock, such as sonic characteristics, radioactivity, and electric features, instead of only using porosity correlations (as in fitting-based methods).
-
Multiple graphical and quantitative evaluations demonstrated that the integrated ML model is considerably more efficient than the well-based ML, exponential, and statistical methods. Best predictions made by the integrated ML (i.e., RF model) mainly owing to its database, which is more extensive than the database used in well-based ML. According to the results obtained from the integrated model, the RMSE for horizontal permeability in well 4 and well 2 are 1.7 and 3.7 respectively, while for vertical permeability, they are 0.86 and 4.5 respectively.
-
Proper application of ML methods and log data for permeability prediction significantly reduced undesirable effects of discrepancy and/or uncertainty in the dataset.
As a result of this research work, permeability modelling should incorporate learning-based approaches and techniques to better address challenges such as data gaps, low quality, scattering, potential errors, and discrepancies in data collection. By reducing the impact of these issues on outputs, these methods increase reliability and precision. Incorporating these intelligent procedures will significantly enhance future research initiatives and operational approaches in similar geological settings.
Data availability
The data supporting this research work are available from the corresponding authors upon reasonable request and with the permission of the IRD.
Abbreviations
- AI:
-
Artificial intelligence
- ANFIS:
-
Artificial neuro-fuzzy interference system
- ANN:
-
Artificial neural network
- BS:
-
Bit size
- CALI:
-
Caliper
- CMEF:
-
Committed machine with empirical formulas
- CT:
-
Total conductivity
- DEN:
-
Formation density log
- DRT:
-
Discrete rock typing
- DT:
-
Decision tree
- DT:
-
Sonic log
- EOR:
-
Enhanced oil recovery
- F:
-
Formation factor
- FFBP:
-
Feed-forward back-propagation
- FL:
-
Fuzzy logic
- FZI:
-
Flow zone indicator
- GA:
-
Genetic algorithm
- GBR:
-
Gradient boosting regressor
- GMDH:
-
Group method of data handling
- GR:
-
Gamma ray
- HGAPSO:
-
Hybrid GA and PSO
- ICA:
-
Imperialist competitive algorithm
- KNN:
-
K-nearest neighbors
- LLD:
-
Deep laterolog
- LLS:
-
Shallow laterolog
- LM:
-
Levenberg–Marquardt
- LSSVM:
-
Least-square support vector machine
- MAE:
-
Mean absolute error
- MIMO:
-
Multiple-input multiple-output
- MKF:
-
Mixed kernel function
- ML:
-
Machine learning
- MSE:
-
Mean squared error
- MSFL:
-
Micro spherically focused log
- NMR:
-
Nuclear magnetic resonance
- NPHI:
-
Neutron-porosity log
- PEF:
-
Photoelastic adsorption factor
- PHID:
-
Density porosity
- PHIE:
-
Effective porosity
- PHIT:
-
Total porosity
- R:
-
Correlation coefficient
- R2 :
-
Coefficient of determination
- RBF:
-
Radial basis function
- RD:
-
Deep resistivity
- RF:
-
Random forest
- RHOB:
-
Deep density log
- RHOZ:
-
Formation density
- RL:
-
Resistivity log
- RLA1:
-
Apparent resistivity focusing mode 1
- RLA5:
-
Apparent resistivity focusing mode 5
- RMSE:
-
Root mean squared error
- RQI:
-
Rock quality index
- RT:
-
Total resistivity
- RT:
-
Electrical resistivity
- RTs:
-
Rock types
- RVR:
-
Relevance vector regression
- SFL:
-
Spherically focused log
- SGR:
-
Spectral Gamma Ray
- SRT:
-
Static rock typing
- SVM:
-
Support vector machine
- SWT:
-
Water saturation
- TNPH:
-
Thermal neutron porosity well logs
- VOL_CALCITE:
-
Volumes of calcite
- VOL_DOLOM:
-
Volumes of dolomite
- VOL_UOIL:
-
Volumes of oil
- VOL_UWATER:
-
Volumes of water
- VSH:
-
Shale volume
- XGBoost:
-
Extreme gradient boosting
References
Archie, G. E. The electrical resistivity log as an aid in determining some reservoir characteristics. Trans. AIME 146, 54–62 (1942).
Bloch, S. Empirical prediction of porosity and permeability in sandstones. AAPG Bull. 75, 1145–1160 (1991).
Tixier, M. P. Evaluation of permeability from electric-log resistivity gradients. Oil Gas J. 16, 113–133 (1949).
Ahmed, U., Crary, S. & Coates, G. Permeability estimation: The various sources and their interrelationships. J. Petrol. Technol. 43, 578–587 (1991).
Yao, C. & Holditch, S. SPE Eastern Regional Meeting. (OnePetro).
Babadagli, T. & Al-Salmi, S. A review of permeability-prediction methods for carbonate reservoirs using well-log data. SPE Reserv. Eval. Eng. 7, 75–88 (2004).
Carman, P. C. Fluid flow through granular beds. Chem. Eng. Res. Des. 75, S32–S48 (1997).
Kozeny, J. Uber kapillare leitung der wasser in boden. Royal Academy of Science, Vienna, Proc. Class I, vol. 136, 271–306 (1927).
Coates, G. R. & Dumanoir, J. SPWLA 14th Annual Logging Symposium. (OnePetro).
Timur, A. SPWLA 9th Annual Logging Symposium. (OnePetro).
Kolodzie, S. SPE Annual Technical Conference and Exhibition. (OnePetro).
Pittman, E. D. Relationship of porosity and permeability to various parameters derived from mercury injection-capillary pressure curves for sandstone. AAPG Bull. 76, 191–198 (1992).
Hurst, A., Brown, G. & Swanson, R. Swanson’s 30-40-30 Rule. AAPG Bull. 84, 1883–1891 (2000).
Pearson, E. S. & Tukey, J. W. Approximate means and standard deviations based on distances between percentage points of frequency curves. Biometrika 52, 533–546 (1965).
Huang, Z., Shimeld, J., Williamson, M. & Katsube, J. Permeability prediction with artificial neural network modeling in the Venture gas field, offshore eastern Canada. Geophysics 61, 422–436 (1996).
Knecht, L., Mathis, B., Leduc, J.-P., Vandenabeele, T. & Di Cuia, R. SPWLA 44th Annual Logging Symposium. (OnePetro).
Cui, X., Bustin, A. & Bustin, R. M. Measurements of gas permeability and diffusivity of tight reservoir rocks: Different approaches and their applications. Geofluids 9, 208–223 (2009).
Yang, Z., Yuan, B., Clarkson, C. R. & Ghanizadeh, A. Evaluation of surface diffusion in microporous/mesoporous media using a numerical model applied to rate-of-adsorption data: Implications for improved gas permeability estimation in shales/tight rocks using drill cuttings. Fuel 285, 118974 (2021).
Di, J. & Jensen, J. L. A new approach for permeability prediction with NMR measurements in tight formations. SPE Reserv. Eval. Eng. 19, 481–493 (2016).
Yang, Y. & Liu, S. Estimation and modeling of pressure-dependent gas diffusion coefficient for coal: A fractal theory-based approach. Fuel 253, 588–606 (2019).
Delfiner, P. SPE Annual Technical Conference and Exhibition. (OnePetro).
Quintero, L., Boyd, A., Gyllensten, A. & El-Wazeer, F. SPE Annual Technical Conference and Exhibition. (OnePetro).
Mohaghegh, S. Virtual-intelligence applications in petroleum engineering: Part 1—Artificial neural networks. J. Petrol. Technol. 52, 64–73 (2000).
Ma, Y. Quantitative Geosciences: Data Analytics, Geostatistics, Reservoir Characterization and Modeling 495–515 (Springer, 2019).
Van der Baan, M. & Jutten, C. Neural networks in geophysical applications. Geophysics 65, 1032–1047 (2000).
Helle, H. B., Bhatt, A. & Ursin, B. Porosity and permeability prediction from wireline logs using artificial neural networks: A North Sea case study. Geophys. Prospect. 49, 431–444 (2001).
Ahmadi, M. A., Zendehboudi, S., Lohi, A., Elkamel, A. & Chatzis, I. Reservoir permeability prediction by neural networks combined with hybrid genetic algorithm and particle swarm optimization. Geophys. Prospect. 61, 582–598 (2013).
Erofeev, A., Orlov, D., Ryzhov, A. & Koroteev, D. Prediction of porosity and permeability alteration based on machine learning algorithms. Transp. Porous Media 128, 677–700 (2019).
Male, F., Jensen, J. L. & Lake, L. W. Comparison of permeability predictions on cemented sandstones with physics-based and machine learning approaches. J. Nat. Gas Sci. Eng. 77, 103244 (2020).
Miah, M. I., Zendehboudi, S. & Ahmed, S. Log data-driven model and feature ranking for water saturation prediction using machine learning approach. J. Petrol. Sci. Eng. 194, 107291 (2020).
Shokir, E.E.-M. A novel model for permeability prediction in uncored wells. SPE Reserv. Eval. Eng. 9, 266–273 (2006).
Al-Anazi, A. & Gates, I. Support vector regression for porosity prediction in a heterogeneous reservoir: A comparative study. Comput. Geosci. 36, 1494–1503 (2010).
Al-Anazi, A. & Gates, I. D. Support-vector regression for permeability prediction in a heterogeneous reservoir: A comparative study. SPE Reserv. Eval. Eng. 13, 485–495 (2010).
Gholami, R., Shahraki, A. & Jamali Paghaleh, M. Prediction of hydrocarbon reservoirs permeability using support vector machine. Math. Probl. Eng. 2012, 1–18 (2012).
El-Sebakhy, E. A. et al. Functional networks as a new data mining predictive paradigm to predict permeability in a carbonate reservoir. Expert Syst. Appl. 39, 10359–10375 (2012).
Verma, A. K., Cheadle, B. A., Routray, A., Mohanty, W. K. & Mansinha, L. SPE Annual Technical Conference and Exhibition. 1–6.
Ahmadi, M.-A., Ahmadi, M. R., Hosseini, S. M. & Ebadi, M. Connectionist model predicts the porosity and permeability of petroleum reservoirs by means of petro-physical logs: Application of artificial intelligence. J. Petrol. Sci. Eng. 123, 183–200 (2014).
Ayoub, M. A. & Esmaeili, A. Application of artificial neural netwroks technique for estimating permeability from well log data. J. Eng. Sci. Soc. 10, 31–37 (2014).
Jamshidian, M. et al. Prediction of free flowing porosity and permeability based on conventional well logging data using artificial neural networks optimized by imperialist competitive algorithm—A case study in the South Pars Gas field. J. Nat. Gas Sci. Eng. 24, 89–98 (2015).
Alobaidi, D. A. Permeability prediction in one of Iraqi carbonate reservoir using hydraulic flow units and neural networks. Iraqi J. Chem. Petrol. Eng. 17, 1–11 (2016).
Vardian, M., Nasriani, H. R., Faghihi, R., Vardian, A. & Jowkar, S. Porosity and permeability prediction from well logs using an adaptive neuro-fuzzy inference system in a naturally fractured gas-condensate reservoir. Energy Sources Part A Recov. Util. Environ. Effects 38, 435–441 (2016).
Elkatatny, S., Mahmoud, M., Tariq, Z. & Abdulraheem, A. New insights into the prediction of heterogeneous carbonate reservoir permeability from well logs using artificial intelligence network. Neural Comput. Appl. 30, 2673–2683 (2018).
Ahmadi, M. A. & Chen, Z. Comparison of machine learning methods for estimating permeability and porosity of oil reservoirs via petro-physical logs. Petroleum 5, 271–284 (2019).
Zhong, Z., Carr, T. R., Wu, X. & Wang, G. Application of a convolutional neural network in permeability prediction: A case study in the Jacksonburg-Stringtown oil field, West Virginia, USA. Geophysics 84, B363–B373 (2019).
Zhong, Z. & Carr, T. R. Application of a new hybrid particle swarm optimization-mixed kernels function-based support vector machine model for reservoir porosity prediction: A case study in Jacksonburg-Stringtown oil field, West Virginia, USA. Interpretation 7, T97–T112 (2019).
Al Khalifah, H., Glover, P. & Lorinczi, P. Permeability prediction and diagenesis in tight carbonates using machine learning techniques. Mar. Petrol. Geol. 112, 104096 (2020).
Nkurlu, B. M. et al. Prediction of permeability using group method of data handling (GMDH) neural network from well log data. Energies 13, 1–18 (2020).
Zhang, Z., Zhang, H., Li, J. & Cai, Z. Permeability and porosity prediction using logging data in a heterogeneous dolomite reservoir: An integrated approach. J. Nat. Gas Sci. Eng. 86, 103743 (2021).
Menke, H. P., Maes, J. & Geiger, S. Upscaling the porosity–permeability relationship of a microporous carbonate for Darcy-scale flow with machine learning. Sci. Rep. 11, 2625 (2021).
Topór, T. Application of machine learning algorithms to predict permeability in tight sandstone formations. Nafta-Gaz 77, 283–292 (2021).
Mulashani, A. K., Shen, C., Nkurlu, B. M., Mkono, C. N. & Kawamala, M. Enhanced group method of data handling (GMDH) for permeability prediction based on the modified Levenberg Marquardt technique from well log data. Energy 239, 121915 (2022).
Gunter, G., Finneran, J., Hartmann, D. & Miller, J. SPE Annual Technical Conference and Exhibition. (OnePetro).
Amaefule, J. O., Altunbay, M., Tiab, D., Kersey, D. G. & Keelan, D. K. SPE Annual Technical Conference and Exhibition. (OnePetro).
Porras, J. & Campos, O. Latin American and Caribbean Petroleum Engineering Conference. 15–18 (The Society of Petroleum Buenos Aires).
Male, F. & Duncan, I. J. Lessons for machine learning from the analysis of porosity-permeability transforms for carbonate reservoirs. J. Petrol. Sci. Eng. 187, 106825 (2020).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Ho, T. K. Proceedings of 3rd International Conference on Document Analysis and Recognition. 278–282 (IEEE).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Acknowledgements
The authors would like to thank the National Iranian South Oil Company (NISOC) for providing the raw data.
Author information
Authors and Affiliations
Contributions
E.H. and A.A.R. were responsible for data curation, investigation, software, visualization, and writing the original draft. R.M. contributed to the investigation, conceptualization, project administration, writing review and editing, and supervision. A.S., S.K., and A.Sh. provided writing review and editing, as well as supervision. All authors critically helped in the interpretation of results, revised the manuscript, and provided relevant intellectual input. They all read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hajibolouri, E., Roozshenas, A.A., Miri, R. et al. Permeability modelling in a highly heterogeneous tight carbonate reservoir using comparative evaluating learning-based and fitting-based approaches. Sci Rep 14, 10209 (2024). https://doi.org/10.1038/s41598-024-60995-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-60995-7
Keywords
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.