Estimating total organic carbon of potential source rocks in the Espírito Santo Basin, SE Brazil, using XGBoost

Marine and Petroleum Geology(2024)

引用 0|浏览4
暂无评分
摘要
Identifying and constraining source rocks is critical for petroleum system modeling and risk assessment. Traditional methods for total organic carbon (TOC) estimation based on well logs have limitations, leading to the emergence of machine learning techniques like XGBoost. This study compiled a comprehensive data set of well log and geochemical data from the Espírito Santo Basin, SE Brazil, and XGBoost, which was integrated with pySpark, was used for running multiple machine learning models to predict TOC. Parameter tuning was performed by randomly combining model configurations over multiple replication data frames. XGBoost effectively predicted TOC, yielding a coefficient of determination R2 of 0.71, RMSE of 0.55 and MAE of 0.30, based on the average of all 10-fold cross-validation test sets. Heteroscedasticity was observed, possibly related to the presence of outliers in the target TOC variable, which may be linked to variable organic-matter deposition and preservation processes through the geological time, such as during Oceanic Anoxic Events (OAEs). The results indicate the potential of machine learning for TOC prediction in large, heterogeneous data sets, outperforming the traditional ΔlogR method and offering a promising tool for the usage of available public data sets in similar applications, such as the oil and gas (O&G) industry's exploration phase or field reassessment.
更多
查看译文
关键词
Geological database,Machine learning,XGBoost regression,Total organic carbon (TOC),Parameter tuning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要