This example will show how to convert the geological map below using GemGIS
to a GemPy
model. This example is based on digitized data. The area is 3954 m wide (W-E extent) and 2738 m high (N-S extent). The model represents folded layers that dip to the northeast or southwest, respectively.
formation
column) and the topographic lines (including a Z
column)GemPy
will be calculated from the strike lines.If you have not installed GemPy
yet, please follow the GemPy installation instructions and the GemGIS installation instructions. If you encounter any issues, feel free to open a new discussion at GemPy Discussions or GemGIS Discussions. If you encounter an error in the installation process, feel free to also open an issue at GemPy Issues or GemGIS Issues. There, the GemPy
and GemGIS
development teams will help you out.
For this notebook, we need the geopandas
library for the data preparation, rasterio
for dealing with the created digital elevation model, matplotlib
for plotting, numpy
for some numerical calculations, pandas
for manipulating DataFrames
and of course the gempy
and gemgis
libraries. Any warnings that may appear can be ignored for now. The file path is set to load the data provided for this tutorial.
import geopandas as gpd
import rasterio
import warnings
warnings.filterwarnings("ignore")
import gemgis as gg
import matplotlib.pyplot as plt
import numpy as np
import gempy as gp
import pyvista as pv
import pandas as pd
file_path = '../../data/example28_folded_layers/'
At his point, you should have the topographic contour lines (including a Z
column) and the layer boundaries (including a formation
column) digitized. If not, please generate the data before continuing with this tutorial.
The digital elevation model (DEM) will be created by interpolating the contour lines digitized from the georeferenced map using the SciPy
Radial Basis Function interpolation wrapped in GemGIS
. The respective function used for that is gg.vector.interpolate_raster()
.
There is also a tutorial available for this task on the GemGIS Documentation page.
Source: Powell, D. (1995): Interpretation geologischer Strukturen durch Karten - Eine praktische Anleitung mit Aufgaben und Lösungen, page 31, figure 26 A, Springer Verlag Berlin, Heidelberg, New York, ISBN: 978-3-540-58607-4.First, the contour lines are loaded using GeoPandas
. Please provide here the name of your shape file containing the digitized topographic contour lines.
topo = gpd.read_file(file_path + 'topo28.shp')
topo.head()
The contour lines are plotted using the built-in plotting function of GeoPandas
.
topo.plot(column='Z', aspect=1, legend=True, cmap='gist_earth')
The digital elevation model (DEM) will be created by interpolating the contour lines digitized from the georeferenced map using the SciPy
Radial Basis Function interpolation wrapped in GemGIS
. The respective function used for that is gg.vector.interpolate_raster()
.
topo_raster = gg.vector.interpolate_raster(gdf=topo, value='Z', method='rbf', res=5)
The interpolated digital elevation model can be displayed using matplotlib
and its plt.imshow()
function and by providing the extent of the raster to align it with the contour lines.
import matplotlib.pyplot as plt
from mpl_toolkits.axes_grid1 import make_axes_locatable
fix, ax = plt.subplots(1, figsize=(10,10))
topo.plot(ax=ax, aspect='equal', column='Z', cmap='gist_earth')
im = ax.imshow(topo_raster, origin='lower', extent=[0,865,0,867], cmap='gist_earth')
divider = make_axes_locatable(ax)
cax = divider.append_axes("right", size="5%", pad=0.05)
cbar = plt.colorbar(im, cax=cax)
cbar.set_label('Altitude [m]')
ax.set_xlabel('X [m]')
ax.set_ylabel('Y [m]')
After the interpolation of the contour lines, the raster is saved to disc using gg.raster.save_as_tiff()
. The function will not be executed as as raster is already provided with the example data.
gg.raster.save_as_tiff(raster=topo_raster, path=file_path + 'raster28.tif', extent=[0,865,0,867], crs='EPSG:4326', overwrite_file=True)
--------------------------------------------------------------------------- NameError Traceback (most recent call last) Cell In[1], line 1 ----> 1 gg.raster.save_as_tiff(raster=topo_raster, path=file_path + 'raster28.tif', extent=[0,865,0,867], crs='EPSG:4326', overwrite_file=True) NameError: name 'gg' is not defined
topo_raster = rasterio.open(file_path + 'topo28.tif')
The interface points will be extracted from LineStrings digitized from the georeferenced map using QGIS. It is important to provide a formation
name for each layer boundary. Up until now, only the X
and Y
position are stored in the vertices of the LineStrings. Using the digital elevation model created already, we will now sample the elevation model at the locations of the vertices to extract the height at this point as the stratigraphic boundary was mapped at the surface.
interfaces = gpd.read_file(file_path + 'interfaces28.shp')
interfaces.head()
fig, ax = plt.subplots(1, figsize=(5,5))
interfaces.plot(ax=ax, column='formation', legend=True, aspect='equal')
plt.grid()
ax.set_xlabel('X [m]')
ax.set_ylabel('Y [m]')
The vertical position of the interface points will not be extracted from the digital elevation model but using the GemGIS
function gg.vector.extract_xyz()
to extract it from the available Z
column. The resulting GeoDataFrame now contains single points including the information about the respective formation
as well as the X
, Y
, and Z
location. This is all we need as preparational steps to generate input data for GemPy
.
There is also a tutorial available for this task on the GemGIS Documentation page.
interfaces_coords = gg.vector.extract_xy(gdf=interfaces)
interfaces_coords = interfaces_coords.sort_values(by='formation', ascending=False)
interfaces_coords.head()
The interface points incuding their altitude (Z-) values and the digitized LineString can be plotted using matplotlib
.
fig, ax = plt.subplots(1, figsize=(5,5))
interfaces.plot(ax=ax, column='formation', legend=True, aspect='equal')
interfaces_coords.plot(ax=ax, column='formation', legend=True, aspect='equal')
plt.grid()
plt.xlabel('X [m]')
plt.ylabel('Y [m]')
For this example, orientations must be calculated yourself. They will be calculated using functions implemented in GemGIS and the previously digitized strike lines.
Source: Powell, D. (1995): Interpretation geologischer Strukturen durch Karten - Eine praktische Anleitung mit Aufgaben und Lösungen, page 31, figure 26 A, Springer Verlag Berlin, Heidelberg, New York, ISBN: 978-3-540-58607-4.Strike lines connect outcropping stratigraphic boundaries (interfaces) of the same altitude. In other words: the intersections between topographic contours and stratigraphic boundaries at the surface. The height difference and the horizontal difference between two digitized lines is used to calculate the dip and azimuth and hence an orientation that is necessary for GemPy
.
The calculation of orientations from strike lines has been implemented into GemPy
for simple cases like these. In order to calculate the orientations, each set of strikes lines/LineStrings for one formation must be given an id number next to the altitude of the strike line. The id field is already predefined in QGIS. The strike line with the lowest altitude gets the id number 1
, the strike line with the highest altitude the the number according to the number of digitized strike lines. It is currently recommended to use one set of strike lines for each structural element of one formation as illustrated.
By CrunchyRocks, after Karla Panchuck - https://openpress.usask.ca/physicalgeology/chapter/13-5-measuring-geological-structures/, CC BY 4.0, https://commons.wikimedia.org/w/index.php?curid=113554289
Source: Powell, D. (1995): Interpretation geologischer Strukturen durch Karten - Eine praktische Anleitung mit Aufgaben und Lösungen, page 14, figure 8, Springer Verlag Berlin, Heidelberg, New York, ISBN: 978-3-540-58607-4.strikes = gpd.read_file(file_path + 'strikes28.shp')
strikes
fig, ax = plt.subplots(1, figsize=(5,5))
strikes.plot(ax=ax,column='id', aspect=1)
interfaces.plot(ax=ax, column='formation', legend=True, aspect='equal')
ax.set_xlabel('X [m]')
ax.set_ylabel('Y [m]')
ax.grid()
The calculations will be calculated using the function gg.vector.calculate_orientations_from_strike_lines()
where the strike lines for each single formation will be provided and calculated separately. The result is a GeoDataFrame
ready to be used in GemPy
.
orientations_layer1a = gg.vector.calculate_orientations_from_strike_lines(gdf=strikes[strikes['formation']=='Layer1a'].sort_values(by='Z', ascending=True).reset_index())
orientations_layer1a
orientations_layer1b = gg.vector.calculate_orientations_from_strike_lines(gdf=strikes[strikes['formation']=='Layer1b'].sort_values(by='Z', ascending=True).reset_index())
orientations_layer1b
Since GemPy
only takes one DataFrame
for the necessary orientations, the single DataFrames
are concatenated using pd.concat()
.
import pandas as pd
orientations = pd.concat([orientations_layer1a, orientations_layer1b]).reset_index()
orientations['formation'] = 'Layer1'
orientations
fig, ax = plt.subplots(1, figsize=(5,5))
interfaces.plot(ax=ax, column='formation', legend=True, aspect='equal')
interfaces_coords.plot(ax=ax, column='formation', legend=True, aspect='equal')
orientations.plot(ax=ax, color='red', aspect='equal')
plt.grid()
plt.xlabel('X [m]')
plt.ylabel('Y [m]')
The provided geological map presents the the isolines or strike lines of the layers. Using the vertical and horizontal distance of these lines, we can calculate orientations from these lines. In order to that, we draw perpendicular lines between the isolines that will be converted into measurements of orientations. The horizontal distance will calculated from the length of each LineString, the vertical difference needs to be provided as separate column dZ
.
orientations_lines = gpd.read_file(file_path + 'orientation_lines.shp')
orientations_lines.head()
fig, ax = plt.subplots(1, figsize=(5,5))
interfaces.plot(ax=ax, column='formation', legend=True, aspect='equal')
orientations_lines.plot(ax=ax, aspect='equal', color='red')
plt.grid()
plt.xlabel('X [m]')
plt.ylabel('Y [m]')
gdf = gg.vector.extract_orientations_from_map(gdf=orientations_lines)
gdf
The creation of a GemPy
Model follows particular steps which will be performed in the following:
gp.create_model()
gp.init_data()
gp.map_stack_to_surfaces()
gp.set_interpolator()
gp.compute_model()
The first step is to create a new empty GemPy
model by providing a name for it.
geo_model = gp.create_model('Model28')
geo_model
During this step, the extent
of the model (xmin
, xmax
, ymin
, ymax
, zmin
, zmax
) and the resolution
in X
, Y
and Z
direction (res_x
, res_y
, res_z
, equal to the number of cells in each direction) will be set using lists of values.
The interface points (surface_points_df
) and orientations (orientations_df
) will be passed as pandas
DataFrames
.
gp.init_data(geo_model, [0,865,0,867,0,500], [50,50,50],
surface_points_df = interfaces_coords[interfaces_coords['Z']!=0].sample(n=75, random_state=1),
orientations_df = orientations,
default_values=True)
The model consists of one layers or surface which belongs to the Default series
. During the next step, the proper Series
will be assigned to the surface. Using the surfaces
-attribute again, we can check the loaded layer.
geo_model.surfaces
The loaded interface points and orientations can again be inspected using the surface_points
- and orientations
-attributes. Using the df
-attribute of this object will convert the displayed table in a pandas
DataFrame
.
geo_model.surface_points.df.head()
geo_model.orientations.df.head()
During this step, the one layer of the model are assigned to the Strata1
series. We know that the layers modeled here are parallel. If the layers were not parallel as shown in the next models, multiple series would be defined. We will also add a Basement
here (geo_model.add_surfaces('Basement')
). The order within one series also defines the age relations within this series and has to be according to the depositional events of the layers.
gp.map_stack_to_surfaces(geo_model,
{
'Strata1': ('Layer1'),
},
remove_unused_series=True)
geo_model.add_surfaces('Basement')
You can also return the number of interfaces and orientations for each formation using gg.utils.show_number_of_data_points()
gg.utils.show_number_of_data_points(geo_model=geo_model)
GemPy
is capable of including a topography into the modeling process. Here, we use the topography that we have interpolated in one of the previous steps. GemPy
takes the file path of the raster/digital elevation model and loads it as grid into the geo_model
object.
geo_model.set_topography(
source='gdal', filepath=file_path + 'topo28.tif')
It is possible to define straight (no bending) custom sections in GemPy
that will be calculated additionally with a provided resolution and that can be displayed separately using matplotlib
.
For this example, the bold black line indicates the location of the custom section. This custom section can be digitized with a LineString (line shape file) with vertices at both ends and loaded using GeoPandas
.
custom_section = gpd.read_file(file_path + 'customsections28.shp')
custom_section_dict = gg.utils.to_section_dict(custom_section, section_column='section')
geo_model.set_section_grid(custom_section_dict)
gp.plot.plot_section_traces(geo_model)
The input data can now be visualized in 2D using matplotlib
. This might for example be useful to check if all points and measurements are defined the way we want them to. Using the function plot_2d()
, we attain a 2D projection of our data points onto a plane of chosen direction (we can choose this attribute to be either 'x'
, 'y'
, or 'z'
).
gp.plot_2d(geo_model, direction='z', show_lith=False, show_boundaries=False)
plt.grid()
The input data can also be viszualized using the pyvista
package. In this view, the interface points are visible as well as the orientations (marked as arrows) which indicate the normals of each orientation value.
gp.plot_3d(geo_model, image=False, plotter_type='basic', notebook=True)
Once we have made sure that we have defined all our primary information, we can continue with the next step towards creating our geological model: preparing the input data for interpolation.
Setting the interpolator is necessary before computing the actual model. Here, the most important kriging parameters can be defined.
gp.set_interpolator(geo_model,
compile_theano=True,
theano_optimizer='fast_compile',
verbose=[],
update_kriging = False
)
At this point, we have all we need to compute our full model via gp.compute_model()
. By default, this will return two separate solutions in the form of arrays. The first provides information on the lithological formations, the second on the fault network in the model, which is not present in this example.
sol = gp.compute_model(geo_model, compute_mesh=True)
sol
geo_model.solutions
Cross sections in different direction
s and at different cell_number
s can be displayed. Here, we see the layers of the model in the different directions.
The first section to be plotted is the custom section Section1
followed by an array of cross sections.
gp.plot_2d(geo_model, section_names=['Section1'], show_topography=True, show_data=False)
gp.plot_2d(geo_model, direction=['x', 'x', 'y', 'y'], cell_number=[25,40,25,40], show_topography=True, show_data=False)
Next to the lithology data, we can also plot the calculated scalar field.
gp.plot_2d(geo_model, direction='y', show_data=False, show_scalar=True, show_lith=False)
The computed model can be visualized in 3D using the pyvista
library. Setting notebook=False
will open an interactive windows and the model can be rotated and zooming is possible.
gpv = gp.plot_3d(geo_model, image=False, show_topography=False,
plotter_type='basic', notebook=True, show_lith=True)
Take me to the next notebook on Github
Take me to the next notebook locally
Source: Bennison, G.M. (1988): An Introduction to Geological Structures and Maps, page 30, figure 12, Springer Verlag Berlin, Heidelberg, New York, ISBN: 978-1-4615-9632-5Institute for Computational Geoscience, Geothermics and Reservoir Geophysics, RWTH Aachen University & Fraunhofer IEG, Fraunhofer Research Institution for Energy Infrastructures and Geothermal Systems IEG, Authors: Alexander Juestel. For more information contact: alexander.juestel(at)ieg.fraunhofer.de
All notebooks are licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0, http://creativecommons.org/licenses/by/4.0/). References for each displayed map are provided. Most of the maps originate from the books of Powell (1992) and Bennison (1990). References for maps with unknown origin will gladly be added.