#!/usr/bin/env python # coding: utf-8 # [En français](../tutorial_gdal_fr/) # # ![ECCC logo](https://eccc-msc.github.io/open-data/img_eccc-logo.png) # # [TOC](https://eccc-msc.github.io/open-data/readme_en/) > [Usage overview](https://eccc-msc.github.io/open-data/usage/readme_en/) > GDAL command line tutorial # # # GDAL command line tutorial with weather data # # ## Introduction # # [MSC GeoMet](https://eccc-msc.github.io/open-data/msc-geomet/readme_en/) and [MSC Datamart](https://eccc-msc.github.io/open-data/msc-datamart/readme_en/) data can be manipulated via the command line using [GDAL](https://gdal.org/), a widely-known software library used to read and write raster and vector geospatial data. In the following examples, you'll use a GeoTIFF file retrieved using via a Web Coverage Service (WCS) request to MSC GeoMet. This tutorial will show you how to: # * Display the GDAL version installed on your system # * Save a WCS request output to your computer # * List information/metadata related to the raster file # * Reproject a raster file # * Convert a GeoTIFF file to the NetCDF file format # * Get the value for a specific point based on a location in longitude/latitude # # There are various ways to install GDAL, please refer to https://gdal.org/ for more information. # # To run the following GDAL command line examples, you need to have a basic knowledge of using the terminal command line. These examples work within a bash terminal. # # The [interactive version of this Jupyter Notebook is available](https://mybinder.org/v2/gh/ECCC-MSC/open-data/master?filepath=docs%2Fusage%2Ftutorial_gdal%2ftutorial_gdal_en.ipynb). # # [![badge](https://img.shields.io/badge/Interactive%20version-binder-F5A252.svg?logo=)](https://mybinder.org/v2/gh/ECCC-MSC/open-data/master?filepath=docs%2Fusage%2Ftutorial_gdal%2ftutorial_gdal_en.ipynb) # # ## Show GDAL version # # GDAL is a suite of several command line tools. When you install GDAL you get all the different command line tools. The most basic tool is `gdalinfo` which can be used to retrieve information pertaining to your GDAL installation and display information about a raster file. # In[1]: get_ipython().run_cell_magic('bash', '', '\ngdalinfo --version\n') # ## Save a WCS request output to disk # # The OGC Web Coverage Service requests enable a client to retrieve coverage information from a raster file for a given area of interest. WCS requests are made over the internet (HTTPS) and give the user more flexibility when requesting information about the coverage of a layer compared with the more traditional way of downloading flat files. The Web Coverage Service allows for several different types of requests, each of which are described in further detail below. For more information on the WCS request parameters, please refer to the [MSC GeoMet WCS GetCoverage page](https://eccc-msc.github.io/open-data/msc-geomet/web-services_en/#wcs-getcoverage). # # We are going to use a `curl` command to save the WCS request result on disk, the file will be named `CMC_glb_TMP.tif`. The result is a GeoTIFF file, showing temperature (°C) from a subset of MSC's Global Deterministic Prediction System (GDPS). # In[4]: get_ipython().run_cell_magic('bash', '', '\ncurl "https://geo.weather.gc.ca/geomet?SERVICE=WCS&VERSION=2.0.1&REQUEST=GetCoverage&COVERAGEID=GDPS.ETA_TT&SUBSETTINGCRS=EPSG:4326&SUBSET=x(-120,-85)&SUBSET=y(48,66)&RESOLUTION=x(0.24)&RESOLUTION=y(0.24)&FORMAT=image/tiff" > CMC_glb_TMP.tif \n') # ## Lists information about a raster file # # The `gdalinfo` tool can be used to retrieve the downloaded raster file's metadata. The command's output will list some information on the file, such as: # * File format # * File size # * Coordinate system # * Metadata # * Band information # In[11]: get_ipython().run_cell_magic('bash', '', '\ngdalinfo CMC_glb_TMP.tif\n') # It is also possible to use `gdalinfo` to retrieve some basic statistics on the raster file, such as minimum and maximum value by adding the `-mm` option. Note that the resulting values are in °C. # In[21]: get_ipython().run_cell_magic('bash', '', '\ngdalinfo -mm CMC_glb_TMP.tif | grep Min/Max\n') # Adding the `-proj4` option to `gdalinfo` will output the projection definition as a proj4 string: # In[24]: get_ipython().run_cell_magic('bash', '', '\ngdalinfo -proj4 CMC_glb_TMP.tif | grep PROJ.4 -A 1\n') # ## Reproject a raster file # # Using the `gdalwarp` command, we can reproject a raster file. Using MSC GeoMet and MSC Datamart data, you only need to provide an output projection definition corresponding to a EPSG code, or you can use a proj4 string. # # The following example reprojects the GeoTIFF file into an EPSG:3857 projection. The output file is named `CMC_glb_TMP_epsg3857.tif` # In[25]: get_ipython().run_cell_magic('bash', '', '\ngdalwarp -t_srs EPSG:3857 CMC_glb_TMP.tif CMC_glb_TMP_epsg3857.tif\n') # Then we can use `gdalinfo` to look at the coordinates and the proj4 string to ensure that the projection of `CMC_glb_TMP_epsg3857.tif` really is different from the original file. # In[26]: get_ipython().run_cell_magic('bash', '', "\ngdalinfo -proj4 epsg3857.tif | grep -E '(PROJ.4|Corner Coordinates:)' -A 5\n") # ## Convert a GeoTIFF file to the NetCDF file format # # Using `gdal_translate` command, we can convert a raster file from any supported format (`gdalinfo --formats`) into another raster file format. # # In this example, we convert our GeoTIFF file to a NetCDF file. The `-of NetCDF` parameter tells gdal_translate in which format to do the projection. The output file will be named `CMC_glb_TMP.nc` # In[27]: get_ipython().run_cell_magic('bash', '', '\ngdal_translate -of NetCDF CMC_glb_TMP.tif CMC_glb_TMP.nc\n') # Then using `gdalinfo` we can make sure the output NetCDF file is a valid raster file. # In[29]: get_ipython().run_cell_magic('bash', '', '\ngdalinfo CMC_glb_TMP.nc\n') # ## Get the value for a specific point based on a location in longitude/latitude # # Using `gdallocationinfo` command we can get the raw value of a pixel by specifying a location in either longitude/latitude or by specifying a pixel position. # # In the following example, we use longitude/latitude. The resulting value is in °C. # In[32]: get_ipython().run_cell_magic('bash', '', '\ngdallocationinfo -wgs84 CMC_glb_TMP.tif -100 50\n')