Introduction to image processing - Radiometry

Introduction¶

For the following exercices, you need Python 3 with some basic librairies (see below). All images necessary for the session are available here.

If you use your own Python 3 install, you should download the images, put them in a convenient directory and update the path in the next cell.

In [1]:

path = './'

In [2]:

import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline

Load and display a color image¶

A color image is made of three channels : red, green and blue. A color image in $\mathbb{R}^{N\times M}$ is stored as a $N\times M\times 3$ matrix.

Be careful with the functions plt.imread() and plt.imshow() of matplotlib.

plt.imread() reads png images as numpy arrays of floating points between 0 and 1, but it reads jpg or bmp images as numpy arrays of 8 bit integers.
In this practical session, we assume images encoded as floating point values between 0 and 1, so if you load a jpg or bmp file you must convert the image to float type and normalize its values.
If 'im' is an image encoded as a double numpy array, plt.imshow(im) will display all values above 1 in white and all values below 0 in black. If the image 'im' is encoded on 8 bits though, plt.imshow(im) will display 0 in black and 255 in white.

In [3]:

imrgb = plt.imread(path+'parrot.png')

In [4]:

# we can first show imrgb itself, i.e. the NumPy array of values :
print(imrgb)

[[[0.40784314 0.3647059  0.25490198]
  [0.4117647  0.36862746 0.25882354]
  [0.41568628 0.37254903 0.25490198]
  ...
  [0.34117648 0.29803923 0.18039216]
  [0.34117648 0.29803923 0.18039216]
  [0.34117648 0.29803923 0.18039216]]

 [[0.4117647  0.36862746 0.25882354]
  [0.4117647  0.36862746 0.25882354]
  [0.4117647  0.36862746 0.2509804 ]
  ...
  [0.34117648 0.29803923 0.18039216]
  [0.34117648 0.29803923 0.18039216]
  [0.34117648 0.29803923 0.18039216]]

 [[0.4117647  0.36862746 0.2509804 ]
  [0.4117647  0.36862746 0.2509804 ]
  [0.41960785 0.3647059  0.2509804 ]
  ...
  [0.34509805 0.3019608  0.19215687]
  [0.34509805 0.3019608  0.19215687]
  [0.34117648 0.30588236 0.19215687]]

 ...

 [[0.4392157  0.41960785 0.3019608 ]
  [0.4392157  0.41960785 0.3019608 ]
  [0.44313726 0.42352942 0.30588236]
  ...
  [0.30588236 0.28235295 0.19607843]
  [0.30588236 0.28235295 0.1882353 ]
  [0.3019608  0.2784314  0.18431373]]

 [[0.4392157  0.41960785 0.3019608 ]
  [0.44313726 0.42352942 0.30588236]
  [0.44705883 0.42745098 0.30980393]
  ...
  [0.3137255  0.28627452 0.18431373]
  [0.30980393 0.28235295 0.18039216]
  [0.30588236 0.28235295 0.18039216]]

 [[0.44313726 0.42352942 0.30588236]
  [0.44313726 0.42352942 0.30588236]
  [0.44705883 0.42745098 0.30980393]
  ...
  [0.3137255  0.28627452 0.18431373]
  [0.31764707 0.2901961  0.1882353 ]
  [0.30980393 0.28235295 0.17254902]]]

Display the image size:

In [5]:

[nrow,ncol,nch] = imrgb.shape
print(nrow,ncol,nch)

495 495 3

You can use plt.imshow() to display the 3D numpy array imrgb as an image:

In [6]:

plt.figure(figsize=(7, 7))
plt.imshow(imrgb)
plt.show()

In [7]:

# Let's do the same with a very simple gray level image with only 4 pixels, that we define by hand :
test = np.array([[0,100],[150,200]])

# First we just print the matrix itself :
print(test) 

# Then we use imshow function to display it as an image
plt.figure(figsize=(7, 7))
plt.imshow(test, cmap='gray', vmin=0, vmax=255)
plt.show()

[[  0 100]
 [150 200]]

It might be useful to convert the color image to gray level. This can be done by averaging the three channels, or by computing another well chosen linear combination of the coordinates R, G and B. First we try with simple averaging $$I_{gs}=(R+G+B)/3$$

In [8]:

imgray = np.mean(imrgb,2)
plt.figure(figsize=(7, 7))
plt.imshow(imgray, cmap='gray', vmin=0, vmax=1)
plt.show()

1. Now use a custom weighted averaging of the three channels, that reflects better human perception: $$I_{gs}=0.21 R + 0.72 G + 0.07 B$$

In [9]:

imgray2 = np.average(imrgb, 2, weights=[0.21,0.72,0.07])
# autre solution :
imgray2 = 0.21*imrgb[:,:,0] + 0.72*imrgb[:,:,1] + 0.07*imrgb[:,:,2]
plt.figure(figsize=(7, 7))
plt.imshow(imgray2, cmap='gray', vmin=0, vmax=1)
plt.show()

In [10]:

# The two images imgray and imgray2 look very similar, let's check that they are indeed different :

# First we check that the difference of gray level values are non zero :
print(imgray-imgray2)

# We can also display the difference as an image:
plt.figure(figsize=(7, 7))
plt.imshow(imgray-imgray2, cmap='gray')
plt.show()

[[-0.02359477 -0.0235948  -0.02566013 ... -0.02566013 -0.02566013
  -0.02566013]
 [-0.0235948  -0.0235948  -0.02566016 ... -0.02566013 -0.02566013
  -0.02566013]
 [-0.02566016 -0.02566016 -0.02317649 ... -0.02359477 -0.02359477
  -0.02559477]
 ...
 [-0.02856213 -0.02856213 -0.0285621  ... -0.01981699 -0.02188236
  -0.02188236]
 [-0.02856213 -0.0285621  -0.0285621  ... -0.02346405 -0.02346405
  -0.02394772]
 [-0.0285621  -0.0285621  -0.0285621  ... -0.02346405 -0.02346405
  -0.02552941]]

Histograms and contrast enhancement¶

Computing histograms¶

In the following, we compute and display the gray level histogram and the cumulative histogram of an image.

The cumulative histogram of a discrete image $u$ is an increasing function defined on $\mathbb{R}$ by $$H_u(\lambda)=\frac{1}{|\Omega|}\#{\{\textbf{x};\;u(\textbf{x})\leq \lambda\}}.$$ The histogram of $u$ is the derivative of $H_u$ in the sense of distributions.

a. We compute the histogram of the image imgray:

In [11]:

imhisto, bins = np.histogram(imgray, range=(0,1), bins = 256)
imhisto       = imhisto/np.sum(imhisto)

b. We now compute the corresponding cumulative histogram thanks to the function np.cumsum()which cumulates the values of a vector from left to right:

In [12]:

imhistocum = np.cumsum(imhisto) 

c. We display the image, histogram and cumulative histogram:

In [13]:

values = (bins[1:]+bins[:-1])/2
fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(15, 5))
axes[0].imshow(imgray, cmap='gray', vmin=0, vmax=1)
axes[0].set_title('parrot image, gray level')
axes[1].bar(values,imhisto,width=1/256)
axes[1].set_title('histogram')
axes[2].bar(values,imhistocum,width=1/256)
axes[2].set_title('cumulative histogram')
fig.tight_layout()
plt.show()

Histogram equalization¶

If $u$ is a discrete image and $h_u$ its gray level distribution, histogram equalization consists in applying a contrast change $g$ (increasing function) to $u$ such that $h_{g(u)}$ is as close as possible to a constant distribution. We can compute directly $$H_u(u)*255.$$

To this aim, we can apply directly the vector imhistocum (which can be seen as a function from $\{0,\dots,255\}$ into $[0,1]$) to the numpy array imgray. Since imgray has values between $0$ and $1$, it is necessary to multiply it by $255$ and cast it as a 8-bit array:

In [14]:

imeq = imhistocum[np.uint8(255*imgray)]

We can now display the resulting equalized image:

In [15]:

plt.figure(figsize=(7, 7))
plt.imshow(imeq, cmap = 'gray', vmin=0, vmax=1)
plt.show()

2. Compute and plot also the corresponding histograms and cumulative histograms of the equalized image.

In [16]:

# to do: question 2
def plot_histos(im, title=''):
    imhisto, bins = np.histogram(im, range=(0,1), bins = 256)
    imhisto       = imhisto/np.sum(imhisto)
    imhistocum = np.cumsum(imhisto) 
    values = (bins[1:]+bins[:-1])/2
    fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(15, 5))
    axes[0].imshow(im, cmap='gray', vmin=0, vmax=1)
    axes[0].set_title(title)
    axes[1].bar(values,imhisto,width=1/256)
    axes[1].set_title('histogram')
    axes[2].bar(values,imhistocum,width=1/256)
    axes[2].set_title('cumulative histogram')
    fig.tight_layout()
    plt.show()

plot_histos(imeq, 'parrot image equalized, gray level')

3. Now, apply the previous histogram equalization to the two images parrot_bright.png and parrot_dark.png, plot the equalized images, the corresponding histograms and cumulative histograms. Comment the results and explain the observed differences.

In [17]:

# to do: question 3
imrgb_bright = plt.imread(path+'parrot_bright.png')
imgray_bright = np.mean(imrgb_bright,2)
plt.figure(figsize=(7, 7))
plt.imshow(imgray_bright, cmap='gray', vmin=0, vmax=1)
plt.show()

In [18]:

plot_histos(imgray_bright, 'parrot_bright image')

In [19]:

imhisto, bins = np.histogram(imgray_bright, range=(0,1), bins = 256)
imhisto       = imhisto/np.sum(imhisto)
imhistocum = np.cumsum(imhisto)
imeq_bright = imhistocum[np.uint8(255*imgray_bright)]
plot_histos(imeq_bright, 'parrot_bright images, equalized')

In [20]:

imrgb_dark = plt.imread(path+'parrot_dark.png')
imgray_dark = np.mean(imrgb_dark,2)
plot_histos(imgray_dark, 'parrot_dark image')

In [21]:

imhisto, bins = np.histogram(imgray_dark, range=(0,1), bins = 256)
imhisto       = imhisto/np.sum(imhisto)
imhistocum = np.cumsum(imhisto)
imeq_dark = imhistocum[np.uint8(255*imgray_dark)]
plot_histos(imeq_dark, 'parrot_dark images, equalized')

Histogram specification¶

If $u$ is a discrete image and $h_u$ its gray level distribution, histogram specification consists in applying a contrast change $g$ (an increasing function) to $u$ such that $h_{g(u)}$ is as close as possible to a target discrete probability distribution $h_t$. Specification is particularly useful to compare two images of the same scene (in this case the target distribution is the histogram of the second image $v$).

We start by reading our two images $u$ and $v$:

In [22]:

buenos1=plt.imread(path+'buenosaires4.png')
buenos2=plt.imread(path+'buenosaires3.png')
u = buenos1[:,:,0]
v = buenos2[:,:,0]
[nrowu,ncolu]=u.shape
[nrowv,ncolv]=v.shape

Now, histogram specification between two grey level images $u$ and $v$ can be computed easily by sorting the pixels of both images and by replacing each gray level in $u$ by the gray level of similar rank in $v$:

In [23]:

index_u = np.argsort(u,axis=None)  # calcule l'image des rangs des pixels de u
u_sort = u.flatten()[index_u] #np.sort(u,axis=None)
index_v = np.argsort(v,axis=None)
v_sort = v.flatten()[index_v] #np.sort(v,axis=None)
uspecifv = np.zeros(nrowu*ncolu)
uspecifv[index_u] = v_sort
uspecifv = uspecifv.reshape(nrowu,ncolu)

We can now display the result.

In [24]:

fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(15, 5))
axes[0].set_title('image u')
axes[0].imshow(u,'gray', vmin=0, vmax=1)
axes[1].set_title('image v')
axes[1].imshow(v,'gray', vmin=0, vmax=1)
axes[2].set_title('image specification')
axes[2].imshow(uspecifv,'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

4. Try to translate the grey levels of $u$ such that it has the same mean grey level than $v$ and display the result. Is it similar to the specification of $u$ on $v$ ?

In [25]:

# to do: question 4
u_translation = u + (np.mean(v)-np.mean(u))
fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(15, 5))
axes[0].set_title('image u')
axes[0].imshow(u,'gray', vmin=0, vmax=1)
axes[1].set_title('image v')
axes[1].imshow(v,'gray', vmin=0, vmax=1)
axes[2].set_title('image translatee')
axes[2].imshow(u_translation,'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [26]:

# let's compare the two images uspecifv and u_translation :
fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(15, 5))
axes[0].set_title('image u "spécifiée" sur v')
axes[0].imshow(uspecifv, 'gray', vmin=0, vmax=1)
axes[1].set_title('image u "translatée" sur v')
axes[1].imshow(u_translation, 'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

We can remark that the translated image on the right has less contrast than the specified image on the left. The specified image is visually of better quality and looks also more similar to the image v.

5. Same question by applying an affine transform to $u$ so that its mean and variance match the ones of $v$.

In [28]:

# to do: question 5
u_affine = (u-np.mean(u))*(np.std(v)/np.std(u)) + np.mean(v)
fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(15, 5))
axes[0].set_title('image u')
axes[0].imshow(u,'gray', vmin=0, vmax=1)
axes[1].set_title('image v')
axes[1].imshow(v,'gray', vmin=0, vmax=1)
axes[2].set_title('image transfo affine')
axes[2].imshow(u_affine,'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [30]:

# let's compare the two images uspecifv and u_affine :
fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(15, 5))
axes[0].set_title('image u "spécifiée" sur v')
axes[0].imshow(uspecifv, 'gray', vmin=0, vmax=1)
axes[1].set_title('image u "transfo affine" sur v')
axes[1].imshow(u_affine, 'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [32]:

plot_histos(u, 'u')
plot_histos(u_translation, 'u_translation')
plot_histos(u_affine, 'u_affine')

Midway histogram¶

The Midway histogram between two histograms $h_u$ et $h_v$ is defined from it cumulative function $H_{midway}$ : $$H_{midway}=\left( \frac{H_u^{-1}+H_v^{-1}}{2}\right)^{-1}.$$ The goal is to modify the contrast of both images $u$ and $v$ in order to give them the same intermediary grey level distribution.

In [34]:

u_midway=np.zeros(len(index_u))
v_midway=np.zeros(len(index_v))

u_midway[index_u] = (u_sort + v_sort)/2
v_midway[index_v] = (u_sort + v_sort)/2
u_midway = u_midway.reshape(nrowu,ncolu)
v_midway = v_midway.reshape(nrowv,ncolv)


#Display the results
fig, axes = plt.subplots(nrows=2, ncols=2, figsize=(10, 8))
axes[0,0].set_title('image u')
axes[0,0].imshow(u,'gray', vmin=0, vmax=1)
axes[0,1].set_title('image v')
axes[0,1].imshow(v,'gray', vmin=0, vmax=1)
axes[1,0].set_title('image u_midway')
axes[1,0].imshow(u_midway,'gray', vmin=0, vmax=1)
axes[1,1].set_title('image v_midway')
axes[1,1].imshow(v_midway,'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

Simple transformations¶

In this exercice, you are asked to perform simple transformations on an image and find out what happens on the corresponding histogram : thresholding, affine transformation, gamma correction.

Effect of Noise on histograms¶

In the following, we want to create different noisy versions of an image $u$ and observe how the histogram $h_u$ is transformed.

Gaussian noise

6 a) Write a function adding a gaussian noise $b$ to the image $u$. An image of gaussian noise of mean $0$ and of standard deviation $\sigma$ is obtained with the command

    sigma*np.random.randn(nrow,ncol)

6 b) Display the noisy image and its histogram for different values of $\sigma$.

6 c) What do you observe ? What is the relation between the histogram of $u$ and the one of $u+b$ ?

In [37]:

imgray = np.mean(plt.imread(path+'parrot.png'),2)

def add_gaussian_noise(u, sigma):
    nrow, ncol = u.shape
    res = u + sigma*np.random.randn(nrow,ncol)
    return res

imgray_noise = add_gaussian_noise(imgray, 0.2)

fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(15, 5))
axes[0].set_title('imgray')
axes[0].imshow(imgray, 'gray', vmin=0, vmax=1)
axes[1].set_title('imgray_noise')
axes[1].imshow(imgray_noise, 'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [39]:

bruit = np.random.randn(100,100)
plt.imshow(bruit, cmap='gray')

Out[39]:

<matplotlib.image.AxesImage at 0x7fc2a9354748>

In [40]:

for sigma in [0, 0.1, 0.25, 0.5, 1]:
    plot_histos(add_gaussian_noise(imgray, sigma))

Uniform noise 7. Same questions with $b$ a uniform additive noise.

In [43]:

imgray = np.mean(plt.imread(path+'parrot.png'),2)

def add_uniform_noise(u, alpha):
    nrow, ncol = u.shape
    # solution avec np.random.uniform
    res = u + np.random.uniform(size=[nrow,ncol], low=-alpha, high=alpha)
    # solution avec np.random.rand
    #res = u + np.random.rand(nrow,ncol)*2*alpha-alpha
    return res

imgray_noise = add_uniform_noise(imgray, 0.2)

fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(15, 5))
axes[0].set_title('imgray')
axes[0].imshow(imgray, 'gray', vmin=0, vmax=1)
axes[1].set_title('imgray_noise')
axes[1].imshow(imgray_noise, 'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [44]:

for sigma in [0, 0.1, 0.25, 0.5, 1]:
    plot_histos(add_uniform_noise(imgray, sigma))

Impulse noise Let us recall that impulse noise destroy randomly a proportion $p$ of the pixels in $u$ and replace their values by uniform random values between $0$ and $255$. Mathematically, this can be modeled as $u_b= (1-X)u+XY$, where $X$ follows a Bernouilli law of parameter $p$ and $Y$ follows a uniform law on $\{0,\dots 255\}$.

8 a) Write a function adding impulse noise of parameter p to an image u. /Hint/ : you can start by using the function ~rand~ to create a table tab of random numbers following the uniform law on $[0,1[$

           tab = np.random.rand(u.shape[0],u.shape[1])

and then replace randomly $p\%$ of the pixels of $u$ by a random grey level

           ub = 255*np.random.rand(u.shape[0],u.shape[1])*(tab<p/100)+(tab>=p/100)*u;

8 b) Display the noisy image and its histogram for different values of p. What is the relation between the histogram of u and the one of u_b ?

In [48]:

# to do: question 8
def add_impulse_noise(u, p):
    nrow, ncol = u.shape
    tab = np.random.rand(nrow,ncol)
    y = np.random.rand(nrow,ncol)
    res = y*(tab<p/100) + u*(tab>=p/100)
    return res

imgray_noise = add_impulse_noise(imgray, 50)

fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(15, 5))
axes[0].set_title('imgray')
axes[0].imshow(imgray, 'gray', vmin=0, vmax=1)
axes[1].set_title('imgray_noise')
axes[1].imshow(imgray_noise, 'gray', vmin=0, vmax=1)
fig.tight_layout()
plt.show()

In [53]:

for p in [0, 5, 10, 20, 50, 75, 99]:
    plot_histos(add_impulse_noise(imgray, p))

Image Quantization¶

Quantization¶

Image quantization consists in reducing the set of grey levels $Y = \{ y_0,\dots y_{n-1} \}$ or colors of an image $u$ into a smaller set of quantized values $\{q_0,\dots q_{p-1}\}$ ($p < n$). This operation is useful for displaying an image $u$ on a screen that supports a smaller number of colors (this is needed with a standard screen if $u$ is coded on more than 8 bits by channel).

A quantization operator $Q$ is defined by the values $(q_i)_{i=0, \dots p-1}$ and $(t_j)_{j=0,\dots p}$ such that $$ t_0 \leq q_0 \leq t_1 \leq q_1 \leq \dots q_{p-1} \leq t_p,\text{ and } Q(\lambda)=q_i \text{ if } t_i \leq \lambda < t_{i+1}.$$

Uniform Quantization Uniform quantization consists in dividing the set $Y$ in $p$ regular intervals.

Use uniform quantization on a gray level image (try different numbers $K$ of grey levels) and display the result. For which value of $K$ do you start to see a difference with the original image ?

In [57]:

u = np.mean(plt.imread(path+'simpson512.png'),2)
# to do: question 9
plt.imshow(u, cmap='gray', vmin=0, vmax=1)
plt.show()
u_quantif = (np.floor(u*10)+0.5)/10
plt.imshow(u_quantif, cmap='gray', vmin=0, vmax=1)
plt.show()

In [58]:

u_quantif = (np.floor(u*2)+0.5)/2
plt.imshow(u_quantif, cmap='gray', vmin=0, vmax=1)
plt.show()

Histogram-based Quantization This consists in choosing $t_i=\min \{\lambda; H_u(\lambda) \geq \frac{i}{p} \}$, and the $q_i$ are defined as the barycenters of the intervals $[t_i,t_{i+1}].$

10 a) Show that this boils down to an histogram equalization followed by a uniform quantization

10 b) Apply this quantization on a gray level image and display the result. Same question on the limit value $K$ for which we perceive a difference with the original image.

In [ ]:

# to do: question 10

Lloyd-Max quantization This quantization consists in minimizing the least square error $$LSE((q_i)_{i=1\dots p-1},(t_i)_{i=1\dots p})= \sum_{i=0}^{p-1} \int_{t_i}^{t_{i+1}} h(\lambda) (\lambda -q_i)^2.$$ It is equivalent to the algorithm Kmeans in one dimension.

11 a) Write the optimality conditions that should be satisfied by the solution $\{(\widehat{q_i}),(\widehat{t_i})\}$.

11 b) Write a function which minimizes the least square error by alternatively minimizing in $(q_i)_{i=0, \dots p-1}$ and $(t_j)_{j=0,\dots p}$.

11 c) Apply this quantization on the previous gray level image for different values of $K$ and display the result. Comment.

In [ ]:

imgray  = np.mean(plt.imread(path+'simpson512.png'),2)
# to do : question 11

Dithering¶

Dithering consists in adding intentionnally noise to an image before quantization. For instance, it can be used to convert a grey level image to black and white in such a way that the density of white dots in the new image is an increasing function of the grey level in the original image. This is particularly useful for impression or displaying.

Let us explain how dithering works in the case of 2 grey levels (binarization). All grey levels smaller than a value $\lambda$ are replaced by $0$ and those greater than $\lambda$ are replaced by $255$. If we add a i.i.d. noise $B$ of density $p_B$ to $u$ before the binarization, then at the pixel $x$ we get $$P[u(x) + B(x) > \lambda] = P[B(x) > \lambda - u(x) ] = \int_{\lambda - u(x)}^{+\infty} p_B(s)ds,$$ which is an increasing function of the value $u(x)$. The probability that $x$ turns white in the dithered image is thus an increasing function of its original grey level.

12 a) Perform dithering in order to quantize a gray level image on 10 levels (you can add a small Gaussian noise of std 5/255 for instance). Compare the result with the previous quantizations without dithering.

12 b) Try with different levels of noise.

In [ ]:

# to do : question 12

In [ ]: