In this first notebook, we will introduce tensors, the basic building blocks in PyTorch.
We will see different ways to create tensors and check their properties. Then, we will briefly go through the different kinds of operations they support, such as mathematical operations, indexing, reshaping, expansion, masking, type conversion, etc.
PyTorch is a Python-based scientific computing library, similar to NumPy.
It differs from NumPy in three main aspects: it can run computations on GPUs, it keeps track of the computation graph to enable automatic differentiation (see next notebook), and it provides building blocks for neural networks.
import torch
import numpy as np
print("PyTorch Version:", torch.__version__)
A matrix is a grid of numbers, let's say (3x5).
In simple terms, a tensor can be seen as a generalization of a matrix to higher dimensions.
It can be of arbitrary shape, e.g. (3 x 6 x 2 x 10).
You can think of tensors as multidimensional arrays.
X = torch.tensor([1, 2, 3, 4, 5])
X
X.shape
X = torch.tensor([[1, 2, 3], [4, 5, 6]])
X
X.shape
torch.tensor behaves like numpy.array under mathematical operations. The syntax is very similar between the two libraries.
If you are familiar with NumPy, you can browse here to check the PyTorch equivalents of the NumPy functions.
For example:
np.eye(2)
torch.eye(2)
np.arange(1,5)
torch.arange(1,5)
As we said, torch.tensor additionally keeps track of the computation graph (see next notebook) and provides GPU support.
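As a minimal sketch of these two extra features (automatic differentiation is covered in the next notebook, and the GPU part assumes CUDA is available):
a = torch.ones(3, requires_grad=True) # this tensor records the operations applied to it
b = (a * 2).sum() # b carries a grad_fn pointing back to a
if torch.cuda.is_available():
    a_gpu = a.to('cuda') # the same data, moved to the GPU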
torch.zeros(5)
torch.ones(5)
torch.eye(3)
torch.empty((3, 5))
torch.rand((5, 3))
torch.arange(3, 9, 2)
torch.linspace(0, 1, 11)
Most of these creation operations have a _like counterpart that creates a tensor with the same size, dtype and device as the given tensor.
A = torch.ones((2,3)) # A is a float tensor of size 2x3 located on cpu
B = torch.zeros_like(A) # So B is a float tensor of size 2x3 located on cpu
B
This list is not exhaustive but gives you an idea of the diversity of ways to create a tensor.
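For illustration, here are a few more of the available creation functions (just examples, still not an exhaustive list):
torch.full((2, 3), 7.0) # a 2x3 tensor filled with the value 7.0
torch.randn(2, 3) # samples from a standard normal distribution
torch.randint(0, 10, (2, 3)) # random integers drawn uniformly from [0, 10)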
Create the tensor:
x = torch.Tensor([[0,1,2], [3,4,5]])
print("x.shape: \n%s\n" % (x.shape,))
print("x.size(): \n%s\n" % (x.size(),))
print("x.size(1): \n%s\n" % x.size(1))
print("x.dim(): \n%s\n" % x.dim())
print("x.numel(): \n%s\n" % x.numel())
print("x.dtype: \n%s\n" % x.dtype)
print("x.device: \n%s\n" % x.device)
The nonzero function returns the indices of the non-zero elements.
x = torch.Tensor([[0,1,2], [3,4,5]])
print("x.nonzero(): \n%s\n" % x.nonzero())
Unlike in NumPy, there are two ways to perform most operations in PyTorch:
torch.op(tensor)
tensor.op()
X = torch.rand(3, 2)
torch.exp(X)
X.exp()
You can easily chain operations:
X.sqrt().std()
(X.exp() + 2).sqrt() - 2 * X.log().sigmoid() # be creative :-)
Many more functions are available: sin, cos, tanh, bmm, cumsum, dot, etc.
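For instance, a quick illustrative taste of two of them:
v = torch.tensor([1., 2., 3.])
v.cumsum(dim=0) # running sum: tensor([1., 3., 6.])
v.dot(v) # dot product: tensor(14.)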
Compute the norms of the row-vectors in matrix X without using torch.norm().
Remember: $$||\vec{v}||_2 = \sqrt{v_1^2 + v_2^2 + \dots + v_n^2}$$
Hint: X**2 computes the element-wise square.
X = torch.eye(4) + torch.arange(4).repeat(4, 1).float()
# YOUR TURN
# SOLUTION: tensor([3.8730, 4.1231, 4.3589, 4.5826])
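One possible solution, as a sketch (element-wise square, sum over each row, then square root):
(X ** 2).sum(dim=1).sqrt()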
X = torch.rand(3, 2)
X
X.sum()
X.max()
X.mean(dim=1)
X.norm(p=1)
For a matrix $X$, compute $Y$ such that $Y_i = \log \big[ \sum_j \exp (X_{i,j}) \big]$.
X = torch.eye(4) + torch.arange(4).repeat(4, 1).float()
# YOUR TURN
# SOLUTION: tensor([3.4938, 3.5797, 3.7817, 4.1852])
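One possible solution, as a sketch (PyTorch also provides torch.logsumexp, which computes the same quantity in a numerically more stable way):
X.exp().sum(dim=1).log()
# equivalently: torch.logsumexp(X, dim=1)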
Y = torch.rand(2, 3)
Y
# Matrix multiplication
Y.t() @ Y
Y.t().matmul(Y)
# CAUTION: Operator '*' does element-wise multiplication, just like in numpy!
# Y.t() * Y # error, dimensions do not match for element-wise multiplication
torch.inverse(Y @ Y.t()) # Y @ Y.t() is 2x2 and generically invertible (Y.t() @ Y would be singular, since Y has rank at most 2)
Y = torch.rand(3, 3)
Y.det()
torch.linalg.eig(Y) # Y.eig() is deprecated in recent PyTorch versions; torch.linalg.eig returns eigenvalues and eigenvectors
Functions that mutate the tensor in place end with an underscore, e.g. add_, div_, etc.
A = torch.eye(3)
A
A.add(5)
A
A.add_(5)
A
A.uniform_() # fills the tensor with random uniform numbers in [0, 1]
A
A = A + 1 # After this operation, A refers to a new tensor: memory has been copied
A += 1 # After this operation, A is still the same tensor: its memory has been modified in place
Compare the outputs:
A = torch.ones(1)
A_before = A
A = A + 1
print(A, A_before)
A = torch.ones(1)
A_before = A
A += 1
print(A, A_before)
Again, it works just like in NumPy.
A = torch.randint(100, (3, 3))
A
A[0,2]
A[:, 1:2], A[:, 1:2].shape
Note: You can use ... to mark any number of dimensions.
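For example, a small illustrative sketch:
T = torch.arange(24).view(2, 3, 4)
T[..., 0] # same as T[:, :, 0], shape (2, 3)
T[0, ...] # same as T[0], shape (3, 4)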
X = torch.randint(100, (5, 3))
X
mask = (X > 25) & (X < 75)
mask
X[mask] # returns all elements matching the criteria in a 1D-tensor
X[mask] = 0 # You can assign new values only to indices matching the condition:
X
X = torch.arange(40).view(5,8)
X
Extract this vector from the tensor X:
$ \begin{bmatrix} 17 & 19 & 21 & 23 \\ \end{bmatrix} $
# YOUR TURN
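One possible solution, as a sketch: the values sit in row 2, in every other column starting from column 1.
X[2, 1::2]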
X = torch.tensor([[1., 0., 2.], [4., 6., 0.]])
Get the number of non-zeros in X
# YOUR TURN
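One possible solution, as a sketch:
(X != 0).sum() # or: X.nonzero().shape[0]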
Compute the sum of all entries in X that are larger than the mean of all values in X.
# YOUR TURN
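One possible solution, as a sketch, using a boolean mask:
X[X > X.mean()].sum()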
View
view is the equivalent of reshape in NumPy, but view does not allocate new memory: the output tensor shares the same data!
The number of arguments of view will be the number of dimensions of the output tensor.
X = torch.tensor([1, 2, 3, 4, 5, 6])
X
Y = X.view(2, 3) # view tensor X as a 2-dimensional tensor, with size 2 along the first dimension and size 3 along the second
Y
Y = X.view(2, -1) ## -1 tells PyTorch to infer the number of elements along that dimension
Y, Y.shape
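A small sketch to illustrate that the view shares the same data as the original tensor:
Y[0, 0] = 100
X # X[0] has changed to 100 as well, since no memory was copied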
Expand
expand creates a new view of the tensor with dimensions of size 1 expanded to a larger size. This does not allocate new memory either!
Y = torch.ones(5)
Y
Y.expand(5, 5)
Note: There also exist reshape and repeat functions in PyTorch. They work similarly to view and expand, but they may copy memory: repeat always copies, and reshape copies whenever a view is not possible.
Squeeze and Unsqueeze
squeeze removes all dimensions of size 1, while unsqueeze adds a new dimension of size 1 at a given position.
X = torch.eye(4)
Y = X[:1]
Y, Y.shape
Y = Y.squeeze() # removes all dimensions of size '1'
Y, Y.shape
Y = Y.unsqueeze(1) # add a new dimension in position 1
Y, Y.shape
Flatten
We use flatten to collapse several dimensions of a tensor into a single one, producing a lower-dimensional tensor.
X = torch.rand(3,2,4)
print(X.shape)
X
Y = X.flatten()
Y.shape, Y
Y = X.flatten(start_dim=1)
Y.shape, Y
Create the tensor:
$ \begin{bmatrix} 7 & 5 & 5 & 5 & 5 \\ 5 & 7 & 5 & 5 & 5 \\ 5 & 5 & 7 & 5 & 5 \\ 5 & 5 & 5 & 7 & 5 \\ 5 & 5 & 5 & 5 & 7 \end{bmatrix} $
Hint: You can use matrix sum and scalar multiplication
# YOUR TURN
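One possible solution, as a sketch, following the hint:
5 * torch.ones(5, 5) + 2 * torch.eye(5)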
Create the tensor:
$ \begin{bmatrix} 2 & 2 & 2 & 2 & 2 \\ 4 & 4 & 4 & 4 & 4 \\ 6 & 6 & 6 & 6 & 6 \\ 8 & 8 & 8 & 8 & 8 \end{bmatrix} $
# YOUR TURN
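One possible solution, as a sketch, combining view and expand:
(2 * torch.arange(1, 5)).view(4, 1).expand(4, 5)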
# press tab to autocomplete
x.
.to()
To cast a tensor to a different type, we can use the Tensor.to(dtype) function.
Y = 4 * torch.rand((2,4))
Y
Y.dtype
Y.to(torch.float16)
Y.to(torch.int64)
Y.to(torch.bool)
Note the automatic type promotion:
torch.LongTensor([1, 2]) + torch.FloatTensor([1.1, 2.2])
You can use .bool(), .short(), .int(), .long(), .float(), .double() to convert the tensor to the required type.
Y = 4 * torch.rand((2,4))
Y
Y.int()
Y.float()
X = np.random.random((5,3))
X
# numpy ---> torch
Y = torch.from_numpy(X) # Y is actually a DoubleTensor (i.e. 64-bit representation)
Y
Y = torch.rand((2,4))
Y
# torch ---> numpy
X = Y.numpy()
X
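Note that torch.from_numpy (and Tensor.numpy) share memory with the underlying NumPy array; here is a small sketch to illustrate:
A = np.zeros(3)
B = torch.from_numpy(A)
A[0] = 1.0
B # the change made through the NumPy array is visible in the tensor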
First, you may want to check torch.cuda.is_available() and torch.cuda.device_count():
torch.cuda.is_available()
torch.cuda.device_count()
x = torch.Tensor([[1,2,3], [4,5,6]])
print(x)
torch.device
The best way to move a tensor from one device to another is again the Tensor.to(...) function. You need to pass a torch.device object as argument.
A torch.device is an object representing the device on which a torch.Tensor is or will be allocated.
Note: If you don't have CUDA on your machine, the following examples won't work.
cpu = torch.device('cpu')
cuda_0 = torch.device('cuda:0')
x = x.to(cpu)
print(x.device)
x = x.to(cuda_0)
print(x.device)
This is flexible, since you only need to check once in your code whether CUDA is available.
Your Turn
Define the device to be a GPU if available, falling back to the CPU otherwise.
device = None # YOUR TURN
x = x.to(device) # We don't need to worry anymore about whether cuda is available or not in the rest of the code
print(x.device)
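One common way to define it, as a sketch:
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')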
Tensor.cuda and Tensor.cpu
You may also see the use of the Tensor.cuda() and Tensor.cpu() functions.
x = torch.Tensor([[1,2,3], [4,5,6]])
print(x)
x.cuda()
print(x.device) # Note: x is still on cpu, because x.cuda() returns a new tensor instead of modifying x in place
x = x.cuda()
print(x.device)
x = x.cuda(1) # This will fail if you have only one gpu
print(x.device)
x = x.cpu()
print(x.device)
x = torch.Tensor([[1,2,3], [4,5,6]])
# This will generate an error, since you cannot perform operations on tensors that are not on the same device
x + x.cuda()
In general, the more flexible Tensor.to(...) function should be preferred.