%matplotlib inline
from fastai.gen_doc.nbdoc import *
from fastai.vision import *
from fastai.vision.gan import *
GAN stands for Generative Adversarial Nets; they were invented by Ian Goodfellow. The idea is that we train two models at the same time: a generator and a critic. The generator tries to make new images similar to the ones in our dataset, and the critic tries to tell the real images from the fake ones the generator produces. The generator returns images, the critic a feature map (which can be a single number, depending on the input size). Usually the critic is trained to return 0. everywhere for fake images and 1. everywhere for real ones.
This module contains all the necessary functions to create a GAN.
We train them against each other in the sense that at each step (more or less), we:
1. Freeze the generator and train the critic for one (or more) step by:
   - getting one batch of real images (let's call it real)
   - generating one batch of fake images (let's call it fake)
   - having the critic evaluate each batch and computing a loss from that; the important part is that it rewards positively the detection of real images and penalizes the fake ones
   - updating the weights of the critic with the gradients of this loss
2. Freeze the critic and train the generator for one (or more) step by:
   - generating one batch of fake images
   - evaluating the critic on it
   - returning a loss that rewards positively the critic thinking those are real images
   - updating the weights of the generator with the gradients of this loss
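To make that alternation concrete, here is a minimal sketch of one combined step in plain PyTorch. This is not the fastai API: the models, optimizers and loss functions are placeholders, and the loss signatures are simplified compared to the ones GANLearner expects below.
import torch

def gan_step(real, generator, critic, gen_opt, crit_opt,
             crit_loss_func, gen_loss_func, noise_sz=100):
    # One critic step followed by one generator step (placeholder loop body).
    noise = torch.randn(real.size(0), noise_sz, 1, 1, device=real.device)
    # 1. Critic step: detach the fake batch so only the critic gets gradients.
    fake = generator(noise).detach()
    crit_loss = crit_loss_func(critic(real), critic(fake))
    crit_opt.zero_grad(); crit_loss.backward(); crit_opt.step()
    # 2. Generator step: gradients flow back through the generator; the
    #    critic's optimizer is not stepped, so its weights stay fixed.
    gen_loss = gen_loss_func(critic(generator(noise)))
    gen_opt.zero_grad(); gen_loss.backward(); gen_opt.step()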
)show_doc(GANLearner)
class GANLearner(data:DataBunch, generator:Module, critic:Module, gen_loss_func:LossFunction, crit_loss_func:LossFunction, switcher:Callback=None, gen_first:bool=False, switch_eval:bool=True, show_img:bool=True, clip:float=None, **learn_kwargs) :: Learner [source]
A Learner suitable for GANs.
This is the general constructor to create a GAN; you may want to use one of the factory methods instead, which are easier to use. It creates a GAN from data, a generator and a critic. The data should have the inputs the generator expects and the wanted images as targets.
gen_loss_func is the loss function that will be applied to the generator. It takes three arguments fake_pred, target, output and should return a rank 0 tensor. output is the result of the generator applied to the input (the xs of the batch), target is the ys of the batch, and fake_pred is the result of the critic given output. output and target can be used to add a specific loss to the GAN loss (pixel loss, feature loss); for good training of the GAN, the loss should encourage fake_pred to be as close to 1 as possible (the generator is trained to fool the critic).
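For instance, a minimal gen_loss_func might combine an adversarial term with a pixel loss. This is a hypothetical sketch, assuming the critic returns raw logits; the equal weighting of the two terms is arbitrary.
import torch.nn.functional as F

def my_gen_loss(fake_pred, target, output):
    # Adversarial term: push fake_pred toward 1 so the generator fools the critic.
    adversarial = F.binary_cross_entropy_with_logits(fake_pred, fake_pred.new_ones(fake_pred.shape))
    # Extra pixel loss between the generated images and the targets.
    pixel = F.l1_loss(output, target)
    return adversarial + pixel  # a rank 0 tensor, as required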
crit_loss_func is the loss function that will be applied to the critic. It takes two arguments, real_pred and fake_pred. real_pred is the result of the critic on the target images (the ys of the batch) and fake_pred is the result of the critic applied to a batch of fake images, generated by the generator from the xs of the batch.
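A minimal crit_loss_func could look like the following, again a hypothetical sketch that assumes logit outputs from the critic.
import torch.nn.functional as F

def my_crit_loss(real_pred, fake_pred):
    # Push real_pred toward 1 and fake_pred toward 0.
    real_loss = F.binary_cross_entropy_with_logits(real_pred, real_pred.new_ones(real_pred.shape))
    fake_loss = F.binary_cross_entropy_with_logits(fake_pred, fake_pred.new_zeros(fake_pred.shape))
    return (real_loss + fake_loss) / 2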
switcher is a Callback that tells the GAN when to switch from critic to generator and vice versa. By default it does 5 iterations of the critic for 1 iteration of the generator. The model begins training with the generator if gen_first=True. If switch_eval=True, the model that isn't being trained is switched to eval mode (otherwise it is left in training mode, which means statistics like the running mean in batchnorm layers are updated and dropout is applied).
clip should be set to a certain value if one wants to clip the weights (see the Wasserstein GAN, for instance).
If show_img=True, one image generated by the GAN is shown at the end of each epoch.
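Putting it together, a hypothetical construction could look like this, reusing the loss sketches above; data, generator and critic are assumed to already exist.
learn = GANLearner(data, generator, critic, my_gen_loss, my_crit_loss,
                   switcher=partial(FixedGANSwitcher, n_crit=5, n_gen=1),
                   show_img=True)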
show_doc(GANLearner.from_learners)
Directly creates a GANLearner from two Learners: one for the generator and one for the critic. The switcher and all kwargs will be passed to the initialization of GANLearner along with the following loss functions:
- loss_func_crit is the mean of learn_crit.loss_func applied to real_pred and a target of ones, with learn_crit.loss_func applied to fake_pred and a target of zeros
- loss_func_gen is the mean of learn_crit.loss_func applied to fake_pred and a target of ones (to fool the discriminator), with learn_gen.loss_func applied to output and target. The weights of each of those contributions can be passed in weights_gen (default is 1. and 1.)
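For example (a sketch; learn_gen and learn_crit are assumed to be existing Learner objects for the generator and the critic), weights_gen=(1., 50.) would weigh the adversarial part at 1. and the generator's own loss at 50.:
learn = GANLearner.from_learners(learn_gen, learn_crit, weights_gen=(1., 50.))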
show_doc(GANLearner.wgan)
The Wasserstein GAN is detailed in [this article](https://arxiv.org/abs/1701.07875). switcher and the kwargs will be passed to the GANLearner init; clip is the weight clipping.
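As an illustration, a minimal WGAN trained on a folder of 64px images could be set up like this. It is a sketch following the usual fastai data block idiom; path, the batch size and the learning rate are placeholders.
data = (GANItemList.from_folder(path, noise_sz=100)
        .no_split()
        .label_from_func(noop)
        .transform(tfms=[[crop_pad(size=64, row_pct=(0,1), col_pct=(0,1))], []],
                   size=64, tfm_y=True)
        .databunch(bs=128))
generator = basic_generator(in_size=64, n_channels=3, n_extra_layers=1)
critic    = basic_critic(in_size=64, n_channels=3, n_extra_layers=1)
learn = GANLearner.wgan(data, generator, critic,
                        opt_func=partial(optim.Adam, betas=(0., 0.99)), wd=0.)
learn.fit(1, 2e-4)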
show_doc(FixedGANSwitcher, title_level=3)
class FixedGANSwitcher(learn:Learner, n_crit:Union[int,Callable]=1, n_gen:Union[int,Callable]=1) :: LearnerCallback [source]
Switcher to do n_crit iterations of the critic then n_gen iterations of the generator.
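For instance, to spell out the default schedule used by GANLearner (a sketch):
switcher = partial(FixedGANSwitcher, n_crit=5, n_gen=1)  # 5 critic steps per generator step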
show_doc(FixedGANSwitcher.on_train_begin)
show_doc(FixedGANSwitcher.on_batch_end)
show_doc(AdaptiveGANSwitcher, title_level=3)
class AdaptiveGANSwitcher(learn:Learner, gen_thresh:float=None, critic_thresh:float=None) :: LearnerCallback [source]
Switcher that goes back to generator/critic when the loss goes below gen_thresh/crit_thresh.
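For instance (a sketch; the threshold is an arbitrary value):
switcher = partial(AdaptiveGANSwitcher, critic_thresh=0.65)  # stay on the critic until its loss drops below 0.65, then switch to the generator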
show_doc(AdaptiveGANSwitcher.on_batch_end)
If you want to train your critic at a different learning rate than the generator, this will let you do it automatically (even if you have a learning rate schedule).
show_doc(GANDiscriminativeLR, title_level=3)
class GANDiscriminativeLR(learn:Learner, mult_lr:float=5.0) :: LearnerCallback [source]
Callback that handles multiplying the learning rate by mult_lr for the critic.
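For example, to train the critic at 5 times the generator's learning rate (a sketch; learn_gen and learn_crit are assumed to exist):
learn = GANLearner.from_learners(learn_gen, learn_crit,
                                 callback_fns=[partial(GANDiscriminativeLR, mult_lr=5.)])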
show_doc(GANDiscriminativeLR.on_batch_begin)
show_doc(GANDiscriminativeLR.on_step_end)
show_doc(basic_critic)
basic_critic(in_size:int, n_channels:int, n_features:int=64, n_extra_layers:int=0, **conv_kwargs) [source]
A basic critic for images n_channels x in_size x in_size.
This model starts with a 4 by 4 convolutional layer of stride 2 from n_channels to n_features, followed by n_extra_layers 3 by 3 convolutional layers of stride 1. It then stacks as many 4 by 4 convolutional layers of stride 2, with the number of features multiplied by 2 at each stage, as needed for the in_size to become 1. kwargs can be used to customize the convolutional layers and are passed to conv_layer.
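For example, a critic for 64 by 64 RGB images (a sketch; the output shape is an assumption based on the description above):
critic = basic_critic(in_size=64, n_channels=3, n_features=64, n_extra_layers=1)
x = torch.randn(16, 3, 64, 64)   # a batch of 16 images
preds = critic(x)                # one prediction per image once the spatial size reaches 1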
show_doc(basic_generator)
basic_generator(in_size:int, n_channels:int, noise_sz:int=100, n_features:int=64, n_extra_layers=0, **conv_kwargs) [source]
A basic generator from noise_sz to images n_channels x in_size x in_size.
This model starts with a 4 by 4 transposed convolutional layer of stride 1 from noise_sz to the last number of features of the corresponding critic. It then stacks as many 4 by 4 transposed convolutional layers of stride 2, with the number of features divided by 2 at each stage, as needed for the image to reach a height and width of in_size//2. At the end, we add n_extra_layers 3 by 3 convolutional layers of stride 1. The last layer is a transposed convolution of size 4 by 4 and stride 2, followed by tanh. kwargs can be used to customize the convolutional layers and are passed to conv_layer.
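For example, the generator matching the critic above (a sketch; the noise is assumed to be shaped like NoisyItem's output, i.e. noise_sz x 1 x 1 per image):
generator = basic_generator(in_size=64, n_channels=3, noise_sz=100)
z = torch.randn(16, 100, 1, 1)   # a batch of 16 noise vectors
imgs = generator(z)              # expected: 16 x 3 x 64 x 64, in [-1, 1] from the tanh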
show_doc(gan_critic)
gan_critic(n_channels:int=3, nf:int=128, n_blocks:int=3, p:int=0.15) [source]
Critic to train a GAN.
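The size of this critic's output depends on the size of its input images, so in practice it is typically paired with AdaptiveLoss (documented below), which expands the 0/1 target to the critic's output size, e.g. (a sketch):
critic = gan_critic()
loss_critic = AdaptiveLoss(nn.BCEWithLogitsLoss())  # expand the target to match the critic's output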
show_doc(GANTrainer)
class GANTrainer(learn:Learner, switch_eval:bool=False, clip:float=None, beta:float=0.98, gen_first:bool=False, show_img:bool=True) :: LearnerCallback [source]
Handles GAN Training.
LearnerCallback that is responsible for handling the two different optimizers (one for the generator and one for the critic) and doing all the work behind the scenes so that the generator (or the critic) is in training mode, with parameters requiring gradients, each time we switch.
switch_eval=True means that the GANTrainer will put the model that isn't training into eval mode (if it's False, its running statistics, like those in batchnorm layers, will be updated and dropout will be applied). clip is the clipping applied to the weights (if not None). beta is the coefficient for the moving averages, as the GANTrainer separately tracks the generator loss and the critic loss. gen_first=True means the training begins with the generator (with the critic if it's False). If show_img=True we show a generated image at the end of each epoch.
show_doc(GANTrainer.switch)
switch(gen_mode:bool=None) [source]
Switch the model to the desired mode, if gen_mode is provided. If gen_mode is left as None, just put the model in the other mode (critic if it was in generator mode and vice versa).
show_doc(GANTrainer.on_train_begin)
on_train_begin(**kwargs) [source]
Create the optimizers for the generator and critic if necessary, and initialize the loss smoothers.
show_doc(GANTrainer.on_epoch_begin)
on_epoch_begin(epoch, **kwargs) [source]
Put the critic or the generator back to eval if necessary.
show_doc(GANTrainer.on_batch_begin)
on_batch_begin(last_input, last_target, **kwargs) [source]
Clamp the weights with self.clip if it's not None, and return the correct input.
show_doc(GANTrainer.on_backward_begin)
on_backward_begin(last_loss, last_output, **kwargs) [source]
Record last_loss in the proper list.
show_doc(GANTrainer.on_epoch_end)
on_epoch_end(pbar, epoch, **kwargs) [source]
Put the various losses in the recorder and show a sample image.
show_doc(GANTrainer.on_train_end)
show_doc(GANModule, title_level=3)
Wrapper around a generator and a critic to create a GAN. If gen_mode is left as None, its switch method just puts the model in the other mode (critic if it was in generator mode and vice versa).
show_doc(GANModule.switch)
switch(gen_mode:bool=None) [source]
Put the model in generator mode if gen_mode, in critic mode otherwise.
show_doc(GANLoss, title_level=3)
show_doc(AdaptiveLoss, title_level=3)
show_doc(accuracy_thresh_expand)
accuracy_thresh_expand(y_pred:Tensor, y_true:Tensor, thresh:float=0.5, sigmoid:bool=True) → Rank0Tensor [source]
Compute accuracy after expanding y_true to the size of y_pred.
show_doc(NoisyItem, title_level=3)
show_doc(GANItemList, title_level=3)
class GANItemList(items, noise_sz:int=100, **kwargs) :: ImageItemList [source]
ItemList suitable for GANs.
Inputs will be NoisyItems of noise_sz while the default class for targets is ImageItemList.
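For example, to pair random noise inputs with the images in a folder as targets (a sketch; path is a placeholder, and the full data block pipeline is shown in the WGAN example above):
itemlist = GANItemList.from_folder(path, noise_sz=100)  # each input is a NoisyItem of size 100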
show_doc(GANItemList.show_xys)
show_xys(xs, ys, imgsize:int=4, figsize:Optional[Tuple[int,int]]=None, **kwargs) [source]
Shows ys (target images) on a figure of figsize.
show_doc(GANItemList.show_xyzs)
show_xyzs(xs, ys, zs, imgsize:int=4, figsize:Optional[Tuple[int,int]]=None, **kwargs) [source]
Shows zs (generated images) on a figure of figsize.
show_doc(GANLoss.critic)
critic(real_pred, input) [source]
Create some fake_pred with the generator from input and compare them to real_pred in self.loss_funcD.
show_doc(GANModule.forward)
forward(*args) [source]
Defines the computation performed at every call. Should be overridden by all subclasses.
Note: although the recipe for the forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this, since the former takes care of running the registered hooks while the latter silently ignores them.
show_doc(GANLoss.generator)
generator(output, target) [source]
Evaluate the output with the critic, then use self.loss_funcG to combine it with target.
show_doc(NoisyItem.apply_tfms)
show_doc(AdaptiveLoss.forward)
forward(output, target) [source]
Defines the computation performed at every call. Should be overridden by all subclasses.
Note: although the recipe for the forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this, since the former takes care of running the registered hooks while the latter silently ignores them.
show_doc(GANItemList.get)
show_doc(GANItemList.reconstruct)