Verifying the example neural net was all well and good, but you probably want to verify your own neural net now. In this tutorial, we show you how to import the parameters of the feed-forward net from the introduction.
```julia
using MIPVerify
using Gurobi
using MAT
```
We'll download a `.mat` file containing the parameters of the sample neural net, which consists of two fully connected layers and a final logits layer (exported from TensorFlow).
```julia
param_dict = Base.download("https://github.com/vtjeng/MIPVerify_data/raw/master/weights/mnist/n1.mat") |> matread
```

```
Dict{String,Any} with 20 entries:
  "fc1/weight"           => Float32[-0.75335 -0.702841 … -0.0350578 -0.284728; …
  "logits/bias/Adam_1"   => Float32[3.17918f-5 1.73752f-5 … 7.47935f-5 4.50664f…
  "logits/bias/Adam"     => Float32[-0.000917174 0.000822852 … -0.00134091 -0.0…
  "fc1/bias"             => Float32[0.763984 0.215094 … -0.702274 2.13576]
  "logits/weight/Adam_1" => Float32[9.6672f-5 8.2316f-5 … 0.000832168 0.0001292…
  "fc2/weight/Adam_1"    => Float32[3.93547f-6 3.1609f-5 … 7.98859f-7 6.52231f-…
  "fc2/bias"             => Float32[0.657566 -1.76837 … 1.75472 0.168173]
  "beta1_power"          => 0.0
  "logits/bias"          => Float32[0.194171 -0.632471 … -1.13399 -1.00617]
  "fc2/bias/Adam"        => Float32[-0.000178318 0.000325774 … -0.000285843 6.3…
  "fc1/weight/Adam"      => Float32[0.0 0.0 … 4.9268f-6 -1.41152f-7; 5.39786f-1…
  "fc2/weight"           => Float32[0.242356 0.800177 … 0.189305 -1.83328; -0.1…
  "logits/weight"        => Float32[-0.0120639 0.0527637 … 0.458511 0.0677005; …
  "fc1/weight/Adam_1"    => Float32[1.38988f-14 1.15851f-15 … 9.08535f-9 5.8825…
  "fc2/weight/Adam"      => Float32[0.000683748 -0.000629174 … 0.000171999 -7.7…
  "beta2_power"          => 0.0
  "fc1/bias/Adam"        => Float32[0.000136568 7.1624f-5 … 0.000106514 0.00081…
  "logits/weight/Adam"   => Float32[-0.00237726 0.000125507 … -0.000695765 -0.0…
  "fc1/bias/Adam_1"      => Float32[1.77318f-6 2.53318f-6 … 3.44693f-6 2.25489f…
  "fc2/bias/Adam_1"      => Float32[7.38528f-6 5.92213f-6 … 1.58686f-6 4.14633f…
```
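The entries we care about are `fc1/weight`, `fc1/bias`, and so on; the `Adam` entries are optimizer state left over from training and can be ignored. If you want a quick look at just the naming, you can list the keys:

```julia
# List the parameter names to confirm the "<layer>/weight" and
# "<layer>/bias" naming convention we rely on below:
foreach(println, sort(collect(keys(param_dict))))
```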
Let's begin by importing the parameters for the first fully connected layer, which has 784 inputs (corresponding to a flattened 28×28 image) and 40 outputs.

We start with a basic approach, extracting the weights and the biases of the fully connected layer separately.
```julia
fc1_weight = param_dict["fc1/weight"]
```

```
784×40 Array{Float32,2}:
 -0.75335   -0.702841   0.00934679   0.0424287  …  -0.0350578  -0.284728
 -1.26091   -0.975876  -0.0690286   -0.145539      -0.139512   -0.494751
 -1.30244   -1.54214   -0.063707    -0.306707      -0.0144018  -0.644423
 -1.07623   -0.899058  -0.0936826   -0.025368       0.0716064  -0.357508
 -1.09825   -1.49246   -0.137173    -0.272637      -0.211558   -0.73989
 -0.96502   -0.886079  -0.081703    -0.0568857  …   0.00850921 -0.274247
 -0.733129  -0.931515  -0.0551823    0.0135264      0.138682   -0.322842
 -0.991218  -0.86704   -0.036489    -0.0409108      0.0330326  -0.472234
 -1.09089   -1.31528   -0.0488595    0.00274563    -0.160661   -0.603701
 -1.0728    -1.15951   -0.0882954   -0.10305       -0.138768   -0.700878
 -0.703903  -0.736082  -0.0991159   -0.0173509  …  -0.0374209  -0.25564
 -1.35577   -1.31734   -0.110668    -0.145239      -0.163263   -0.791115
 -0.633791  -0.653072  -0.0239808    0.0591085      0.0185225  -0.322362
  ⋮                                             ⋱
 -2.43389   -1.91281   -0.181009     0.185867       0.220844   -0.786262
 -2.26904   -1.57718   -0.0287671    0.158107      -0.112781   -0.806248
 -2.00159   -1.50079    0.00412571   0.0239607     -0.221186   -0.629596
 -1.42311   -1.20815   -0.0115304    0.111935   …  -0.41617    -0.518008
 -1.9324    -1.53047   -0.109568    -0.0240137     -0.608832   -1.12603
 -1.51948   -1.42803    0.248483    -0.0787689     -0.0857386  -0.729084
 -1.63084   -1.27073    0.158391    -0.0356018     -0.323439   -0.692301
 -1.01737   -1.03474   -0.0673599    0.0771562     -0.133131   -0.396228
 -0.80131   -0.675911  -0.0228199    0.0430513  …  -0.0688749  -0.451306
 -0.943113  -0.780626  -0.0223145    0.0169085      0.237589   -0.340899
 -0.885898  -0.713929  -0.0602917   -0.021813      -0.111022   -0.320723
 -0.963221  -0.951116  -0.0672328   -0.0266708      0.137328   -0.312931
```
```julia
fc1_bias = param_dict["fc1/bias"]
```

```
1×40 Array{Float32,2}:
 0.763984  0.215094  4.37897  -0.164343  …  4.89644  -0.702274  2.13576
```
We group the weights and biases into a `Linear` layer. (NB: We have to flatten the bias array using `dropdims`, since `Linear` expects a 1-D array for the bias.)
```julia
fc1_manual = Linear(fc1_weight, dropdims(fc1_bias, dims=1))
```

```
Linear(784 -> 40)
```
That was a lot to remember. Wouldn't it be nice if there were a helper function to take care of all that?
```julia
fc1 = get_matrix_params(param_dict, "fc1", (784, 40))
```

```
Linear(784 -> 40)
```
`get_matrix_params` requires that 1) you specify the expected size of the layer, and 2) your weight and bias arrays follow the naming convention outlined in the [documentation](https://vtjeng.github.io/MIPVerify.jl/stable/utils/import_weights.html#MIPVerify.get_matrix_params-Tuple%7BDict%7BString,V%7D where V,String,Tuple{Int64,Int64}}).
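For intuition, here's a rough sketch of what such a helper has to do internally. This is illustrative only (the function name `sketch_get_matrix_params` is made up, and the real `get_matrix_params` handles validation differently); it just combines the manual steps from above:

```julia
# Illustrative sketch, not MIPVerify's actual implementation: look up the
# weight and bias arrays via the "<name>/weight" / "<name>/bias" naming
# convention, flatten the bias, check the size, and wrap it all in a Linear.
function sketch_get_matrix_params(param_dict, layer_name, expected_size)
    weight = param_dict["$layer_name/weight"]
    bias = dropdims(param_dict["$layer_name/bias"], dims=1)
    @assert size(weight) == expected_size "unexpected size for $layer_name/weight"
    return Linear(weight, bias)
end
```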
As a sanity check, you can verify that the parameters we get from both methods are equal.
```julia
fc1.matrix == fc1_manual.matrix
```

```
true
```

```julia
fc1.bias == fc1_manual.bias
```

```
true
```
Since we followed the naming convention required by `get_matrix_params` when exporting our neural net parameters as a `.mat` file, importing the rest of the neural net is relatively straightforward.
```julia
fc2 = get_matrix_params(param_dict, "fc2", (40, 20))
```

```
Linear(40 -> 20)
```

```julia
logits = get_matrix_params(param_dict, "logits", (20, 10))
```

```
Linear(20 -> 10)
```
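Before wiring the layers together, it can be worth a quick check that consecutive layers have compatible dimensions. The loop below is just an illustrative sanity check (not part of MIPVerify), using the same `matrix` field we compared above:

```julia
# Each layer's output width must match the next layer's input width.
for (a, b) in [(fc1, fc2), (fc2, logits)]
    @assert size(a.matrix, 2) == size(b.matrix, 1)
end
```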
We now put the entire network together. We need to start by flattening the input since the input images are provided as a 4-dimensional tensor.
```julia
n1 = Sequential([
    Flatten(4),
    fc1,
    # you can always use interval arithmetic for the first layer
    ReLU(interval_arithmetic),
    fc2,
    ReLU(),
    logits
], "MNIST.n1")
```

```
sequential net MNIST.n1
  (1) Flatten(): flattens 4 dimensional input, with dimensions permuted according to the order [4, 3, 2, 1]
  (2) Linear(784 -> 40)
  (3) ReLU()
  (4) Linear(40 -> 20)
  (5) ReLU()
  (6) Linear(20 -> 10)
```
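As a final smoke test, we can push a dummy input through the imported parameters by hand and confirm that we get 10 logits out. The forward pass below is written in plain Julia (it bypasses MIPVerify's own evaluation machinery and the `Flatten` step by starting from an already-flattened input), and the random vector is just a stand-in for a real MNIST image:

```julia
# Manual forward pass with the imported parameters (illustrative only).
x  = rand(Float32, 1, 784)                  # stand-in for a flattened 28×28 image
h1 = max.(x * fc1.matrix .+ fc1.bias', 0)   # fc1 followed by ReLU
h2 = max.(h1 * fc2.matrix .+ fc2.bias', 0)  # fc2 followed by ReLU
y  = h2 * logits.matrix .+ logits.bias'     # final logits
size(y)                                     # (1, 10): one logit per digit class
```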
There we go! Now it's your turn to try to verify your own neural network.