Notebooks for cnn

pobonomo · pobonomo · commit 20ef6e90bde3 · 2025-10-06T10:50:02.000+02:00
A very tiny one without pooling layers.
A less tiny with pooling. Right now I need to cheat on the bounds
of the input layer.
diff --git a/notebooks/adversarial/adversarial_cnn-tiny.ipynb b/notebooks/adversarial/adversarial_cnn-tiny.ipynb
@@ -0,0 +1,317 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Adversarial example using Keras\n",
+    "\n",
+    "In this example, we redo the [adversarial example](https://gurobi-optimization-ml2gurobi.readthedocs-hosted.com/en/latest/examples/adversarial_mnist.html) of the documentation but use tensorflow Keras for training the neural network.\n",
+    "\n",
+    "We don't detail the optimization model here. Please refer to the example in the documentation.\n",
+    "Note that many of the differences between this notebook and the one from the documentation come from\n",
+    "using tensorflow instead of numpy for manipulating data.\n",
+    "\n",
+    "This example requires the additional packages:\n",
+    " - [tensorflow](https://www.tensorflow.org/)\n",
+    " - [matplotlib](https://matplotlib.org/)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Import the necessary packages and load data\n",
+    "\n",
+    "We import all the package we need for this example.\n",
+    "The MNIST dataset is available from Keras."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from matplotlib import pyplot as plt\n",
+    "import tensorflow as tf\n",
+    "from tensorflow import keras\n",
+    "import numpy as np\n",
+    "import gurobipy as gp"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "%load_ext autoreload\n",
+    "%autoreload 2"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from gurobi_ml import add_predictor_constr"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "We reshape and scale `x_train` and `x_test`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "x_train = tf.cast(x_train, tf.float32) / 255.0\n",
+    "x_test = tf.cast(x_test, tf.float32) / 255.0\n",
+    "\n",
+    "x_train = np.expand_dims(x_train, -1)\n",
+    "x_test = np.expand_dims(x_test, -1)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Construct and train the neural network\n",
+    "\n",
+    "We construct a sequential neural network with 2 hidden layers of 50 neurons and ReLU activation.\n",
+    "\n",
+    "We use the usual Keras functions to compile and fit the network."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "nn = keras.Sequential(\n",
+    "    [\n",
+    "        keras.Input(shape=(28, 28, 1)),\n",
+    "        keras.layers.Conv2D(2, kernel_size=(4, 4), strides=(2, 2), activation=\"relu\"),\n",
+    "        keras.layers.Conv2D(2, kernel_size=(4, 4), strides=(2, 2), activation=\"relu\"),\n",
+    "        # keras.layers.MaxPooling2D(pool_size=(2, 2)),\n",
+    "        keras.layers.Flatten(),\n",
+    "        keras.layers.Dense(10, activation=\"relu\"),\n",
+    "        keras.layers.Dense(10),\n",
+    "    ]\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "nn.compile(\n",
+    "    optimizer=tf.keras.optimizers.Adam(0.001),\n",
+    "    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),\n",
+    "    metrics=[tf.keras.metrics.SparseCategoricalAccuracy()],\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "nn.fit(\n",
+    "    x_train,\n",
+    "    y_train,\n",
+    "    batch_size=128,\n",
+    "    epochs=10,\n",
+    "    validation_data=(x_test, y_test),\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Print summary of the trained network"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "nn.summary()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Build optimization model\n",
+    "\n",
+    "Now we turn to building the optimization model.\n",
+    "\n",
+    "We choose a training example and the steps are similar to the one with scikit-learn in the documentation.\n",
+    "The only differences come from the data being stored in tensors instead of arrays."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "example = x_train[18, :]\n",
+    "plt.imshow(example, cmap=\"gray\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "label = tf.math.argmax(nn.predict(tf.reshape(example, (1, 28, 28, 1)))[0])\n",
+    "print(f\"Example is classified as {label}\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ex_prob = nn.predict(tf.reshape(example, (1, 28, 28, 1)))\n",
+    "sorted_labels = tf.argsort(ex_prob)[0]\n",
+    "right_label = sorted_labels[-1]\n",
+    "wrong_label = sorted_labels[-2]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "m = gp.Model()\n",
+    "delta = 5\n",
+    "\n",
+    "x = m.addMVar((1, 28, 28, 1), lb=0.0, ub=1.0, name=\"x\")\n",
+    "y = m.addMVar(ex_prob.shape, lb=-gp.GRB.INFINITY, name=\"y\")\n",
+    "\n",
+    "abs_diff = m.addMVar(example.shape, lb=0, ub=1, name=\"abs_diff\")\n",
+    "\n",
+    "m.setObjective(y[0, wrong_label] - y[0, right_label], gp.GRB.MAXIMIZE)\n",
+    "\n",
+    "# Bound on the distance to example in norm-1\n",
+    "m.addConstr(abs_diff >= x - example)\n",
+    "m.addConstr(abs_diff >= -x + example)\n",
+    "m.addConstr(abs_diff.sum() <= delta)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "pred_constr = add_predictor_constr(m, nn, x, y)\n",
+    "\n",
+    "pred_constr.print_stats()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "m.Params.BestBdStop = 0.0\n",
+    "m.Params.BestObjStop = 0.0\n",
+    "m.Params.OBBT = 2\n",
+    "m.Params.Presolve = 1\n",
+    "m.optimize()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Finally, display the adversarial example if one was found."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "m.write(\"toto.lp\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "if m.ObjVal > 0.0:\n",
+    "    plt.imshow(x.X.reshape((28, 28)), cmap=\"gray\")\n",
+    "    label = tf.math.argmax(nn.predict(x.X), axis=1)\n",
+    "    print(f\"Solution is classified as {label}\")\n",
+    "else:\n",
+    "    print(\"No counter example exists in neighborhood.\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "copyright © 2023 Gurobi Optimization, LLC"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "gurobi-machinelearning",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.13.7"
+  },
+  "license": {
+   "full_text": "# Copyright © 2023 Gurobi Optimization, LLC\n#\n# Licensed under the Apache License, Version 2.0 (the \"License\");\n# you may not use this file except in compliance with the License.\n# You may obtain a copy of the License at\n#\n#     http://www.apache.org/licenses/LICENSE-2.0\n#\n# Unless required by applicable law or agreed to in writing, software\n# distributed under the License is distributed on an \"AS IS\" BASIS,\n# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\n# See the License for the specific language governing permissions and\n# limitations under the License.\n# =============================================================================="
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/notebooks/adversarial/adversarial_cnn.ipynb b/notebooks/adversarial/adversarial_cnn.ipynb
@@ -108,9 +108,9 @@
     "nn = keras.Sequential(\n",
     "    [\n",
     "        keras.Input(shape=(28, 28, 1)),\n",
-    "        keras.layers.Conv2D(16, kernel_size=(2, 2), activation=\"relu\"),\n",
+    "        keras.layers.Conv2D(2, kernel_size=(2, 2), activation=\"relu\"),\n",
     "        keras.layers.MaxPooling2D(pool_size=(2, 2)),\n",
-    "        keras.layers.Conv2D(32, kernel_size=(2, 2), activation=\"relu\"),\n",
+    "        keras.layers.Conv2D(4, kernel_size=(2, 2), activation=\"relu\"),\n",
     "        keras.layers.MaxPooling2D(pool_size=(2, 2)),\n",
     "        keras.layers.Flatten(),\n",
     "        keras.layers.Dense(50, activation=\"relu\"),\n",
@@ -142,7 +142,7 @@
     "    x_train,\n",
     "    y_train,\n",
     "    batch_size=128,\n",
-    "    epochs=1,\n",
+    "    epochs=10,\n",
     "    validation_data=(x_test, y_test),\n",
     ")"
    ]
@@ -240,6 +240,21 @@
     "pred_constr.print_stats()"
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# We really don't like maxpooling layers (because fomrulatting the max...)\n",
+    "# Put bound on inputs to maxpooling layers (0.0 is valid, 10.0 looks reasonable)\n",
+    "\n",
+    "pred_constr.layers[1].input.lb = 0.0\n",
+    "pred_constr.layers[1].input.ub = 10.0\n",
+    "pred_constr.layers[3].input.lb = 0.0\n",
+    "pred_constr.layers[3].input.ub = 10.0"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": null,
@@ -269,15 +284,6 @@
     "m.write(\"toto.lp\")"
    ]
   },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "m.getRow(pred_constr.layers[0].constrs[1])"
-   ]
-  },
   {
    "cell_type": "code",
    "execution_count": null,
@@ -286,7 +292,7 @@
    "source": [
     "if m.ObjVal > 0.0:\n",
     "    plt.imshow(x.X.reshape((28, 28)), cmap=\"gray\")\n",
-    "    label = tf.math.argmax(nn.predict(tf.reshape(x.X, (1, -1))), axis=1)\n",
+    "    label = tf.math.argmax(nn.predict(x.X), axis=1)\n",
     "    print(f\"Solution is classified as {label}\")\n",
     "else:\n",
     "    print(\"No counter example exists in neighborhood.\")"
@@ -316,7 +322,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.12.9"
+   "version": "3.13.7"
   },
   "license": {
    "full_text": "# Copyright © 2023 Gurobi Optimization, LLC\n#\n# Licensed under the Apache License, Version 2.0 (the \"License\");\n# you may not use this file except in compliance with the License.\n# You may obtain a copy of the License at\n#\n#     http://www.apache.org/licenses/LICENSE-2.0\n#\n# Unless required by applicable law or agreed to in writing, software\n# distributed under the License is distributed on an \"AS IS\" BASIS,\n# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\n# See the License for the specific language governing permissions and\n# limitations under the License.\n# =============================================================================="
diff --git a/src/gurobi_ml/modeling/neuralnet/layers.py b/src/gurobi_ml/modeling/neuralnet/layers.py