
Commit f04f2b9

Model development (#18)
* add model blocks and tests
* add model and tests
* add torch to setup
* add torch to requirements
* update workflow
* add data loader and tests
* add opencv to requirements
* add opencv to requirements and setup
* update opencv version
* implement attention mechanism in unet
* add network visualisations
* update readme
* add visualisations
* update requirements
* devcontainer tweaks
* tweak devcontainer
* minor changes to setup.sh - add safe directory
* add utils function to convert images to .npy
1 parent 887545d commit f04f2b9

File tree: 13 files changed (+430, -28 lines)


.devcontainer/devcontainer.json

Lines changed: 20 additions & 3 deletions

```diff
@@ -8,7 +8,24 @@
     // Update the 'dockerFile' property if you aren't using the standard 'Dockerfile' filename.
     "dockerfile": "../Dockerfile"
   },
-  "features": {},
+  "features": {
+    "ghcr.io/devcontainers/features/common-utils:2": {
+      "installzsh": true,
+      "configurezshasdefaultshell": true,
+      "installohmyzsh": true,
+      "upgradePackages": false
+    },
+    "ghcr.io/devcontainers/features/docker-outside-of-docker:1": {
+      "moby": true,
+      "installdockerbuildx": true,
+      "version": "20.10",
+      "dockerdashcomposeversion": "v2"
+    },
+    "ghcr.io/devcontainers/features/github-cli:1": {
+      "installDirectlyFromGitHubRelease": true,
+      "version": "latest"
+    }
+  },
   "postCreateCommand": {
     "post_create": ".devcontainer/setup.sh"
   },
@@ -28,8 +45,8 @@
     }
   },
   "runArgs": [
-    // "--runtime=nvidia",
-    "--gpus=all"
+    //"--runtime=nvidia",
+    "--gpus=all"
   ]
   // Features to add to the dev container. More info: https://containers.dev/features.
   // "features": {},
```

.devcontainer/setup.sh (file mode changed: 100644 → 100755)

Lines changed: 3 additions & 1 deletion

```diff
@@ -1,5 +1,7 @@
 #!/bin/bash
 
+git config --global --add safe.directory /workspaces/UnderWaterU-Net
+
 pip install -e .[dev]
 pip install pytest-cov
-pre-commit install
+pre-commit install
```

(The final pair only adds a trailing newline to the file.)

.gitignore

Lines changed: 4 additions & 0 deletions

```diff
@@ -158,3 +158,7 @@ cython_debug/
 # and can be added to the global gitignore or merged into this file. For a more nuclear
 # option (not recommended) you can uncomment the following to ignore the entire idea folder.
 .idea/
+
+# Datasets
+data/
+*.npy
```

.pre-commit-config.yaml (new file)

Lines changed: 46 additions & 0 deletions

```yaml
repos:
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v3.4.0  # Use the version you prefer
    hooks:
      - id: trailing-whitespace
      - id: end-of-file-fixer
      - id: check-yaml
      - id: check-added-large-files

  - repo: https://github.com/psf/black
    rev: 21.9b0  # Use the version you prefer
    hooks:
      - id: black
        args: ['--safe']

  - repo: https://github.com/pycqa/flake8
    rev: 3.9.2  # Use the version you prefer
    hooks:
      - id: flake8

  - repo: https://github.com/pre-commit/mirrors-autopep8  # Auto formatting
    rev: v2.0.2
    hooks:
      - id: autopep8

  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v1.2.3
    hooks:
      - id: flake8  # Checking PEP8 that was not corrected by autopep8
      - id: trailing-whitespace
      - id: end-of-file-fixer
      - id: check-yaml
      - id: check-added-large-files

  - repo: https://github.com/kynan/nbstripout
    rev: 0.6.1
    hooks:
      - id: nbstripout  # Remove outputs from notebooks

  - repo: https://github.com/nbQA-dev/nbQA  # Same as above but for notebooks content
    rev: 1.7.0
    hooks:
      - id: nbqa-autopep8
      - id: nbqa-flake8
        args: [--ignore=F401]  # Ignore unused imports as they are not fixed automatically
      - id: nbqa-isort
```

README.md

Lines changed: 60 additions & 1 deletion

````diff
@@ -1,4 +1,3 @@
-
 # UnderWaterU-Net 🌊
 
 ![UnderWaterU-Net Logo](path_to_my_logo.png)
@@ -12,3 +11,63 @@ Welcome to UnderWaterU-Net, a deep learning repository specially optimized for u
 - **Expandable with Submodules**: Modular design allows for easy expansion and incorporation of additional functionalities.
 - **Streamlined Workflow**: From raw underwater images to precise segmentations, UnderWaterU-Net makes the process seamless.
 
+
+## 🚀 Getting Started
+
+### Prerequisites
+
+- List any prerequisites or dependencies here.
+
+### Installation
+
+1. **Direct Installation**:
+   ```bash
+   git clone git@github.com:ioannispol/UnderWaterU-Net.git
+   ```
+
+2. **Advanced Setup (With Submodules)**:
+   ```bash
+   git clone --recurse-submodules git@github.com:ioannispol/UnderWaterU-Net.git
+   ```
+
+## 📖 Documentation
+
+Detailed documentation can be found [here](link_to_your_documentation).
+<!-- Replace with a link to your documentation if you have it. -->
+
+## 🤝 Contributing
+
+We welcome contributions! Please see our [CONTRIBUTING.md](link_to_contributing_guide) for details.
+<!-- Replace with a link to your contributing guide if you have it. -->
+
+## 📜 License
+
+This project is licensed under the XYZ License - see the [LICENSE.md](link_to_license) for details.
+<!-- Replace with a link to your license file and mention the type of license you're using. -->
+
+## 📬 Contact
+
+For any queries, feel free to reach out to [ioannispol](mailto:your_email@example.com).
+<!-- Replace with your email or contact details. -->
+
+## Attention Mechanisms in U-Net
+
+The U-Net architecture has been extended to include attention gates, which allow the model to focus on specific regions of the input, enhancing its capability to segment relevant regions more accurately.
+
+### AttentionGate Module
+
+The AttentionGate module takes two inputs, \( g \) and \( x \), and computes the attention coefficients. These coefficients are used to weight the features in \( x \) to produce the attended features. The process can be summarized as follows:
+
+1. Two 1x1 convolutions transform \( g \) and \( x \) into a compatible space.
+2. A non-linearity (ReLU) is applied after summing the transformed versions of \( g \) and \( x \).
+3. Another 1x1 convolution followed by a sigmoid activation produces the attention coefficients in the range [0, 1].
+4. The original \( x \) is multiplied by the attention coefficients to obtain the attended features.
+
+This mechanism is particularly useful in tasks like image segmentation, enabling the network to emphasize more informative regions during training and prediction.
+
+### Reference
+
+The attention mechanism is inspired by the following paper:
+- Oktay, O., Schlemper, J., Folgoc, L. L., Lee, M., Heinrich, M., Misawa, K., ... & Glocker, B. (2018). Attention U-Net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999.
+
````
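
The four steps the README lists map directly onto a small PyTorch module. Below is a minimal sketch of such an additive attention gate — not the repository's actual `underwater_unet.model` implementation; the class name, argument names, and channel sizes are assumptions:

```python
import torch.nn as nn


class AttentionGate(nn.Module):
    """Additive attention gate in the style of Oktay et al. (2018)."""

    def __init__(self, g_channels, x_channels, inter_channels):
        super().__init__()
        # Step 1: 1x1 convolutions project g and x into a shared space
        self.w_g = nn.Conv2d(g_channels, inter_channels, kernel_size=1)
        self.w_x = nn.Conv2d(x_channels, inter_channels, kernel_size=1)
        # Step 3: 1x1 convolution + sigmoid -> coefficients in [0, 1]
        self.psi = nn.Conv2d(inter_channels, 1, kernel_size=1)
        self.relu = nn.ReLU(inplace=True)
        self.sigmoid = nn.Sigmoid()

    def forward(self, g, x):
        # Step 2: ReLU over the summed projections (g and x must match in H, W)
        a = self.relu(self.w_g(g) + self.w_x(x))
        # Step 3: attention coefficients
        alpha = self.sigmoid(self.psi(a))
        # Step 4: weight x by the coefficients to get the attended features
        return x * alpha
```

In a U-Net decoder, `g` would typically be the coarser gating signal and `x` the skip-connection features, with one of them resampled first so their spatial sizes match before the sum.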

notebooks/test_unet.ipynb (new file)

Lines changed: 130 additions & 0 deletions

The new notebook (Python 3 kernel, nbformat 4) contains four code cells, shown here as a script with `# %%` cell markers:

```python
# %%
import os

import cv2
import matplotlib.pyplot as plt
import numpy as np
import torch
import torch.nn as nn
import torchvision.datasets as datasets
import torchvision.transforms as transforms

from underwater_unet.model import UNet

%matplotlib inline

# %%
# Insert the UNet and AttentionUNet code here
model = UNet(n_channels=1, n_classes=2)  # Example for a grayscale image to be classified into 2 classes

# %%
transform = transforms.Compose([
    transforms.Resize((256, 256)),  # Resizing to fit the U-Net architecture
    transforms.ToTensor(),
])

test_dataset = datasets.MNIST(root='./data', train=False, transform=transform, download=True)
test_loader = torch.utils.data.DataLoader(test_dataset, batch_size=4, shuffle=True)

# %%
def display_image_from_npy(npy_path, image_index, method="opencv"):
    """
    Load and display an image from a .npy file.

    Parameters:
    - npy_path (str): Path to the .npy file containing the images.
    - image_index (int): 0-based index of the image to display from the .npy file.
    - method (str): Method to use for displaying the image. Options are "opencv" or "matplotlib".
    """

    # Load the dataset from the .npy file
    dataset = np.load(npy_path)

    # Check if the image_index is valid
    if image_index < 0 or image_index >= len(dataset):
        print(f"Invalid image index. Please provide an index between 0 and {len(dataset) - 1}.")
        return

    # Get the desired image
    image = dataset[image_index]

    if method == "opencv":
        # Display the image using OpenCV
        cv2.imshow(f'Image {image_index}', image)
        cv2.waitKey(0)
        cv2.destroyAllWindows()
    elif method == "matplotlib":
        # Display the image using Matplotlib
        plt.imshow(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))
        plt.title(f'Image {image_index}')
        plt.axis('off')
        plt.show()
    else:
        print("Invalid method. Choose 'opencv' or 'matplotlib'.")

# %%
dataset_path = '/workspaces/UnderWaterU-Net/dataset.npy'
display_image_from_npy(dataset_path, 20, method="matplotlib")
```
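
The notebook reads `/workspaces/UnderWaterU-Net/dataset.npy`, which the commit message says comes from a new utility that converts images to .npy. That utility is not part of this diff; a minimal sketch of what such a helper could look like — the function name, signature, and default size are assumptions:

```python
import os

import cv2
import numpy as np


def images_to_npy(images_dir, out_path, size=(256, 256)):
    """Hypothetical helper: stack all images in a directory into one (N, H, W, 3) .npy array."""
    arrays = []
    for name in sorted(os.listdir(images_dir)):
        img = cv2.imread(os.path.join(images_dir, name))  # BGR, uint8; None for non-images
        if img is None:
            continue
        arrays.append(cv2.resize(img, size))
    np.save(out_path, np.stack(arrays))


# e.g. images_to_npy('data/images', 'dataset.npy')
```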

requirements.txt

Lines changed: 1 addition & 0 deletions

```diff
@@ -6,3 +6,4 @@ wandb
 jupyterlab
 torch >= 2.0
 opencv-python <=4.8.0.74
+dowhy
```

setup.py

Lines changed: 1 addition & 0 deletions

```diff
@@ -9,6 +9,7 @@
     'setuptools',
     'numpy',
     'scipy',
+    'dowhy',
     'matplotlib',
     'pandas',
     'torch ~= 2.0',
```

train.py (new file)

Lines changed: 52 additions & 0 deletions

```python
import os

import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader

from underwater_unet.model import UNet
from utils.data_load import UnderwaterDataset


# Hyperparameters and setup
num_epochs = 10
learning_rate = 0.001
batch_size = 16
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Load dataset and dataloader
train_dataset = UnderwaterDataset(images_dir='data/images', mask_dir='data/masks', resize_to=None)
train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True)

# Initialize model, loss and optimizer
model = UNet(n_channels=3, n_classes=1).to(device)
criterion = nn.BCEWithLogitsLoss()
optimizer = optim.Adam(model.parameters(), lr=learning_rate)

# Make sure the checkpoint directory exists
os.makedirs('experiment', exist_ok=True)

# Training loop
for epoch in range(num_epochs):
    model.train()
    for batch in train_loader:
        images = batch['image'].to(device)
        masks = batch['mask'].to(device)

        # Forward pass
        outputs = model(images)
        loss = criterion(outputs, masks)

        # Backward pass and optimization
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    print(f"Epoch [{epoch + 1}/{num_epochs}], Loss: {loss.item():.4f}")

    # Save the model
    torch.save(model.state_dict(), f"experiment/model_epoch_{epoch + 1}.pth")

print("Training completed.")
```
