The goal of this assignment is to train and deploy an image caption generation model. The code for deployment can be found here.
- Loss Function: NLLLoss
- BLEU score: 14.0
- Epochs: 120
- Encoder: Pre-trained ResNet-18 on ImageNet dataset
- Decoder Learning Rate: 4e-4
- Optimizer: Adam
- Batch Size: 32
- Embedding dimension: 128
- Attention dimension: 128
- Decoder dimension: 128
- Dropout: 0.5
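The settings above can be collected into a single configuration dict. This is a minimal sketch; the key names are illustrative and the original code may use different identifiers.

```python
# Hyperparameters from the list above. Key names are assumptions,
# not taken from the project's actual code.
config = {
    "epochs": 120,
    "batch_size": 32,
    "decoder_lr": 4e-4,   # decoder learning rate for the Adam optimizer
    "embed_dim": 128,     # word embedding dimension
    "attention_dim": 128, # attention network dimension
    "decoder_dim": 128,   # decoder hidden-state dimension
    "dropout": 0.5,
}
```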
| Input Image | Output Caption |
|---|---|
| *(image not shown)* | a man in a wetsuit is surfing on a surfboard |
| *(image not shown)* | a group of people sit on a snowy mountain |
| *(image not shown)* | a young boy in a red shirt is riding on a tire swing |
We used an encoder-decoder architecture. The encoder is an 18-layer residual network (ResNet-18) pre-trained on the ImageNet classification task; its layers are not fine-tuned. The decoder incorporates an attention mechanism, which lets it look at different parts of the image when predicting each word of the caption.
A neural network can be viewed as a simplified attempt to mimic how the human brain works. The attention mechanism follows the same idea: it lets a deep network selectively concentrate on a few relevant things while ignoring the rest.
An attention mechanism allows the model to focus on the currently most relevant part of its input at each decoding step. In this project we implemented the additive attention of Bahdanau et al.
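A minimal sketch of additive (Bahdanau-style) attention in PyTorch: each spatial location of the encoder output is scored against the decoder hidden state through a small feed-forward network, and the softmax-normalized scores weight the image features into a context vector. The module and argument names are illustrative; dimensions follow the settings listed above.

```python
import torch
import torch.nn as nn

class AdditiveAttention(nn.Module):
    """Additive (Bahdanau-style) attention; a sketch, not the project's exact code."""
    def __init__(self, encoder_dim, decoder_dim, attention_dim):
        super().__init__()
        self.enc_proj = nn.Linear(encoder_dim, attention_dim)  # project image features
        self.dec_proj = nn.Linear(decoder_dim, attention_dim)  # project hidden state
        self.score = nn.Linear(attention_dim, 1)               # scalar score per location

    def forward(self, encoder_out, decoder_hidden):
        # encoder_out: (B, num_pixels, encoder_dim); decoder_hidden: (B, decoder_dim)
        att = self.score(torch.tanh(
            self.enc_proj(encoder_out) + self.dec_proj(decoder_hidden).unsqueeze(1)
        )).squeeze(2)                             # (B, num_pixels)
        alpha = torch.softmax(att, dim=1)         # attention weights sum to 1
        context = (encoder_out * alpha.unsqueeze(2)).sum(dim=1)  # (B, encoder_dim)
        return context, alpha

# 7x7 ResNet-18 feature grid flattened to 49 locations of 512 channels
attn = AdditiveAttention(encoder_dim=512, decoder_dim=128, attention_dim=128)
context, alpha = attn(torch.randn(2, 49, 512), torch.randn(2, 128))
```

At each time step the decoder feeds the context vector, together with the previous word embedding, into its recurrent cell.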
Our implementation is adapted from the code in this repository.