
Simple PyTorch implementation of a vanilla RNN

PyTorch's RNN layer obscures important details of how an RNN works, while Karpathy's classic implementation in pure NumPy requires working through the math of backpropagation by hand to understand.

If you would like to understand Karpathy's code, Eli Bendersky provides an excellent explanation of the math it uses. He also provides an updated, better-commented version of Karpathy's original code here.

My implementation finds a middle ground: it relies on PyTorch's autograd to handle the backprop while keeping the low-level details of how an RNN works explicit. Most of the code is modified from here.
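
To make that concrete, here is a minimal sketch (not the repository's actual code) of what "autograd handles the backprop" means: the recurrence is written out with explicit weight matrices in Karpathy-style naming, and a single `loss.backward()` call replaces the hand-derived gradients. The sizes and dummy data below are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

hidden_size, vocab_size = 128, 65   # illustrative sizes, not the repo's settings

# Explicit RNN parameters (Karpathy-style names); autograd tracks all of them.
W_xh = (torch.randn(hidden_size, vocab_size) * 0.01).requires_grad_()
W_hh = (torch.randn(hidden_size, hidden_size) * 0.01).requires_grad_()
W_hy = (torch.randn(vocab_size, hidden_size) * 0.01).requires_grad_()
b_h = torch.zeros(hidden_size, requires_grad=True)
b_y = torch.zeros(vocab_size, requires_grad=True)

def rnn_step(x, h):
    """One vanilla RNN step: update the hidden state, emit output logits."""
    h = torch.tanh(W_xh @ x + W_hh @ h + b_h)
    y = W_hy @ h + b_y
    return h, y

# Unroll over a short dummy sequence of one-hot "characters".
inputs = torch.eye(vocab_size)[torch.randint(vocab_size, (10,))]
targets = torch.randint(vocab_size, (10,))
h = torch.zeros(hidden_size)
loss = torch.tensor(0.0)
for x, t in zip(inputs, targets):
    h, logits = rnn_step(x, h)
    loss = loss + F.cross_entropy(logits.unsqueeze(0), t.unsqueeze(0))

loss.backward()  # autograd computes the gradient for every parameter above
```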

The RNN is a minimal character-level language model that can train on any text; here it is trained on a sample of Shakespeare.
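
As a sketch of what "character-level" means here (the filename and snippet are illustrative assumptions, not the repository's actual code), the text is reduced to a vocabulary of individual characters and the model consumes a stream of integer character ids:

```python
# Illustrative sketch: build a character vocabulary from a plain-text corpus.
# "input.txt" is an assumed filename, not necessarily the one used in the repo.
with open("input.txt") as f:
    data = f.read()

chars = sorted(set(data))
vocab_size = len(chars)
char_to_ix = {ch: i for i, ch in enumerate(chars)}
ix_to_char = {i: ch for i, ch in enumerate(chars)}

# The model trains on this stream of character ids, predicting the next id.
encoded = [char_to_ix[ch] for ch in data]
print(f"{len(data)} characters, {vocab_size} unique")
```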

Comparisons

Loss for Karpathy's implementation:

[loss curve plot: Karpathy's implementation]

Loss for my implementation:

[loss curve plot: my implementation]
