Part of Speech Tagging Transformer from Scratch

This repo contains an implementation of an encoder-only transformer model for part-of-speech tagging. We have implemented it from scratch in both MATLAB and PyTorch (the PyTorch version will be added soon). The most important part of the code is the from-scratch implementation of backpropagation through the transformer.
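To give a feel for what "backpropagation from scratch" involves, here is a minimal sketch of the forward and backward pass through a single scaled dot-product attention head. It is not the code from this repo: it is written in plain Python with nested lists so that it runs without dependencies, it omits masking, multiple heads, and the learned projections, and all function names (`attention_forward`, `attention_backward`, `numeric_grad`) are made up for this illustration. The small matrices at the bottom are arbitrary example values.

```python
import math

def matmul(A, B):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def softmax_rows(S):
    """Row-wise softmax, shifted by the row max for numerical stability."""
    out = []
    for row in S:
        m = max(row)
        exps = [math.exp(v - m) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def attention_forward(Q, K, V):
    """O = softmax(Q K^T / sqrt(d)) V. Returns O and the attention weights A."""
    scale = 1.0 / math.sqrt(len(Q[0]))
    S = [[scale * s for s in row] for row in matmul(Q, transpose(K))]
    A = softmax_rows(S)
    return matmul(A, V), A

def attention_backward(dO, Q, K, V, A):
    """Given dL/dO, return dL/dQ, dL/dK, dL/dV."""
    scale = 1.0 / math.sqrt(len(Q[0]))
    dV = matmul(transpose(A), dO)          # O = A V   =>  dV = A^T dO
    dA = matmul(dO, transpose(V))          #               dA = dO V^T
    # Softmax backward, row by row: dS_i = A_i * (dA_i - <dA_i, A_i>)
    dS = []
    for a_row, g_row in zip(A, dA):
        dot = sum(a * g for a, g in zip(a_row, g_row))
        dS.append([a * (g - dot) for a, g in zip(a_row, g_row)])
    # S = scale * Q K^T  =>  dQ = scale * dS K,  dK = scale * dS^T Q
    dQ = [[scale * v for v in row] for row in matmul(dS, K)]
    dK = [[scale * v for v in row] for row in matmul(transpose(dS), Q)]
    return dQ, dK, dV

def loss(Q, K, V, dO):
    """Scalar test loss sum(O * dO), whose gradient w.r.t. O is exactly dO."""
    O, _ = attention_forward(Q, K, V)
    return sum(o * g for o_row, g_row in zip(O, dO) for o, g in zip(o_row, g_row))

def numeric_grad(M, i, j, Q, K, V, dO, eps=1e-6):
    """Central finite difference of the loss w.r.t. M[i][j]; M must be Q, K, or V itself."""
    old = M[i][j]
    M[i][j] = old + eps
    up = loss(Q, K, V, dO)
    M[i][j] = old - eps
    down = loss(Q, K, V, dO)
    M[i][j] = old
    return (up - down) / (2 * eps)

# Arbitrary example: 3 tokens, head dimension 2.
Q = [[0.1, -0.2], [0.4, 0.3], [-0.5, 0.2]]
K = [[0.3, 0.1], [-0.2, 0.5], [0.0, -0.4]]
V = [[1.0, 0.0], [0.5, -1.0], [-0.3, 0.8]]
dO = [[1.0, -1.0], [0.5, 0.2], [-0.7, 0.3]]

O, A = attention_forward(Q, K, V)
dQ, dK, dV = attention_backward(dO, Q, K, V, A)
```

Comparing entries of `dQ`, `dK`, and `dV` against `numeric_grad` is a cheap way to catch sign or transpose mistakes in a hand-written backward pass before trusting it in training.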

Dataset

To run POS tagging on the CoNLL-2003 dataset, first download the data:

We use word2vec word embeddings, which you can download from here:

word vectors
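For orientation, the English CoNLL-2003 files are plain text with one token per line, whitespace-separated columns (word, POS tag, chunk tag, NER tag), blank lines between sentences, and `-DOCSTART-` lines between documents. The sketch below shows one way to pull out the word and POS columns in Python. It is an illustration only and not what `prep_data.m` does; the function name `read_conll_pos` and the inline sample are assumptions made for this example.

```python
def read_conll_pos(lines):
    """Yield (tokens, pos_tags) for each sentence in CoNLL-2003 formatted lines."""
    tokens, tags = [], []
    for line in lines:
        line = line.strip()
        # Blank lines end a sentence; -DOCSTART- lines mark document boundaries.
        if not line or line.startswith("-DOCSTART-"):
            if tokens:
                yield tokens, tags
                tokens, tags = [], []
            continue
        fields = line.split()
        tokens.append(fields[0])   # column 1: the word
        tags.append(fields[1])     # column 2: the POS tag
    if tokens:                     # flush the last sentence if the file has no trailing blank line
        yield tokens, tags

# Small illustrative sample in the CoNLL-2003 layout.
sample = """-DOCSTART- -X- -X- O

EU NNP I-NP I-ORG
rejects VBZ I-VP O
German JJ I-NP I-MISC
call NN I-NP O

Peter NNP I-NP I-PER
Blackburn NNP I-NP I-PER
""".splitlines()

sentences = list(read_conll_pos(sample))
```

With a real file, the same generator can be fed an open file handle, for example `read_conll_pos(open(path, encoding="utf-8"))`, where `path` points to wherever you saved the downloaded data.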

Results

Number of parameters: 202,351

Training accuracy: 93.59%

Testing accuracy: 89.62%

