This is the source code of our solution to the Facebook AI Hateful Memes Challenge.
- Docker >= 19.03
- nvidia-container-toolkit
NOTE: Make sure you follow Docker's post-installation guide (manage Docker as a non-root user), so the shell scripts can run docker without sudo.
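For reference, the standard post-installation steps plus a quick check that containers can see the GPUs look roughly like this (the CUDA image tag below is only an example; any CUDA 11.0 image will do):

```bash
# Allow the current user to run docker without sudo (Docker post-installation steps)
sudo groupadd docker || true       # the group may already exist
sudo usermod -aG docker "$USER"
newgrp docker                      # or log out and log back in

# Sanity check: the NVIDIA container toolkit should expose the GPUs inside a container
docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi
```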
The original experiments were conducted on a GCP n1-highmem-16 instance initialized with the TensorFlow 2.3/Keras CUDA 11.0 GPU GCE image:
- OS: Ubuntu 18.04.5 LTS
- CPU: 16 vCPUs (Intel)
- Memory: 104 GB
- GPU: 4x NVIDIA T4
- Disk: 500 GB HDD
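For reference, a comparable instance can be created with gcloud roughly as follows; the instance name, zone, and Deep Learning VM image family below are assumptions and should be adapted to your project:

```bash
# Sketch of creating a comparable GCE instance (name, zone, and image family are placeholders)
gcloud compute instances create hateful-memes-train \
    --zone=us-central1-a \
    --machine-type=n1-highmem-16 \
    --accelerator=type=nvidia-tesla-t4,count=4 \
    --image-family=tf2-2-3-cu110 \
    --image-project=deeplearning-platform-release \
    --boot-disk-size=500GB \
    --maintenance-policy=TERMINATE \
    --metadata=install-nvidia-driver=True
```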
Most of the data preprocessing and model training can be done with a single T4 GPU, except VL-BERT, which needs 4 GPUs to reach a large enough batch size when fine-tuning Faster R-CNN and BERT together.
NOTE: All models in this project are trained with fp16 acceleration. Please use GPUs that support NVIDIA AMP (automatic mixed precision).
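A quick way to confirm that the expected GPUs are visible (the T4's Turing architecture provides the Tensor Cores that AMP relies on):

```bash
# Expect 4x Tesla T4 on the reference instance (at least 1 for most steps)
nvidia-smi -L
```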
The pipeline consists of the following steps:
- Preprocess the data and extract additional features. See detailed instructions in data_utils/README.
- Train the modified VL-BERT models (two large and one base). See detailed instructions in VL-BERT/README.
- Train UNITER-ITM (one large and one base) and VILLA-ITM (one large and one base). See detailed instructions in UNITER/README.
- Train ERNIE-Vil (one large and one base). See detailed instructions in ERNIE-VIL/README.
- Ensemble by averaging the predictions of all models, then apply a simple rule-based racism detector on top of it:
bash run_ensemble.sh
This script lets you select which models' predictions are taken into the ensemble. It outputs ROOT/test_set_ensemble.csv as the final result and copies all the CSV files used in the ensemble to the ROOT/test_set_csvs folder.
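For example, once the script finishes you can sanity-check its outputs (ROOT here is assumed to be the repository root):

```bash
# Hypothetical quick look at the ensemble outputs produced by run_ensemble.sh
ls test_set_csvs/                  # per-model prediction CSVs used in the ensemble
head -n 5 test_set_ensemble.csv    # final ensembled predictions
```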