The PyTorch implementation of "MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media" (LREC-COLING 2024), together with the public release of the MNER-MI dataset.
MNER-MI stands for MNER with Multi-Image, i.e., the MNER task in multi-image scenarios.
Note: the dataset proposed in the paper is also named MNER-MI.
This repository contains the MNER-MI dataset proposed in the paper as well as an implementation of the TPM-MI framework.
Examples from MNER-MI are shown in the following figure:
The framework of TPM-MI is shown in the following figure:
We recommend the following steps to create the environment:
conda create -n TPM-MI python==3.8.16
conda activate TPM-MI
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 -f https://download.pytorch.org/whl/torch_stable.html
pip install transformers==4.29.0
pip install pytorch-crf==0.7.2
pip install tokenizers==0.13.3
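Optionally, you can verify the environment with a quick check (the expected version numbers below are the ones pinned above):

```python
import torch
import transformers

print(torch.__version__)          # expected: 1.7.1+cu110
print(torch.cuda.is_available())  # True if the CUDA 11.0 build can see a GPU
print(transformers.__version__)   # expected: 4.29.0
```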
In the paper, we introduce a dataset called MNER-MI and merge it with Twitter-2017 to obtain a dataset called MNER-MI-Plus.
Both datasets can be downloaded from Google Drive: https://drive.google.com/drive/folders/1Yk5pTei9vVjkKpoHxEfs-5DP48LmRR1Z?usp=drive_link
After downloading, unzip the files and place them in the Dataset folder; the file structure should look as follows:
Dataset
|-- text
| |-- MNER-MI_train.txt # The training set of MNER-MI.
| |-- MNER-MI_val.txt # The validation set of MNER-MI.
| |-- MNER-MI_test.txt # The test set of MNER-MI.
| |-- MNER-UNI_train.txt # The training set of MNER-MI-Plus.
| |-- MNER-UNI_val.txt # The validation set of MNER-MI-Plus.
| |-- MNER-UNI_test.txt # The test set of MNER-MI-Plus.
|-- images.zip -> images (folder)
|-- twitter2017_images.zip -> twitter2017_images (folder)
The shared Google Drive files include a folder named text and two zip files named images.zip and twitter2017_images.zip.
images.zip contains all the images from the MNER-MI dataset, while twitter2017_images.zip contains all the images from the Twitter-2017 dataset.
The format of each instance in the text files is as follows:
# MNER-MI dataset examples
{"id": 5302, "text": ["i", "have", "such", "a", "soft", "spot", "for", "jaebeom"], "images": ["5302_0.jpg", "5302_1.jpg"], "label": ["O", "O", "O", "O", "O", "O", "O", "B-PER"], "url": "https://twitter.com/i/web/status/1389292031688720386"}
# Twitter-2017 examples in MNER-MI-Plus
{"id": "twitter2017-O_538", "text": ["Ryan", "Newman", "NASCAR", "Endorses", "Donald", "Trump", "for", "President"], "images": ["twitter2017-O_538.jpg"], "label": ["B-PER", "I-PER", "B-ORG", "O", "B-PER", "I-PER", "O", "O"], "url": ""}
The instances from Twitter-2017 have been processed and converted into the same JSON format.
The id field is the data identifier, text contains the words of the sentence, images lists the images associated with the instance, label gives the label for each word/token in the text, and url is the original tweet link (not available for Twitter-2017 instances).
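For reference, a minimal loading sketch is shown below. It assumes each line of a split file is one JSON object, as the examples above suggest; the helper name and paths are illustrative and not part of the released code.

```python
import json
import os

DATA_DIR = "Dataset"  # assumed location, matching the structure above

def load_split(path):
    """Read a split file, assuming one JSON instance per line."""
    instances = []
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                instances.append(json.loads(line))
    return instances

train = load_split(os.path.join(DATA_DIR, "text", "MNER-MI_train.txt"))
example = train[0]
print(example["text"], example["label"], example["images"])
```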
In our paper, we use BERT and ViT as the text encoder and image encoder, respectively.
For the code implementation, we utilized the models and weights provided by Hugging Face.
Specifically, in lines 44 and 47 of run.py, we load the corresponding models (bert-base-uncased and ViT-B/16, i.e., vit-base-patch16-224), which are downloaded from the Hugging Face model repository to local folders.
The respective Hugging Face links are: https://huggingface.co/google-bert/bert-base-uncased and https://huggingface.co/google/vit-base-patch16-224.
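As an illustration only (the actual loading code lives around lines 44 and 47 of run.py and may differ), the two encoders can be loaded from the downloaded local folders roughly like this; the folder paths below are assumptions:

```python
from transformers import BertModel, BertTokenizerFast, ViTImageProcessor, ViTModel

# Assumed local folders holding the Hugging Face weights; adjust to your setup.
BERT_PATH = "./bert-base-uncased"
VIT_PATH = "./vit-base-patch16-224"

tokenizer = BertTokenizerFast.from_pretrained(BERT_PATH)       # text tokenizer
text_encoder = BertModel.from_pretrained(BERT_PATH)            # BERT text encoder

image_processor = ViTImageProcessor.from_pretrained(VIT_PATH)  # image preprocessing
image_encoder = ViTModel.from_pretrained(VIT_PATH)             # ViT-B/16 image encoder
```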
After you have prepared the required environment, data, and models, you can run python run.py to train a model for MNER-MI.
Running python run.py --dataset MI trains and tests on the MNER-MI dataset.
Running python run.py --dataset UNI trains and tests on the MNER-MI-Plus dataset.