Skip to content

MinhVuong2000/Youtube-Engagement-Deep-Multimodal-Fusion

Repository files navigation

Youtube Engagement Analytics via Deep Multimodal Model Fusion


The code is implemented for the paper Youtube Engagement Analytics via Deep Multimodal Model Fusionpublished at Pacific-Rim Symposium on Image and Video Technology 2022 (PSIVT2022),

Read the paper at here

How to run code:

  1. You can get data which is feature extraction at here.
  • Data input includes 3 files: entube_embedding_train.pt, entube_embedding_val.pt, entube_embedding_test.pt
  • Data in each file is a list with each item is a dictionary including keys:
'id': id of video on Youtube
'embedding_title':tensor which is feature extraction of title, has shape: (768,)
'embedding_tag':tensor which is feature extraction of tag, has shape: (768,)
'embedding_thumbnail':tensor which is feature extraction of thumbnail, has shape: (2560,)
'embedding_video':tensor which is feature extraction of the video, has shape: (2304,1,2,2)
'embedding_audio':tensor which is feature extraction of audio, has shape: (62, 128)
'label_1':tensor of label 1 which not use q-score
'label_2':tensor of label 2 which use q-score
  1. Clone this repo to your folder and change the current working directory into the folder
    cd <path/to/the/folder>
  • You can download and move above data folder like this Folder structure:
    project
    │   README.md
    │   main.py
    │   improved_model.py
    │   const.py
    │   early_stopping.py
    │   multihead_attention.py
    │   requirements.txt
    └───data
        │   entube_embedding_train.pt
        │   entube_embedding_val.pt
        │   entube_embedding_test.pt
    ...
    
  • If you using colab, you can mount drive, and change path of data in const.py file at variables: TRAIN_EMBED_PATH, VAL_EMBED_PATH, TEST_EMBED_PATH
  1. Install neccessary package \
python3.10 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
  1. The model when run will have log and checkpoints for each epoch, you can see or change path of them in const.py file
  2. Run file main.py
    python main.py

Additional Information

Crawl data

sh scripts/crawling.sh

Feature extraction

sh scripts/extract_feature.sh

Model training

sh scripts/model.sh

Citation and star

If you find this repository useful, please consider giving a star 🌟 and citation.

@inproceedings{nguyen2022youtube,
  title={Youtube Engagement Analytics via Deep Multimodal Fusion Model},
  author={Nguyen-Thi, Minh-Vuong and Le, Huy and Le, Truong and Le, Tung and Nguyen, Huy Tien},
  booktitle={Pacific-Rim Symposium on Image and Video Technology},
  pages={50--60},
  year={2022},
  organization={Springer}
}

and

@article{le2022entube,
  title={Entube: A dataset for youtube video engagement analytics},
  author={Le, Truong and Nguyen-Thi, Minh-Vuong and Le, Huy and Vo, Quoc-Thang and Le, Tung and Nguyen, Huy Tien},
  year={2022}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •