This branch is 9 commits ahead of, 4 commits behind mpc001/Lipreading_using_Temporal_Convolutional_Networks:master.
Pre-processing

  • To get mouth ROIs

Run the mouth cropping script to save grayscale mouth ROIs. We assume you save the cropped mouths to $TCN_LIPREADING_ROOT/datasets/visual_data/. The --testset-only flag is optional; pass it to produce only the test set.

python crop_mouth_from_video.py --video-direc <LRW-DIREC> \
                                --landmark-direc <LANDMARK-DIREC> \
                                --save-direc <MOUTH-ROIS-DIRECTORY> \
                                --convert-gray \
                                --testset-only
  • To get audio waveforms

Run the format conversion script to extract audio waveforms (.npz) from the raw videos. We assume you save the audio waveforms to $TCN_LIPREADING_ROOT/datasets/audio_data/. The --testset-only flag is optional; pass it to produce only the test set.

python extract_audio_from_video.py --video-direc <LRW-DIREC> \
                                   --save-direc <AUDIO-WAVEFORMS-DIRECTORY> \
                                   --testset-only
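
After either script finishes, it can help to check what was actually written before starting training. The sketch below is not part of this repository: `count_by_extension` and `npz_array_names` are hypothetical helper names. It assumes only that the outputs live under the two directories mentioned above and that the audio files are standard `.npz` archives (a zip of `.npy` members), as described in this section. The output format of the mouth ROIs is not stated here, so the first helper simply reports whichever extensions it finds.

```python
import os
import zipfile
from collections import Counter
from pathlib import Path


def count_by_extension(root):
    """Count files under `root` (recursively), grouped by file extension."""
    root = Path(root)
    if not root.is_dir():
        return Counter()
    return Counter(p.suffix for p in root.rglob("*") if p.is_file())


def npz_array_names(path):
    """List the array names stored in a .npz file without loading the data.

    A .npz file is a zip archive holding one .npy member per array.
    """
    with zipfile.ZipFile(path) as archive:
        return [n[:-4] if n.endswith(".npy") else n for n in archive.namelist()]


if __name__ == "__main__":
    # Directory layout assumed from the text above; adjust to your setup.
    base = Path(os.environ.get("TCN_LIPREADING_ROOT", ".")) / "datasets"
    for sub in ("visual_data", "audio_data"):
        print(sub, dict(count_by_extension(base / sub)))

    # Show the array names of the first audio file found, if any.
    first = next((base / "audio_data").rglob("*.npz"), None)
    if first is not None:
        print(first, npz_array_names(first))
```

Once the array names are known, `numpy.load(path)[name]` returns the corresponding array. If the file counts are lower than expected, check whether the run used --testset-only.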