Research implementation to investigate methods of integrating the speech modality into pre-trained language models

anilkeshwani/speech-integration

speech-integration

Setup

Clone Repository

git clone git@github.com:anilkeshwani/speech-integration.git &&
    cd speech-integration &&
    git submodule update --init --recursive --progress # future-proof

Set Up Environment

Ensure the necessary binary requirements are installed:

apt install sox ffmpeg
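
To confirm that both tools are on your PATH before continuing, something like the following can help. This is a minimal sketch: `check_bins` is a hypothetical helper defined here for illustration, not part of this repository.

```shell
# Hypothetical helper: report which of the named binaries are on PATH.
# Returns 0 if all were found, 1 if any is missing.
check_bins() {
    missing=0
    for bin in "$@"; do
        if command -v "$bin" >/dev/null 2>&1; then
            echo "found: $bin"
        else
            echo "missing: $bin" >&2
            missing=1
        fi
    done
    return "$missing"
}

check_bins sox ffmpeg || echo "install the missing tools before continuing"
```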

Install the package including development dependencies:

conda create -n ssi python=3.10.6 -y &&
    conda activate ssi &&
    pip install -e ".[dev]" &&
    pip install --no-dependencies git+https://github.com/anilkeshwani/speech-text-alignment.git

Enable Hydra tab completion for the configuration options in the current Bash session by running:

eval "$(python ssi/train.py -sc install=bash)"

If you want to use pre-commit, remember to install the hooks:

pre-commit install --install-hooks

Supervised Fine-tuning

Run supervised fine-tuning for ASR with:

python ssi/train.py \
    checkpointer.config_json='/mnt/scratch-artemis/anilkeshwani/experiments/Llama-3.2-1B-5000-dsus-sft/avid-wind-123-id_xoafid42/checkpoints/config.json' \
    checkpointer.checkpoint_dir='/mnt/scratch-artemis/anilkeshwani/experiments/Llama-3.2-1B-5000-dsus-sft/avid-wind-123-id_xoafid42/checkpoints/global-step-006500' \
    checkpointer.checkpoint_files='["hf_model_0001_0.pt"]' # Hydra list override; single-quote it so the shell leaves the brackets and inner quotes intact
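
The quoting of `checkpointer.checkpoint_files` is easier to follow once you see what the shell actually passes on. The sketch below uses a hypothetical helper, `show_args`, that prints each argument exactly as a program would receive it. The second file name in the last call is made up for illustration.

```shell
# Hypothetical helper: print each argument on its own line, as received.
show_args() {
    printf '%s\n' "$@"
}

# Single quotes stop the shell from touching the brackets and inner quotes.
show_args checkpointer.checkpoint_files='["hf_model_0001_0.pt"]'
# prints: checkpointer.checkpoint_files=["hf_model_0001_0.pt"]

# Without the outer single quotes, the shell strips the double quotes.
show_args checkpointer.checkpoint_files=["hf_model_0001_0.pt"]
# prints: checkpointer.checkpoint_files=[hf_model_0001_0.pt]
# (assuming no file in the current directory matches the bracket glob)

# A list with several files follows the same pattern (second name is made up).
show_args checkpointer.checkpoint_files='["hf_model_0001_0.pt","hf_model_0002_0.pt"]'
```

Hydra then parses the bracketed value as a list, so the single-quoted form is the one that reaches it unchanged.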
