speech-integration

Research implementation to investigate methods of integrating the speech modality into pre-trained language models

Setup

Clone Repository

git clone [email protected]:anilkeshwani/speech-integration.git &&
    cd speech-integration &&
    git submodule update --init --recursive --progress # future-proof

Set Up Environment

Ensure the necessary binary requirements are installed:

apt install sox ffmpeg

Install the package including development dependencies:

conda create -n ssi python=3.10.6 -y &&
    conda activate ssi &&
    pip install -e .["dev"] &&
    pip install --no-dependencies git+https://github.com/anilkeshwani/speech-text-alignment.git

Get shell completions for the configurations from Hydra for the duration of the Bash session by running:

eval "$(python ssi/train.py -sc install=bash)"

If you want to use pre-commit remember to install hooks:

pre-commit install --install-hooks

Supervised Fine-tuning

Run supervised fine-tuning for ASR with:

python ssi/train.py \
    checkpointer.config_json='/mnt/scratch-artemis/anilkeshwani/experiments/Llama-3.2-1B-5000-dsus-sft/avid-wind-123-id_xoafid42/checkpoints/config.json' \
    checkpointer.checkpoint_dir='/mnt/scratch-artemis/anilkeshwani/experiments/Llama-3.2-1B-5000-dsus-sft/avid-wind-123-id_xoafid42/checkpoints/global-step-006500' \
    checkpointer.checkpoint_files='["hf_model_0001_0.pt"]' # slightly weird syntax

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
scripts		scripts
ssi		ssi
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech-integration

Setup

Clone Repository

Set Up Environment

Supervised Fine-tuning

About

Releases

Packages

Languages

License

anilkeshwani/speech-integration

Folders and files

Latest commit

History

Repository files navigation

speech-integration

Setup

Clone Repository

Set Up Environment

Supervised Fine-tuning

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages