@fishcakeday
  • Use CUDA 12.6 and Ubuntu 24.04
  • Use venv
  • Update to Faster Whisper 1.1.1
  • Remove all models other than large-v3 and distil-large-v3 to minimize container size
  • Add a batch_size parameter to allow faster parallel (batched) processing
  • Enable VAD by default
  • Add a multilingual switch to allow transcription of multilingual audio
  • Make large-v3 the default model
  • Fix schema data types to allow None for some string parameters
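
The schema-related changes above (new defaults, the added batch_size and multilingual parameters, and None being accepted for some string parameters) could be sketched roughly as follows. This is a minimal, self-contained illustration and not the worker's actual schema: the names `INPUT_SCHEMA`, `ALLOWED_MODELS`, `validate_input`, `enable_vad`, `language`, and `initial_prompt`, as well as the batch_size default of 16, are assumptions made for the example.

```python
from typing import Any

# Hypothetical schema: parameter name -> (accepted types, default value).
# The parameter names and the batch_size default are assumptions.
INPUT_SCHEMA: dict[str, tuple[tuple[type, ...], Any]] = {
    "model": ((str,), "large-v3"),           # large-v3 as the default model
    "batch_size": ((int,), 16),              # assumed default
    "enable_vad": ((bool,), True),           # VAD on by default
    "multilingual": ((bool,), False),
    "language": ((str, type(None)), None),   # string parameter that may be None
    "initial_prompt": ((str, type(None)), None),
}

# Only the two models kept in the container image.
ALLOWED_MODELS = {"large-v3", "distil-large-v3"}


def validate_input(job_input: dict[str, Any]) -> dict[str, Any]:
    """Fill in defaults and type-check a job input against INPUT_SCHEMA."""
    unknown = set(job_input) - set(INPUT_SCHEMA)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    validated: dict[str, Any] = {}
    for name, (types, default) in INPUT_SCHEMA.items():
        value = job_input.get(name, default)
        # bool is a subclass of int, so reject it explicitly for non-bool fields.
        if isinstance(value, bool) and bool not in types:
            raise TypeError(f"{name} must not be a bool")
        if not isinstance(value, types):
            raise TypeError(f"{name} has invalid type {type(value).__name__}")
        validated[name] = value

    if validated["model"] not in ALLOWED_MODELS:
        raise ValueError(f"unsupported model: {validated['model']}")
    if validated["batch_size"] < 1:
        raise ValueError("batch_size must be at least 1")
    return validated
```

With this sketch, `validate_input({"language": None})` passes and returns the defaults for every other parameter, while `validate_input({"model": "tiny"})` raises a `ValueError` because that model is no longer shipped.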

@ef0xa
ef0xa commented Feb 8, 2025

Thanks for your submission! I'm in the middle of reworking this significantly, so I'm not going to accept it as is, but I will definitely look this over and try to use the parts I can.
