Releases: LlamaEdge/whisper-api-server
Releases · LlamaEdge/whisper-api-server
LlamaEdge-Whisper 0.3.2
Major changes:
-
New endpoints
GET /v1/files/{file_id}
: Retrieve information of a specific file by idDELETE /v1/files/{file_id}
: Remove a specific file by id
-
Upgrade to
llama-core v0.22.0
LlamaEdge-Whisper 0.3.1
Major changes:
-
New CLI options:
threads
: Number of threads to use during computation. Defaults to 4.processors
: Number of processors to use during computation. Defaults to 1.task
: Task type. Default tofull
. Possible values:transcribe
,translate
,full
port
: Port number. Defaults to8080
.
-
Support new fields of transcription requests
language
: The language of the input audio. Defaults toen
.temperature
: Sampling temperature, between 0 and 1. Defaults to 0.00.prompt
: Text to guide the model's style or continue a previous audio segment. Defaults tonone
max_len
: Maximum number of tokens that the model can generate in a single transcription segment (or chunk). Defaults to 0.split_on_word
: Split audio chunks on word rather than on token. Defaults to false.detect_language
: Automatically detect the spoken language in the provided audio input. Defaults to false.offset_time
: Time offset in milliseconds. Defaults to 0.duration
: Length of audio (in seconds) to be processed starting from the point defined by theoffset_time
field (or from the beginning by default). Defaults to 0.
-
Support new fields for translation requests
detect_language
: Automatically detect the spoken language in the provided audio input. Defaults to false.offset_time
: Time offset in milliseconds. Defaults to 0.duration
: Length of audio (in seconds) to be processed starting from the point defined by theoffset_time
field (or from the beginning by default). Defaults to 0.max_len
: Maximum number of tokens that the model can generate in a single transcription segment (or chunk). Defaults to 0.split_on_word
: Split audio chunks on word rather than on token. Defaults to false.
LlamaEdge-Whisper 0.3.0
Major change:
- Remove the support for
/v1/audio/speech
endpoint. The endpoint will be supported in the comingtts-api-server
.
LlamaEdge-Whisper 0.2.2
Major change:
- Migrate to
WasmEdge v0.14.1
LlamaEdge-Whisper 0.2.1
Major changes:
/v1/audio/translations
endpoint for translating audio into English text.- Integrate
wasi_nn-piper
plugin
LlamaEdge-Whisper 0.2.0
Major features:
/v1/audio/transcriptions
endpoint for transcribing audio into the input language.