Add latency info #7

Open
@clemlesne

Description

An extremely important parameter for UX in real-time scenarios is first-token latency for a given number of input tokens (e.g. 1k and 10k). It applies to STT, TTS and LLM models. This would be useful data to add.
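
For reference, a rough sketch of how this could be measured: time the gap between sending the request and receiving the first streamed chunk, once per input size. Everything here is an assumption for illustration. `first_token_latency` and `fake_stream` are hypothetical names, and `fake_stream` is only a stand-in for a real streaming STT, TTS or LLM client, with a made-up delay that grows with prompt length.

```python
import time
from typing import Callable, Iterable, Iterator


def first_token_latency(stream_fn: Callable[[str], Iterable[str]], prompt: str) -> float:
    """Return the seconds elapsed between issuing the request and the first chunk."""
    start = time.perf_counter()
    for _ in stream_fn(prompt):
        # Stop at the very first chunk; later chunks do not matter for this metric.
        return time.perf_counter() - start
    raise RuntimeError("stream produced no output")


def fake_stream(prompt: str) -> Iterator[str]:
    # Hypothetical stand-in for a streaming model: the delay before the
    # first chunk grows with the number of whitespace-separated tokens.
    time.sleep(0.00001 * len(prompt.split()))
    yield "first"
    yield "second"


if __name__ == "__main__":
    for n_tokens in (1_000, 10_000):
        prompt = " ".join(["tok"] * n_tokens)
        latency = first_token_latency(fake_stream, prompt)
        print(f"{n_tokens} input tokens: {latency * 1000:.1f} ms to first token")
```

With a real provider, `fake_stream` would be replaced by a call that yields chunks as they arrive, and the measurement should be repeated several times per input size, since a single run is noisy.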
