Parallel Claude Code sessions talking to different models via Peering

If I use two or more claude code sessions in parallel and re-route via peering to other providers, e.g. Anthropic and Z.AI, I think the llama-swap endpoint doesn't handle these multiple requests very well. Does this ring any bells, is there any current known limitation in llama-swap?

For example, if I had a huge tool call list and which would make llama-swap (probably) block on prompt processing, do other requests go through?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel Claude Code sessions talking to different models via Peering #483

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Parallel Claude Code sessions talking to different models via Peering #483

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions