Skip to content

Releases: runpod-workers/worker-sglang

2.0.2

24 Nov 15:38
cb4b196

Choose a tag to compare

  • chore(deps): update runpod to latest version

2.0.1

07 Nov 14:37
1b924c9

Choose a tag to compare

  • fix: allow more cuda versions; use proper name of the MODEL_NAME

2.0.0

03 Nov 15:38
78dcd24

Choose a tag to compare

  • feat: added tool_call_parser; updated sglang to 0.5.2
  • feat: use "MODEL_NAME" instead of "MODEL_PATH"

breaking changes

  • MODEL_PATHMODEL_NAME to use the same naming as worker-vllm
  • CUDA 12.6 minimum (was 12.4)

1.2.0

19 Aug 14:31
ec910bd

Choose a tag to compare

What's Changed

New Contributors

1.1.2

19 Aug 08:51

Choose a tag to compare

  • docs: added better examples for users to use sglang

1.1.1

18 Aug 08:10

Choose a tag to compare

  • fix: use proper type for HF_TOKEN

1.1.0

14 Aug 20:36

Choose a tag to compare

  • feat: added HF_TOKEN support in the runpod hub

1.0.0

31 Jul 09:36

Choose a tag to compare

This release adds this worker into the hub and is from now on using the official docker images from sglang instead of installing sglang manually.

What's Changed

New Contributors

v0.2.0

02 Jan 19:32
28b54c5

Choose a tag to compare

  • Sglang Version update from 0.3.3-> v0.4.1.post3

v0.1.0

09 Oct 22:41
c5c90d7

Choose a tag to compare

Initial Sglang worker release.

  • Pre-built docker images.
  • Engine configuration through the Environment variables.