Releases: llm-d-incubation/workload-variant-autoscaler

v0.0.3

04 Nov 18:26
13d1bdf

v0.0.3 Pre-release

What's Changed

  • Refactoring deployment scripts for Kubernetes and Kind by @WheelyMcBones in #226
  • Add Prometheus queries for average input tokens and TTFT by @vishakha-ramani in #194
  • Enhance Helm chart security configuration and add development values by @mamy-CS in #242
  • Refactoring deployment scripts and integrating llm-d-inference-sim in emulated deployment environment by @WheelyMcBones in #237

Full Changelog: v0.0.2...v0.0.3

v0.0.2

24 Oct 17:11
b829546

v0.0.2 Pre-release

What's Changed

  • Update crd docs by @asm582 in #184
  • Update README.md to include ref to new home for container image by @clubanderson in #183
  • Update ci-release.yaml to do multi arch by @clubanderson in #182
  • Removing unsupported architectures from image multi-arch build by @WheelyMcBones in #192
  • Setting minNumReplicas to 1 by default by @WheelyMcBones in #189
  • Update ci-release.yaml to include 'latest' tag by @clubanderson in #193
  • Update QuickStart documentation to disable Rosetta on Apple Silicon by @myechuri in #195
  • Fix zero divide in queue analyzer by @atantawi in #190
  • Add Helm chart for OCP, supporting files, and simplified instructions to install WVA, the Prometheus adapter, and llm-d by @clubanderson in #200
  • Remove 'oc adm policy' (replaced with a ClusterRoleBinding) and rollout commands that are no longer needed by @clubanderson in #207
  • Deployment script and README for WVA + llm-d on OpenShift by @mamy-CS in #203
  • Update files to consume llm-d model name and modelID by @clubanderson in #209
  • Fixing tolerance function by @WheelyMcBones in #213
  • Remove resource accounting logic and disable limited mode by @mamy-CS in #210
  • E2E tests on OpenShift using ShareGPT data by @mamy-CS in #201
  • remove epp from wva helm chart and instructions to delete it after wv… by @clubanderson in #215
  • Adding unit tests for the internal optimizer code by @WheelyMcBones in #204
  • Add metrics validation and health monitoring system with Kubernetes conditions by @mamy-CS in #214
  • refactor: Reorganize repository structure and documentation by @mamy-CS in #216
  • Changing OCP deployment script to use the Helm chart by @WheelyMcBones in #212
  • Documentation update by @mamy-CS in #223
  • Parameterizing OCP script by @WheelyMcBones in #218
  • Optimize VariantAutoscaling's owner setting by @learner0810 in #219
  • refactor: Externalize metric names and labels to constants package by @ev-shindin in #228
  • Enhancements to OC E2E Testing by @Vezio in #220
  • Helm Chart Refactoring by @Vezio in #222

Full Changelog: v0.0.1...v0.0.2

v0.0.1

23 Sep 18:18
9abd73c

v0.0.1 Pre-release