Skip to content

Commit 338e947

Browse files
committed
Merge branch 'dynamic-loading' of github.com:fastmachinelearning/SuperSONIC into dynamic-loading
2 parents 03fda78 + aa6129f commit 338e947

File tree

2 files changed

+20
-6
lines changed

2 files changed

+20
-6
lines changed

docs/.values-table.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,9 +34,11 @@
3434
| envoy.rate_limiter.listener_level.max_tokens | int | `5` | Maximum number of simultaneous connections to the Envoy Proxy. Each new connection takes a "token" from the "bucket" which initially contains ``max_tokens`` tokens. |
3535
| envoy.rate_limiter.listener_level.tokens_per_fill | int | `1` | ``tokens_per_fill`` tokens are added to the "bucket" every ``fill_interval``, allowing new connections to be established. |
3636
| envoy.rate_limiter.listener_level.fill_interval | string | `"12s"` | For example, adding a new token every 12 seconds allows 5 new connections every minute. |
37-
| envoy.rate_limiter.prometheus_based | object | `{"enabled":false,"luaConfig":"cfg/envoy-filter.lua"}` | This rate limiter rejects new connections based on metric extracted from Prometheus (e.g. inference queue latency). The metric is taken from parameter ``prometheus.serverLoadMetric``, and the threshold is set by ``prometheus.serverLoadThreshold``. These parameters are the same as those used by the KEDA autoscaler. |
37+
| envoy.rate_limiter.prometheus_based | object | `{"enabled":false}` | This rate limiter rejects new connections based on metric extracted from Prometheus (e.g. inference queue latency). The metric is taken from parameter ``prometheus.serverLoadMetric``, and the threshold is set by ``prometheus.serverLoadThreshold``. These parameters are the same as those used by the KEDA autoscaler. |
3838
| envoy.rate_limiter.prometheus_based.enabled | bool | `false` | Enable rate limiter |
3939
| envoy.loadBalancerPolicy | string | `"LEAST_REQUEST"` | Envoy load balancer policy. Options: ROUND_ROBIN, LEAST_REQUEST, RING_HASH, RANDOM, MAGLEV |
40+
| envoy.lua_filter.enabled | bool | `false` | |
41+
| envoy.lua_filter.lua_config | string | `"cfg/envoy-filter.lua"` | |
4042
| envoy.dynamic_routing | object | `{"enabled":false}` | Enable dynamic routing in Envoy proxy. |
4143
| envoy.auth.enabled | bool | `false` | Enable authentication in Envoy proxy |
4244
| envoy.auth.jwt_issuer | string | `""` | |

helm/supersonic/values.schema.json

Lines changed: 17 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -370,14 +370,10 @@
370370
"properties": {
371371
"enabled": {
372372
"type": "boolean"
373-
},
374-
"luaConfig": {
375-
"type": "string"
376373
}
377374
},
378375
"required": [
379-
"enabled",
380-
"luaConfig"
376+
"enabled"
381377
]
382378
}
383379
},
@@ -389,6 +385,21 @@
389385
"loadBalancerPolicy": {
390386
"type": "string"
391387
},
388+
"lua_filter": {
389+
"type": "object",
390+
"properties": {
391+
"enabled": {
392+
"type": "boolean"
393+
},
394+
"lua_config": {
395+
"type": "string"
396+
}
397+
},
398+
"required": [
399+
"enabled",
400+
"lua_config"
401+
]
402+
},
392403
"dynamic_routing": {
393404
"type": "object",
394405
"properties": {
@@ -441,6 +452,7 @@
441452
"image",
442453
"ingress",
443454
"loadBalancerPolicy",
455+
"lua_filter",
444456
"rate_limiter",
445457
"replicas",
446458
"resources",

0 commit comments

Comments
 (0)