Draft: AI Policies #120

Wants to merge 27 commits into base: main.
Conversation

@jasonmadigan (Member) commented Apr 9, 2025:

Re: #118

Not ready for review yet.

jasonmadigan and others added 20 commits April 8, 2025 17:02

### `LLMPromptRiskCheckPolicy`

A Kuadrant `LLMPromptRiskCheckPolicy` is a custom resource that targets Gateway API resources (`Gateway` and `HTTPRoute`), enabling users to define and enforce content safety rules for LLM prompts, detecting and blocking sensitive or risky prompts. Prompt guards can be defined and enforced for both Gateways and individual HTTPRoutes.
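
For illustration, a minimal sketch of how such a policy might attach to an `HTTPRoute` (the `spec` fields below are assumptions for discussion, not an agreed API):

```yaml
apiVersion: kuadrant.io/v1alpha1
kind: LLMPromptRiskCheckPolicy
metadata:
  name: prompt-guard
spec:
  # Standard Gateway API policy attachment
  targetRef:
    group: gateway.networking.k8s.io
    kind: HTTPRoute
    name: llm-api
  # Hypothetical fields: which risk categories to screen prompts for,
  # and what to do when a risky prompt is detected
  categories:
    - violence
    - self-harm
  onDetection: reject
```
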
Collaborator:

Would be interesting to get some feedback from an SME about what they want checked and where. In my naive view I can see the gateway being a good place to check the prompt. But I wonder about the response: why would you not want that done before sending the response over the network?

Member:

It may not be possible to avoid network traffic, like in the case of streaming the response.
You may end up streaming/chunking the response and checking it all, or in parts.



### `LLMResponseRiskCheckPolicy`
Collaborator:

My instinct here, as hinted at above, is to start with the prompt check and seek more opinion and feedback on the suitability of checking this at the gateway. Maybe it is still desirable to have a "last line of defence" type policy in place?

Member:

Depending on how you use it, it could be a last line of defence.

It could also be a targeted response check based on a specific group of users, for example, since you have access to the user's auth context. That capability may not be easily available in the model serving runtime without custom logic.

```yaml
counter: auth.identity.userid
---
apiVersion: kuadrant.io/v1alpha1
kind: LLMPromptRiskCheckPolicy
```
Collaborator:

I wonder, could these be in the same API, such as an `LLMContentPolicy`, but split underneath by the kuadrant operator, as they are very similar? Or do you expect the model being used to change based on the policy type?

```yaml
spec:
  model: ....
  llmprompt:
    categories:
       ....
  llmresponse:
    categories:
       ....
```

Member:

They certainly could be in the same API.
My sole reason at the time for splitting them was usability.
That is, writing/configuring each part separately in a sizable block.

There may be a more concrete case for keeping them split.
Thoughts on whether you'd want to apply a prompt check to a different group of users than a risk check?
Although not shown in this example, it could be useful to have a predicate for which users the policy applies to (based on auth headers etc.).
If combined into 1 policy, it would mean having multiple predicate fields.
Do we have a precedent for this with existing policies?

Collaborator:

Yes, RLP (RateLimitPolicy) has that:

```yaml
limits:
  "alice-limit":
    rates:
    - limit: 5
      window: 10s
    when:
    - predicate: "auth.identity.userid == 'alice'"
  "bob-limit":
    rates:
    - limit: 2
      window: 10s
    when:
    - predicate: "auth.identity.userid == 'bob'"
```

@maleck13 (Collaborator) commented Apr 17, 2025:

So perhaps it's a use case like:

```yaml
spec:
  model: ....
  llmprompt:
    - "under18":
        categories:
          ....
        when:
          - predicate: "auth.identity.age < 18"
    - "over18":
        categories:
          ....
        when:
          - predicate: "auth.identity.age >= 18"
          - predicate: "request.model == 'educational'"
  llmresponse:
    - "under18":
```

This is just pseudo-code but might be useful to think about.

Member:

nit: `request.model` seems to want to "enhance" the request struct with arbitrary fields. I'd advise against that. When is that model field present? I don't want users to ask themselves these questions. I understand this is just for illustration purposes, but it nonetheless raises an interesting question: where would "additional policies" append data to the "well-known attributes"?

Collaborator:

Yes, where we append this data is an important consideration. Perhaps we need a new namespace for AI metadata, e.g. `ai.model`.

- Either:
  - Extend our existing `wasm-shim` to optionally amend the existing `actionSet` to call both the guard filter and the token-parsing filter implementation, or
  - Create a new `ext_proc` gRPC service for parsing OpenAI-style usage metrics and adding these as well-known dynamic metadata, for use by Limitador
- Extend the wasm-shim and `RateLimitPolicy` to provide a means to specify an increment (currently [hard-coded](https://github.com/Kuadrant/wasm-shim/blob/main/src/service/rate_limit.rs#L18) to `1`)
Collaborator:

I don't know if this is needed in the RLP API; it seems a very specific use case. I wonder instead if it could be made some form of internal config, not exposed to the user at this point?

Member Author:

That is an option, yes: we either increment the counter in a custom fashion somewhere in this filter chain, or we extend RLP somehow to support it. If there's general utility in RLP, I guess that route may be preferable.

Slight worry about having another mechanism other than Limitador doing it - we'd end up needing to re-implement/copy a bunch of existing machinery? Unsure.

Collaborator:

Hmm, I might be misunderstanding. I was instead thinking of using the `hits_addend` and setting it dynamically if it is an AI interaction. We need @eguzki or @alexsnaps here - interested to know what their thoughts are on how to send a custom increment.

Member:

While this isn't implemented afaik, well-known attributes were meant to support this. So one way would be to have `ratelimit.hits_addend` (while defaulting to 1) be mutable by "upstream" actions, so that a policy could set it to some arbitrary value before the request to Limitador is made.
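
For illustration only, a sketch of that idea (nothing below is an existing field or attribute path; it is purely an assumption of how the pieces could fit together):

```yaml
# Hypothetical: a limit whose increment is driven by parsed token usage rather
# than the default of 1.
limits:
  "tokens-per-user":
    rates:
    - limit: 50000
      window: 1d
    when:
    - predicate: "auth.identity.userid != ''"
# Assumed mechanism: an upstream action (wasm-shim or ext_proc) parses the usage
# metrics from the response and sets the well-known attribute
# `ratelimit.hits_addend` (e.g. to total_tokens) before Limitador is called.
```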

Member Author:

that'd do nicely

Member:

And, yes! 🙌 `service.ext_proc.v3.ProcessingResponse` supports dynamic metadata to have data flow between Envoy and these processes. If the wasm-shim dispatches the call, then no worries in using that for this. If Envoy does, we have to check what happens to it and how/if we can read it back properly from wasm, though (should be fine™ - t&c apply).


### Parsing OpenAI-style usage metrics

OpenAI-style usage metrics for both the completions and responses APIs generally include a `usage` object, with values for `prompt_tokens` (token count for the initial prompt), `completion_tokens` (tokens generated by the model in response) and `total_tokens` (prompt + response token count).
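
For illustration, the `usage` object in a (non-streamed) chat completion response typically looks like the following (shown as YAML for consistency with the other examples; actual responses are JSON, and the values are made up):

```yaml
usage:
  prompt_tokens: 42       # tokens in the request prompt
  completion_tokens: 128  # tokens generated by the model in the response
  total_tokens: 170       # prompt_tokens + completion_tokens
```
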
Collaborator:

Is there one of these that is more popular than the other? I am wondering whether it makes sense to support just one for now and expand beyond that in the future?

Member Author:

Assume this is re: the Chat Completions API vs the Responses API.

The Chat Completions API is more universally supported; the Responses API is newer, but is designed to work with more use-cases (agentic use-cases, as well as support for "reasoning/show my thoughts" response streaming).


Given the permutations, this will add some extra complexity to how we parse usage metrics. There is a basic Golang example of an `ext_proc` that can parse these metrics (non-streamed responses) here: https://github.com/jasonmadigan/token-ext-proc

We will also want to support llama-stack style responses. Inference chat-completion with llama-stack offers the option of a configurable (JSON-schema) guided `response_format`. This may hint that we'll want to offer some customisation in terms of where to look for metrics (probably CEL, or JQ-style querying?).
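
Purely as a sketch of what that customisation could look like (the field names and selector expressions below are assumptions, not a proposed API):

```yaml
# Hypothetical: per-runtime selectors describing where usage metrics live in the
# response body, so OpenAI-style, llama-stack and other variants could be supported.
tokenUsage:
  promptTokens: "responseBody.usage.prompt_tokens"
  completionTokens: "responseBody.usage.completion_tokens"
  totalTokens: "responseBody.usage.total_tokens"
```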
Collaborator:

Beyond these tokens, are there other reasons a user may want to pull something from the body? I think I would start with inferring and looking for these specific values rather than surfacing them into the API at this stage.

Collaborator:

As in, I would prefer the user to have to hint at the response types to expect, and we then use that to decide what values to pull out, rather than opening up the entire response body to the user to pull values out of.

Member Author:

The reason I think we may want to offer some sort of API here is that, although lots of runtimes offer OpenAI-style completion APIs, in the ones I've looked at there are some small differences (i.e. where in the JSON response the usage metrics are) which could break the policy. We could have some prebuilt "variants" to make the APIs look nice, though (perhaps starting with one for llama-stack and one for OpenAI-style) - these variants would come with built-in selectors on what attributes to pluck to get our usage metrics (if that makes sense).

Collaborator:

Yeah, like we could for now call out those as the supported options, rather than jumping straight to giving the power to the user, which we might regret and not be able to take back. That said, we are talking about alpha APIs, so it's easier to take back than in other places.

Member Author:

that makes sense. I suppose internally we'll probably use selectors, and then if we decide later, we can expose those to end users

@maleck13 (Collaborator) left a review comment:

This is a really great start to some cool features. I think we still need to nail down how we want to do the request filtering (seems to be leaning towards the WASM shim) and also whether we need to expose certain options to the user or not

@jasonmadigan (Member Author) commented:

One other potential policy which may emerge here, depending on how this PoC progresses, is a `SemanticCachingPolicy` for short-circuiting (well, partially - we'd still need to do the embedding) expensive LLM calls if we see similar prompts.
