Description
As an orchestrator user, I want the /chat/completions-detection endpoint to report how many prompt tokens were used even when input detections are found, so that I have prompt token counts for billing or informational purposes.
Discussion
The response should include usage.prompt_tokens (ref. https://platform.openai.com/docs/api-reference/chat/object#chat/object-usage). Currently, a separate tokenization call is made for text generation to obtain input token information, so the "tokenization" equivalent for chat completions may need to be investigated.
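As a rough illustration of the desired behavior, the sketch below builds a detection response that still carries an OpenAI-style usage object when input detections short-circuit generation. All field names other than usage.prompt_tokens (detections, detector_id, the builder function itself) are hypothetical, not the actual orchestrator API, and the token count is assumed to come from whatever tokenization mechanism is chosen:

```python
def build_detection_response(detections, prompt_tokens):
    """Hypothetical sketch: response for /chat/completions-detection when
    input detections are found. Generation is skipped, but prompt token
    usage is still reported for billing/informational purposes."""
    return {
        "choices": [],                       # no text was generated
        "detections": {"input": detections},
        "usage": {
            "prompt_tokens": prompt_tokens,  # from a tokenization call
            "completion_tokens": 0,          # nothing was generated
            "total_tokens": prompt_tokens,
        },
    }

resp = build_detection_response(
    detections=[{"detector_id": "example_detector", "score": 0.98}],
    prompt_tokens=42,
)
print(resp["usage"]["prompt_tokens"])  # → 42
```

The key point is that usage is populated even on the early-exit (detection) path, mirroring the usage object shape of the OpenAI chat completion object.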
Acceptance Criteria
- Unit tests cover new/changed code
- Examples build against new/changed code
- READMEs are updated
- Type of semantic version change is identified