Optimize Kyma Companion Workflow Latency


### Problem

Our **Kyma Companion** is significantly slower compared to other AI-based tools like **Perplexity**.
Looking at our current workflow:

```
_start → InitialSummarization → Gatekeeper → Supervisor 
        → { Common | KubernetesAgent | KymaAgent } → Finalizer → _end
```

We see that the **Supervisor** orchestrates multiple specialized agents (`Common`, `KubernetesAgent`, `KymaAgent`) 
At present, these calls run sequentially, which increases end-to-end latency.


### Proposal: Optimization Areas

1. **Parallelization of Agents**

   * Enable the invoke of `Common`, `KubernetesAgent`, and `KymaAgent` **concurrently** rather than sequentially.

2. **Streaming Responses**

   * Begin streaming tokens from `finalizer` back to the user.
   * Improves *perceived speed* significantly.

3. **Caching Strategies**

   * Cache frequent results from RAG agent (e.g., generic Kubernetes answers).

4. **Monitoring & Metrics**

   * Track key latency metrics:

     * **TPOT** (Time per Output Token)
     * Agent-level latency breakdowns (`Common`, `KubernetesAgent`, `KymaAgent`)

### Steps

* [ ] Benchmark current agent call timings across the workflow.
* [ ] Prototype **parallel agents execution** (async agent calls).
* [ ] Add **token streaming** from `finalizer`.
* [ ] Experiment with **caching** .
* [ ] Measure improvements against baseline latency.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize Kyma Companion Workflow Latency #804

Problem

Proposal: Optimization Areas

Steps

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Optimize Kyma Companion Workflow Latency #804

Description

Problem

Proposal: Optimization Areas

Steps

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions