Skip to content

Conversation

@sorenlouv
Copy link
Member

@sorenlouv sorenlouv commented Sep 12, 2025

Closes https://github.com/elastic/search-team/issues/10970

This PR adds two API tests for the Agent Builder converse endpoint:

  • Simple conversation test: Verifies basic conversation functionality, including title generation and response handling
  • ESQL query test: Tests complex tool-calling flows including esql generation, searching ES and returning a structured tool result to the LLM

Instead of calling a real LLM these tests call a simulated LLM (called "LLM Proxy"). It's a lightweight HTTP server that simulates LLM responses. The proxy:

  • Acts as a drop-in replacement for an actual LLM
  • Intercepts requests to the LLM and returns predefined responses based on pattern matching
  • Supports complex conversation flows including tool calls
  • Enables testing the full chain of interactions between Agent Builder and the LLM

This approach allows us to test the integration between Agent Builder and LLMs in a deterministic, cheap and fast way.

@sorenlouv sorenlouv force-pushed the one-chat-api-test branch 2 times, most recently from bd2855f to 5de7dc4 Compare September 12, 2025 22:04
@sorenlouv sorenlouv marked this pull request as ready for review September 15, 2025 11:20
@sorenlouv sorenlouv requested review from a team as code owners September 15, 2025 11:20
@sorenlouv sorenlouv added release_note:skip Skip the PR/issue when compiling release notes backport:skip This PR does not require backporting labels Sep 15, 2025
@sorenlouv sorenlouv changed the title [OneChat] Initial API tests with mocked LLM [Agent Builder] Initial API tests with mocked LLM Sep 15, 2025
Copy link
Member

@jbudz jbudz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.buildkite/ftr_platform_stateful_configs.yml

Copy link
Contributor

@csr csr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

x-pack/platform/test/tsconfig.json changes LGTM 👍

@sorenlouv sorenlouv enabled auto-merge (squash) September 16, 2025 09:49
@sorenlouv sorenlouv merged commit eea4c86 into elastic:main Sep 16, 2025
13 checks passed
@elasticmachine
Copy link
Contributor

💛 Build succeeded, but was flaky

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #136 / discover/security/context_awareness cell renderer ES|QL mode should render alert workflow status badge

Metrics [docs]

✅ unchanged

History

sorenlouv added a commit that referenced this pull request Sep 19, 2025
sorenlouv added a commit that referenced this pull request Sep 22, 2025
Follow up to #234985

The chart type was hardcoded to `Line`. This is removed and instead Lens
will auto-detect the most likely chart type
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this pull request Sep 24, 2025
Closes elastic/search-team#10970

This PR adds two API tests for the Agent Builder converse endpoint:

- Simple conversation test: Verifies basic conversation functionality,
including title generation and response handling
- ESQL query test: Tests complex tool-calling flows including esql
generation, searching ES and returning a structured tool result to the
LLM

Instead of calling a real LLM these tests call a simulated LLM (called
"LLM Proxy"). It's a lightweight HTTP server that simulates LLM
responses. The proxy:

- Acts as a drop-in replacement for an actual LLM 
- Intercepts requests to the LLM and returns predefined responses based
on pattern matching
- Supports complex conversation flows including tool calls
- Enables testing the full chain of interactions between Agent Builder
and the LLM

This approach allows us to test the integration between Agent Builder
and LLMs in a deterministic, cheap and fast way.

---------

Co-authored-by: kibanamachine <[email protected]>
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this pull request Sep 24, 2025
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this pull request Sep 24, 2025
Follow up to elastic#234985

The chart type was hardcoded to `Line`. This is removed and instead Lens
will auto-detect the most likely chart type
niros1 pushed a commit that referenced this pull request Sep 30, 2025
Closes elastic/search-team#10970

This PR adds two API tests for the Agent Builder converse endpoint:

- Simple conversation test: Verifies basic conversation functionality,
including title generation and response handling
- ESQL query test: Tests complex tool-calling flows including esql
generation, searching ES and returning a structured tool result to the
LLM

Instead of calling a real LLM these tests call a simulated LLM (called
"LLM Proxy"). It's a lightweight HTTP server that simulates LLM
responses. The proxy:

- Acts as a drop-in replacement for an actual LLM 
- Intercepts requests to the LLM and returns predefined responses based
on pattern matching
- Supports complex conversation flows including tool calls
- Enables testing the full chain of interactions between Agent Builder
and the LLM

This approach allows us to test the integration between Agent Builder
and LLMs in a deterministic, cheap and fast way.

---------

Co-authored-by: kibanamachine <[email protected]>
niros1 pushed a commit that referenced this pull request Sep 30, 2025
niros1 pushed a commit that referenced this pull request Sep 30, 2025
Follow up to #234985

The chart type was hardcoded to `Line`. This is removed and instead Lens
will auto-detect the most likely chart type
rylnd pushed a commit to rylnd/kibana that referenced this pull request Oct 17, 2025
Closes elastic/search-team#10970

This PR adds two API tests for the Agent Builder converse endpoint:

- Simple conversation test: Verifies basic conversation functionality,
including title generation and response handling
- ESQL query test: Tests complex tool-calling flows including esql
generation, searching ES and returning a structured tool result to the
LLM

Instead of calling a real LLM these tests call a simulated LLM (called
"LLM Proxy"). It's a lightweight HTTP server that simulates LLM
responses. The proxy:

- Acts as a drop-in replacement for an actual LLM 
- Intercepts requests to the LLM and returns predefined responses based
on pattern matching
- Supports complex conversation flows including tool calls
- Enables testing the full chain of interactions between Agent Builder
and the LLM

This approach allows us to test the integration between Agent Builder
and LLMs in a deterministic, cheap and fast way.

---------

Co-authored-by: kibanamachine <[email protected]>
rylnd pushed a commit to rylnd/kibana that referenced this pull request Oct 17, 2025
rylnd pushed a commit to rylnd/kibana that referenced this pull request Oct 17, 2025
Follow up to elastic#234985

The chart type was hardcoded to `Line`. This is removed and instead Lens
will auto-detect the most likely chart type
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport:skip This PR does not require backporting release_note:skip Skip the PR/issue when compiling release notes v9.2.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants