Skip to content

Conversation

@dgieselaar
Copy link
Member

@dgieselaar dgieselaar commented Sep 11, 2025

Various improvements for the executeAsReasoningAgent helper:

  • add finalToolChoice to formalize the concept of having a tool call that is required for the definitive output
  • change from PLAN > GATHER > REFLECT to GOAL > REFLECT > PLAN to have the LLM continuously focus on the goal of the task
  • remove all content after reasoning markers to prevent the LLM from producing definitive output at the wrong time (we should use stopSequences for this in the near future)
  • add tests

@dgieselaar dgieselaar force-pushed the reasoning-util-improvements branch 3 times, most recently from 12bca28 to e955e80 Compare September 15, 2025 14:39
@dgieselaar dgieselaar added release_note:skip Skip the PR/issue when compiling release notes backport:skip This PR does not require backporting v9.2.0 backport:version Backport to applied version labels labels Sep 15, 2025
@dgieselaar dgieselaar force-pushed the reasoning-util-improvements branch from 904b2e4 to 2a54209 Compare September 15, 2025 15:15
@dgieselaar dgieselaar marked this pull request as ready for review September 15, 2025 18:11
@dgieselaar dgieselaar requested review from a team as code owners September 15, 2025 18:11
@botelastic botelastic bot added ci:project-deploy-observability Create an Observability project Team:Obs AI Assistant Observability AI Assistant labels Sep 15, 2025
@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

@github-actions
Copy link
Contributor

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

  • /oblt-deploy : Deploy a Kibana instance using the Observability test environments.
  • run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

@dgieselaar dgieselaar enabled auto-merge (squash) September 16, 2025 07:01
@elasticmachine
Copy link
Contributor

⏳ Build in-progress

  • Buildkite Build
  • Commit: a46ac14
  • Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-234724-a46ac14cf554

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #14 / discover/group3 discover request counts ES|QL mode should send expected requests for saved search changes

History

@dgieselaar dgieselaar merged commit c290921 into elastic:main Sep 16, 2025
13 checks passed
@dgieselaar dgieselaar deleted the reasoning-util-improvements branch September 16, 2025 08:56
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this pull request Sep 24, 2025
Various improvements for the `executeAsReasoningAgent` helper:

- add `finalToolChoice` to formalize the concept of having a tool call
that is required for the definitive output
- change from PLAN > GATHER > REFLECT to GOAL > REFLECT > PLAN to have
the LLM continuously focus on the goal of the task
- remove all content after reasoning markers to prevent the LLM from
producing definitive output at the wrong time (we should use
`stopSequences` for this in the near future)
- add tests
niros1 pushed a commit that referenced this pull request Sep 30, 2025
Various improvements for the `executeAsReasoningAgent` helper:

- add `finalToolChoice` to formalize the concept of having a tool call
that is required for the definitive output
- change from PLAN > GATHER > REFLECT to GOAL > REFLECT > PLAN to have
the LLM continuously focus on the goal of the task
- remove all content after reasoning markers to prevent the LLM from
producing definitive output at the wrong time (we should use
`stopSequences` for this in the near future)
- add tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport:skip This PR does not require backporting backport:version Backport to applied version labels ci:project-deploy-observability Create an Observability project release_note:skip Skip the PR/issue when compiling release notes Team:Obs AI Assistant Observability AI Assistant v9.2.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants