openai-proxy does not support custom reasoning and embedding models #3064

@DatCaptainHorse

Description

Describe the bug
When self-hosting Letta and using an OpenAI-compatible server with custom models (Qwen3) loaded, reasoning cannot be enabled in the Letta application and the embedding model cannot be changed.

Please describe your setup

  • How are you running Letta?
    • Docker image, run with Podman
  • Describe your setup
    • Linux, CachyOS (Arch-based distro)
    • Running Letta self-hosted in a container (rough launch command sketched below)
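
For reference, the launch looks roughly like the sketch below. The image name, port, and environment variable names (OPENAI_API_KEY / OPENAI_API_BASE) are written from memory and are assumptions, so they may not match the exact command used:

    # Rough sketch of the Podman launch (image name, port, and env var names
    # are from memory and may differ). OPENAI_API_BASE points the openai-proxy
    # provider at the local OpenAI-compatible server that serves Qwen3.
    podman run -d --name letta \
      -p 8283:8283 \
      -e OPENAI_API_KEY="dummy-key" \
      -e OPENAI_API_BASE="http://host.containers.internal:8000/v1" \
      letta/letta:latest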

Additional context

  • What model are you using?
    • Qwen3 14B (self-hosted)

Agent File (optional)
It's just the memory-agent template.


Local LLM details

If you are trying to run Letta with local LLMs, please provide the following information:

  • The exact model you're trying to use (e.g. dolphin-2.1-mistral-7b.Q6_K.gguf)
    • Qwen3 14B
  • The local LLM backend you are using (web UI? LM Studio?)
    • An OpenAI-compatible server (as noted in the bug description).
  • Your hardware for the local LLM backend (local computer? operating system? remote RunPod?)
    • Local workstation with Intel Arc GPUs (a quick check of the backend's model list is sketched below).
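
A quick way to confirm which models the backend actually advertises is the standard OpenAI-compatible /v1/models endpoint; the host and port below are placeholders for the local server, not the real values:

    # List the models the OpenAI-compatible backend exposes
    # (http://localhost:8000 is a placeholder for the local server's real address)
    curl http://localhost:8000/v1/models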
