You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pages/deployments/operator/api.md
+5Lines changed: 5 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,6 +53,7 @@ _Appears in:_
53
53
| Field | Description | Default | Validation |
54
54
| --- | --- | --- | --- |
55
55
|`model`_string_| Model is the LLM model name to use. || Optional: {} <br /> |
56
+
|`toolModel`_string_| Model to use for tool calling, which is less frequent and often requires more advanced reasoning || Optional: {} <br /> |
56
57
|`baseUrl`_string_| A custom base url to use, for reimplementations of the same API scheme (for instance Together.ai uses the OpenAI API spec) || Optional: {} <br /> |
57
58
|`tokenSecretRef`_[SecretKeySelector](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.29/#secretkeyselector-v1-core)_| TokenSecretRef is a reference to the local secret holding the token to access<br />the configured AI provider. || Required: {} <br /> |
58
59
@@ -98,6 +99,7 @@ _Appears in:_
98
99
|`endpoint`_string_| Your Azure OpenAI endpoint, should be formatted like: https://{endpoint}/openai/deployments/{deployment-id}" || Required: {} <br /> |
99
100
|`apiVersion`_string_| The azure openai Data plane - inference api version to use, defaults to 2024-10-01-preview or the latest available || Optional: {} <br /> |
100
101
|`model`_string_| The OpenAi Model you wish to use. If not specified, Plural will provide a default || Optional: {} <br /> |
102
+
|`toolModel`_string_| Model to use for tool calling, which is less frequent and often requires more advanced reasoning || Optional: {} <br /> |
101
103
|`tokenSecretRef`_[SecretKeySelector](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.29/#secretkeyselector-v1-core)_| TokenSecretRef is a reference to the local secret holding the token to access<br />the configured AI provider. || Required: {} <br /> |
102
104
103
105
@@ -115,6 +117,7 @@ _Appears in:_
115
117
| Field | Description | Default | Validation |
116
118
| --- | --- | --- | --- |
117
119
|`modelId`_string_| The AWS Bedrock Model ID to use || Required: {} <br /> |
120
+
|`toolModelId`_string_| Model to use for tool calling, which is less frequent and often requires more advanced reasoning || Optional: {} <br /> |
118
121
|`accessKeyId`_string_| An AWS Access Key ID to use, can also use IRSA to acquire credentials || Optional: {} <br /> |
119
122
|`secretAccessKeyRef`_[SecretKeySelector](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.29/#secretkeyselector-v1-core)_| An AWS Secret Access Key to use, can also use IRSA to acquire credentials || Optional: {} <br /> |
120
123
@@ -1602,6 +1605,7 @@ _Appears in:_
1602
1605
| --- | --- | --- | --- |
1603
1606
|`url`_string_| URL is the url this model is queryable on || Required: {} <br /> |
1604
1607
|`model`_string_| Model is the Ollama model to use when querying the /chat api || Required: {} <br /> |
1608
+
|`toolModel`_string_| Model to use for tool calling, which is less frequent and often requires more advanced reasoning || Optional: {} <br /> |
1605
1609
|`tokenSecretRef`_[SecretKeySelector](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.29/#secretkeyselector-v1-core)_| TokenSecretRef is a reference to the local secret holding the contents of a HTTP Authorization header<br />to send to your ollama api in case authorization is required (eg for an instance hosted on a public network) || Optional: {} <br /> |
1606
1610
1607
1611
@@ -2731,6 +2735,7 @@ _Appears in:_
2731
2735
| Field | Description | Default | Validation |
2732
2736
| --- | --- | --- | --- |
2733
2737
|`model`_string_| The Vertex AI model to use || Optional: {} <br /> |
2738
+
|`toolModel`_string_| Model to use for tool calling, which is less frequent and often requires more advanced reasoning || Optional: {} <br /> |
2734
2739
|`project`_string_| The GCP project you'll be using || Required: {} <br /> |
2735
2740
|`location`_string_| The GCP region Vertex is queried from || Required: {} <br /> |
2736
2741
|`endpoint`_string_| A custom endpoint for self-deployed models || Optional: {} <br /> |
0 commit comments