From bf712a983fb84e7be7507e102a3e654b5af4fe07 Mon Sep 17 00:00:00 2001 From: Milvus-doc-bot Date: Thu, 12 Dec 2024 08:50:15 +0000 Subject: [PATCH] Release new docs to master --- v2.5.x/site/en/reference/gpu_index.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/v2.5.x/site/en/reference/gpu_index.md b/v2.5.x/site/en/reference/gpu_index.md index 5a71b88ca..52d657a85 100644 --- a/v2.5.x/site/en/reference/gpu_index.md +++ b/v2.5.x/site/en/reference/gpu_index.md @@ -27,6 +27,7 @@ GPU_CAGRA is a graph-based index optimized for GPUs, Using inference-grade GPUs | `graph_degree` | Affects search performance and recall by setting the graph's degree after pruning. A larger difference between these two degrees results in a longer build time. Its value must be smaller than the value of __intermediate_graph_degree__. | 64 | | `build_algo` | Selects the graph generation algorithm before pruning. Possible values:
IVF_PQ: Offers higher quality but slower build time.
NN_DESCENT: Provides a quicker build with potentially lower recall. | IVF_PQ | | `cache_dataset_on_device` | Decides whether to cache the original dataset in GPU memory. Possible values:
"true": Caches the original dataset to enhance recall by refining search results.
"false": Does not cache the original dataset to save gpu memory. | "false" | + | `adapt_for_cpu` | Decides whether to use GPU for index-building and CPU for search.
Setting this parameter to `true` requires the presence of the `ef` parameter in the search requests. | "false" | - Search parameters @@ -36,6 +37,7 @@ GPU_CAGRA is a graph-based index optimized for GPUs, Using inference-grade GPUs | `search_width` | Specifies the number of entry points into the CAGRA graph during the search. Increasing this value can enhance recall but may impact search performance(e.g. 1, 2, 4, 8, 16, 32). | Empty | | `min_iterations` / `max_iterations` | Controls the search iteration process. By default, they are set to `0`, and CAGRA automatically determines the number of iterations based on `itopk_size` and `search_width`. Adjusting these values manually can help balance performance and accuracy. | `0` | | `team_size` | Specifies the number of CUDA threads used for calculating metric distance on the GPU. Common values are a power of 2 up to 32 (e.g. 2, 4, 8, 16, 32). It has a minor impact on search performance. The default value is `0`, where Milvus automatically selects the `team_size` based on the vector dimension. | `0` | + | `ef` | Specifies the query time/accuracy trade-off. A higher `ef` value leads to more accurate but slower search.
This parameter is mandatory if you set `adapt_for_cpu` to `true` when you build the index. | `[top_k, int_max]` | * Limits on search