You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/build/nilRAG.md
+18Lines changed: 18 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -114,3 +114,21 @@ nilRAG is a standalone library available through
114
114
a feature of [SecretLLM](https://docs.nillion.com/build/secretLLM/quickstart) to
115
115
enhance the inference with context that has been uploaded to [SecretVault](https://docs.nillion.com/build/secret-vault).
116
116
117
+
118
+
### Performance Expectations
119
+
120
+
We have performed a series of benchmarks to evaluate the performance of nilRAG.
121
+
Currently, nilRAG scales linearly to the number of rows stored in nilDB.
122
+
The following table shows latency to upload to nilDB multiple paragraphs of a few sentences long, as well as the runtime for AI inference using SecretLLM with nilRAG.
123
+
124
+
| Number of Paragraphs Stored in nilDB | Upload Time to nilDB (sec.) | Query Time (Inference + RAG) (sec.) |
Additionally, using multiple concurrent users, the query time for inference with nilRAG increases.
133
+
Performing inference with nilRAG with a content of 100 paragraphs takes approximately 5 seconds for a single user, while with ten concurrent users the inference time for the same content goes up to almost 9 seconds.
134
+
We're developing new research to further accelerate nilRAG and make it more scalable, stay tuned!
0 commit comments