You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
3. **SecretVault:** SecretVault stores the blinded chunks and embeddings
65
90
provided by data owners. When a client submits a query, SecretVault computes
@@ -89,3 +114,21 @@ nilRAG is a standalone library available through
89
114
a feature of [SecretLLM](https://docs.nillion.com/build/secretLLM/quickstart) to
90
115
enhance the inference with context that has been uploaded to [SecretVault](https://docs.nillion.com/build/secret-vault).
91
116
117
+
118
+
### Performance Expectations
119
+
120
+
We have performed a series of benchmarks to evaluate the performance of nilRAG.
121
+
Currently, nilRAG scales linearly to the number of rows stored in nilDB.
122
+
The following table shows latency to upload to nilDB multiple paragraphs of a few sentences long, as well as the runtime for AI inference using SecretLLM with nilRAG.
123
+
124
+
| Number of Paragraphs Stored in nilDB | Upload Time to nilDB (sec.) | Query Time (Inference + RAG) (sec.) |
Additionally, using multiple concurrent users, the query time for inference with nilRAG increases.
133
+
Performing inference with nilRAG with a content of 100 paragraphs takes approximately 5 seconds for a single user, while with ten concurrent users the inference time for the same content goes up to almost 9 seconds.
134
+
We're developing new research to further accelerate nilRAG and make it more scalable, stay tuned!
0 commit comments