Conversation

@AlSchlo
Contributor

@AlSchlo AlSchlo commented Oct 20, 2025

This PR introduces Panorama into HNSWFlat, following our paper. Panorama achieves up to 4× lower latency on higher-dimensional data, making it a great option for medium-sized datasets that don't benefit much from quantization.

Below are some benchmarks on SIFT-128, GIST-960, and synthetic 2048-dimensional data. I recommend checking out the paper for more results. As expected, Panorama is not a silver bullet when combined with HNSW—it’s only worthwhile for high-dimensional data.

It might be worth considering, in the future, adding a function that dynamically sets the number of levels. However, this would require reorganizing the cumulative sums.
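
To make the role of those cumulative sums concrete, here is a minimal sketch of one possible per-vector layout, assuming each entry stores the norm of the dimensions from a given level onward (the PR may organize the sums differently; `tail_norms` and its arguments are illustrative names only):

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

// Sketch only: per-vector "cumulative sums" for Panorama-style pruning.
// cum[l] holds the L2 norm of the tail of the vector starting at level l, so
// a Cauchy-Schwarz bound on the unseen part costs one multiply at query time.
std::vector<float> tail_norms(const float* x, int d, int num_levels) {
    std::vector<float> cum(num_levels, 0.0f);
    int dims_per_level = (d + num_levels - 1) / num_levels;
    float sum = 0.0f;
    for (int level = num_levels - 1; level >= 0; level--) {
        int start = level * dims_per_level;
        int end = std::min(d, start + dims_per_level);
        for (int j = start; j < end; j++) {
            sum += x[j] * x[j];
        }
        cum[level] = std::sqrt(sum); // norm of dimensions [start, d)
    }
    return cum;
}
```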

SIFT-128

Note: SIFT-128 performs slightly worse here than in our paper because we use 8 levels, whereas the paper explored several level configurations. Eight levels introduce quite a bit of overhead for 128-dimensional data, but I kept it consistent across all benchmarks for comparison.

[figure: bench_hnsw_flat_panorama_SIFT1M]

GIST-960

[figure: bench_hnsw_flat_panorama_GIST1M]

Synthetic-2048

[figure: bench_hnsw_flat_panorama_Synthetic2048D]

@meta-cla meta-cla bot added the CLA Signed label Oct 20, 2025
Contributor

@mdouze mdouze left a comment


Thanks for the PR.
About Panorama in general: would it be feasible to make an IndexRefine that supports FlatPanorama as a refinement index?
The reason is that it may be more efficient to do all the non-exhaustive searches in low dimension and refine the result list at the end.
This would also make it possible to apply Panorama to low-accuracy, fast indexes like FastScan and RaBitQ.
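
For context, here is a minimal sketch of that pattern with today's classes, assuming an arbitrary fast-scan base (the factory string and the `refine_pattern` helper are illustrative); the suggestion above amounts to swapping `IndexFlat` for a Panorama-accelerated flat index in the same slot:

```cpp
#include <faiss/Index.h>
#include <faiss/IndexFlat.h>
#include <faiss/IndexRefine.h>
#include <faiss/index_factory.h>
#include <memory>

// Sketch only: a fast, low-accuracy base index produces a candidate list and
// an exact flat index re-ranks it; Panorama would accelerate the re-ranking.
void refine_pattern(int d) {
    // Example fast-scan base (assumes d is divisible by 16).
    std::unique_ptr<faiss::Index> base(
            faiss::index_factory(d, "IVF256,PQ16x4fs"));
    faiss::IndexFlat refine_flat(d); // exact re-ranking stage
    faiss::IndexRefine index(base.get(), &refine_flat);
    index.k_factor = 4.0f; // fetch 4*k candidates from the base before refining
    // index.train(nb, xb); index.add(nb, xb); index.search(nq, xq, k, D, I);
}
```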

for (int j = start_idx; j < end_idx; j++) {
    sum += x[j] * x[j];
}
dst_cum_sums[level] = sqrt(sum);
Contributor


Is there a reason to do a sqrt (i.e., use L2 distance instead of the usual squared L2 distance)?

Contributor Author


From Cauchy-Schwarz, we precompute the `sqrt` so it does not happen in the hot path. A multiplication is always going to be cheaper.
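
To spell out the hot-path cost, here is a minimal sketch (assumed, not the PR's exact kernel; `ip_upper_bound` and its arguments are illustrative names): with per-level tail norms precomputed via `sqrt` at add time, the Cauchy-Schwarz bound on the unseen part of the inner product costs a single multiplication per level:

```cpp
#include <vector>

// Sketch only: upper bound on the full inner product after `level` levels.
// By Cauchy-Schwarz, |<q_tail, x_tail>| <= ||q_tail|| * ||x_tail||, and both
// tail norms were already computed with sqrt outside the query hot path.
inline float ip_upper_bound(
        float partial_ip,                       // exact IP over processed levels
        const std::vector<float>& q_tail_norms, // per-level norms of the query tail
        const std::vector<float>& x_tail_norms, // per-level norms of the stored vector tail
        int level) {
    return partial_ip + q_tail_norms[level] * x_tail_norms[level];
}
```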

* in a random order, which makes cache misses dominate the distance computation
* time.
*
* The num_levels parameter controls the granularity of progressive distance
Contributor


Would it be possible to call it something other than num_levels? HNSW has its own notion of levels, namely the number of hierarchy levels.

Contributor Author


Good idea, perhaps num_panorama_levels?

* the algorithm to prune unpromising candidates early using Cauchy-Schwarz
* bounds on partial inner products. Hence, recall is not guaranteed to be the
* same as vanilla HNSW due to the heterogeneous precision within the search
* beam (exact vs. partial distance estimates affecting traversal order).
Contributor


Does this mean that the batch size on which the distances are computed is at most the out-degree of the HNSW graph (set to 64 by default)?

Contributor Author


Precisely.

@mdouze
Contributor

mdouze commented Oct 20, 2025

Please share any performance comparison you have with this code vs. the HNSWFlat implementation.
Since the data is not contiguous, the performance profile could be different from an IVFFlat index.

@AlSchlo
Contributor Author

AlSchlo commented Oct 20, 2025

@mdouze thanks for the review

  1. Yes, we can use it in IndexRefine; this is a good idea. I would assume that IndexRefine does not keep its vectors sequential in memory by design? If not, this is OK but suboptimal, as the gains of Panorama are more modest in the presence of all those cache misses. We cover this in the paper.

  2. Performance of HNSW is benchmarked in the paper too. It is still worth it on higher-dimensional data, but the gains are much more ad hoc than for IndexIVFFlatPanorama. We will include some benchmarks with this new, cleaned-up code. Here is the graph from the paper.

[figure: graph from the paper]
  3. Panorama can work on IVFPQ (including FastScan), but the integration there is a bigger effort (to support all AVX targets, etc.), as we have to interleave the codes to keep the SIMD lanes busy. In fact, this is where we see the largest speedups.

@alexanderguzhva
Contributor

@AlSchlo is it worth making the default (UB + LB) / 2 behavior configurable by allowing, say, other options like just LB?

@AlSchlo
Contributor Author

AlSchlo commented Oct 20, 2025

@alexanderguzhva excellent suggestion! We actually used to have an epsilon knob there, but we ended up not discussing it in the paper. IMO it's a knob that just adds confusion and makes the workload more unpredictable.

We did not study it in more detail as the paper was getting too dense.
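
For reference, a minimal sketch of the knob being discussed (the epsilon parameter is an assumption and is not part of this PR): an interpolation between the lower and upper bounds, where epsilon = 0.5 reproduces the default (LB + UB) / 2 estimate and epsilon = 0 uses just LB, as suggested above:

```cpp
// Sketch only: a hypothetical epsilon in [0, 1] interpolating between the
// Cauchy-Schwarz lower and upper bounds on the distance. epsilon = 0.5 gives
// the default (LB + UB) / 2 estimate; epsilon = 0 uses just the lower bound.
inline float distance_estimate(float lb, float ub, float epsilon = 0.5f) {
    return lb + epsilon * (ub - lb);
}
```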

@AlSchlo
Contributor Author

AlSchlo commented Oct 21, 2025

Will write tests sometime this week. I also realize I need to change the write / read functions.
This slipped through the cracks for IVFFlatPanorama somehow.

@AlSchlo AlSchlo marked this pull request as ready for review October 22, 2025 08:26
@AlSchlo
Contributor Author

AlSchlo commented Oct 22, 2025

Pending benchmarks, I believe this implementation is pretty much complete.
The next line of work will be (1) fixing the nits in IVFFlat (there are currently 2 PRs open from @aknayar) and (2) implementing IndexRefinePanorama as @mdouze suggested.

@AlSchlo
Contributor Author

AlSchlo commented Oct 24, 2025

@mdouze The PR is done — could you please re-review?

Let's also try to get #4628 merged. It's a very useful metric to have, as there's a strong correlation between these stats and empirical performance.

Here's a nice excerpt from the paper that summarizes this:

[figure: excerpt from the paper]

Thanks!

@AlSchlo AlSchlo requested a review from mdouze October 24, 2025 02:57
@AlSchlo
Contributor Author

AlSchlo commented Oct 24, 2025

Once this is done, I will focus on getting the IndexRefinePanorama PR in.

@mnorris11

Hi @AlSchlo and @aknayar, many thanks for the contributions! The stats PR has been merged.

After discussing internally, it sounds like the priority is IndexRefinePanorama. For our (my) learning: after this is in place, is there still a need for the various other indexes like IndexHNSWFlatPanorama, or can it then be applied to all indexes?

@AlSchlo
Contributor Author

AlSchlo commented Oct 28, 2025

Hi @mnorris11,

IndexRefinePanorama would be a great fit when Panorama cannot be directly applied in the initial search space — for instance, when the dimensionality is small, or when integration into the main index has not yet been done. As a general rule, Panorama can be integrated almost anywhere with some engineering effort.

In our paper, we integrate Panorama into IVFPQ, and it performs well even for low-dimensional data, thanks to a SIMD optimization technique called byte-slicing, which helps keep vector lanes fully utilized. However, that implementation requires custom SIMD kernels, which makes it less portable.

This PR instead targets the common scenario where the dimensionality is large and no downstream index is used to refine results after HNSW. That setup is quite typical in practice — for instance, it would benefit us internally at Databricks.

So, my take would be to include both implementations: they address different use cases. Longer-term, the goal is to adapt Panorama into quantization-based techniques like RaBitQ, so we can accelerate both the initial search and refinement phases. This is our current research direction.

@AlSchlo
Contributor Author

AlSchlo commented Oct 28, 2025

TL;DR @mnorris11

IndexRefinePanorama integrates Panorama into the refinement phase (which might be needed if the upstream index yields poor recall).

IndexHNSWPanorama integrates Panorama into the search phase of HNSW.

@alexanderguzhva
Contributor

@AlSchlo the problem with RaBitQ that I see is that the overhead of storing the additional coefficients is going to be significant, unlike with 32-bit or even 16-bit floats for the refinement.

@AlSchlo
Contributor Author

AlSchlo commented Oct 29, 2025

@alexanderguzhva Yes, this is one issue that will need clever engineering. I was thinking of perhaps quantizing those coefficients. Also, RaBitQ theory assumes a random projection; we need to adapt the theory to make it work with Cayley & PCA.

From a computational point of view, however, even if the vector is binary-quantized, Panorama can still be applied.

@meta-codesync
Contributor

meta-codesync bot commented Oct 30, 2025

@mnorris11 has imported this pull request. If you are a Meta employee, you can view this in D85902427.

@mnorris11

Sorry for the delay on these PRs; I'm still conducting some benchmarking.

@aknayar
Contributor

aknayar commented Nov 7, 2025

@mnorris11 No worries, and thank you so much for the reviews! As an update, once #4645 is confirmed, I have a local build of IndexRefinePanorama ready to submit, with really promising results: 2x end-to-end speedups on GIST with IVF256,PQ60x4fs as the base index, compared to L2Flat as the refine index (seen below). I think speedups of 3x and above can be expected on more amenable datasets (OpenAI's DBpedia-Large, etc.).

[figure: GIST end-to-end benchmark results]
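
For context, a minimal sketch of the baseline pipeline described above, assuming the standard index_factory string for a fast-scan IVF-PQ base with a flat L2 refinement stage (IndexRefinePanorama would accelerate that refinement step):

```cpp
#include <faiss/Index.h>
#include <faiss/index_factory.h>
#include <memory>

// Sketch only: "RFlat" wraps the base index in an exact flat re-ranking stage.
int main() {
    int d = 960; // GIST dimensionality
    std::unique_ptr<faiss::Index> index(
            faiss::index_factory(d, "IVF256,PQ60x4fs,RFlat"));
    // index->train(nb, xb); index->add(nb, xb); index->search(nq, xq, k, D, I);
    return 0;
}
```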
