Support semantic reranking using contextual snippets instead of entire field text #129369

kderusso · 2025-06-12T19:36:32Z

Followup to the POC described in #128255

Adds the ability to rerank based on a smaller number of snippets.

Example, with a default of one snippet:

GET test/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": {
        "standard": {
          "query": {
            "term": {
              "other": "lotr"
            }
          }
        }
      },
      "rank_window_size": 2,
      "field": "semantic_text",
      "inference_text": "are all who wander lost?",
      "snippets": { }
    }
  }
}

Example, specifying snippets:

GET test/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": {
        "standard": {
          "query": {
            "term": {
              "other": "lotr"
            }
          }
        }
      },
      "rank_window_size": 2,
      "field": "semantic_text",
      "inference_text": "are all who wander lost?",
      "snippets": {
        "num_snippets": 3
      }
    }
  }
}

Not specifying snippets will continue to send the entire field contents into the reranker model.

jimczi

I did a first pass and left some comments. I think we can better isolate this change in the text similarity rank builder.

server/src/main/java/org/elasticsearch/search/rank/RankBuilder.java

server/src/main/java/org/elasticsearch/search/rank/feature/CustomRankInput.java

server/src/main/java/org/elasticsearch/search/rank/feature/RankFeatureShardPhase.java

.../main/java/org/elasticsearch/search/rank/context/RankFeaturePhaseRankCoordinatorContext.java

…rity reranker only

cla-checker-service · 2025-07-21T18:56:54Z

❌ Author of the following commits did not sign a Contributor Agreement:
9ae38c9, 8a82b13, 7c6848d, b6aaf8f, 2418e41

Please, read and sign the above mentioned agreement if you want to contribute to this project

kderusso · 2025-07-21T19:09:18Z

cla/check

jimczi · 2025-07-21T20:58:30Z

...k/inference/rank/textsimilarity/TextSimilarityRerankingRankFeaturePhaseRankShardContext.java

+                int fragmentSize = tokenSizeLimit * TOKEN_SIZE_LIMIT_MULTIPLIER;
+                highlightBuilder.fragmentSize(fragmentSize);
+                SearchHighlightContext searchHighlightContext = highlightBuilder.build(context.getSearchExecutionContext());
+                context.highlight(searchHighlightContext);


Should we also set noMatchSize to ensure that we always get a snippet for every document?

jimczi · 2025-07-21T21:10:21Z

...k/inference/rank/textsimilarity/TextSimilarityRerankingRankFeaturePhaseRankShardContext.java

+
+import static org.elasticsearch.search.rank.feature.RerankSnippetConfig.DEFAULT_NUM_SNIPPETS;
+
+public class TextSimilarityRerankingRankFeaturePhaseRankShardContext extends RerankingRankFeaturePhaseRankShardContext {


Do we really need the separation here between TextSimilarityRerankingRankFeaturePhaseRankShardContext and RerankingRankFeaturePhaseRankShardContext? I don't see RerankingRankFeaturePhaseRankShardContext being used elsewhere so maybe we can just have TextSimilarityRerankingRankFeaturePhaseRankShardContext and implement the full logic there?

jimczi · 2025-07-21T21:11:07Z

server/src/main/java/org/elasticsearch/search/retriever/CompoundRetrieverBuilder.java

@@ -337,7 +337,7 @@ protected SearchSourceBuilder finalizeSourceBuilder(SearchSourceBuilder sourceBu
     * @param ctx The query rewrite context
     * @return RetrieverBuilder the rewritten retriever
     */
-    protected RetrieverBuilder doRewrite(QueryRewriteContext ctx) {
+    protected RetrieverBuilder doRewrite(QueryRewriteContext ctx) throws IOException {


Is it a leftover?

jimczi · 2025-07-21T21:14:43Z

server/src/main/java/org/elasticsearch/search/rank/feature/SnippetRankInput.java

+ */
+public class SnippetRankInput implements Writeable {
+
+    private final RerankSnippetConfig snippets;


Is the separation between SnippetRankInput and RerankSnippetConfig necessary? Why not injecting numSnippets and snippetQueryBuilder directly here?

jimczi · 2025-07-21T21:15:37Z

...in/java/org/elasticsearch/xpack/inference/rank/textsimilarity/TextSimilarityRankBuilder.java

+    /**
+     * The default token size limit of the Elastic reranker is 512.
+     */
+    private static final int RERANK_TOKEN_SIZE_LIMIT = 512;


This is currently used as a character limit and not as a token limit. I don't understand why you're separating the RERANK_TOKEN_SIZE_LIMIT and the DEFAULT_TOKEN_SIZE_LIMIT. Currently the issue is that HighlightBuilder#fragmentSize sets the size of the fragment in terms of number of characters and not tokens.
512 being the context length of the Elastic re-ranker, we can hack temporarily by multiplying with the average length of a token in English. 4096 seems high since we'd expect an average of 8 characters per token. The literature on the topic is more on an average of 4-5 chars per token even less if we consider the model's vocabulary and tokenisation (wordpiece, ...).

kderusso added 10 commits April 2, 2025 14:03

Do highlighting in RankFeatureShardPhase

d2c22a6

Propagate to

3f52ac7

Only rerank the first snippet

0ffaf94

Notes

f99c33e

Support reranking based on max score of multiple snippets per document

0196a7c

Update from main

d796953

Fix compilation error

1dc80a1

Merge from main

7537c21

Fix merge compile errors

f68b720

Merge update from main

b053fa3

elasticsearchmachine added the v9.1.0 label Jun 12, 2025

kderusso added 6 commits June 13, 2025 14:34

Merge main into branch

bbee29f

Fix compilation error after upstream merge

c4bcb43

Combine featureData and snippets

ce8364b

Remove docIndices from RankFeatureDoc

d8dbaab

Update API - remove max size, default num snippets, rename

b909ab9

Merge branch 'main' into kderusso/rerank-snippet-poc

243e169

elasticsearchmachine added v9.2.0 and removed v9.1.0 labels Jun 26, 2025

kderusso added 10 commits June 27, 2025 08:50

Merge main into kderusso/rerank-snippet-poc

1684fba

Add hardcoded max token length

e6208ec

Fix error in docId calculation

e3259d9

Fix snippet calculation

16dbca0

Fix test compilation errors

fee2b0d

Merge branch 'main' into kderusso/rerank-snippet-poc

9119d26

Fix more test compilation failures

b418f87

Cleanup

c30da72

Minor variable cleanup

cf2c8b3

Add some tests

4675586

kderusso changed the title ~~Rerank snippet POC~~ Support semantic reranking using contextual snippets instead of entire field text Jul 2, 2025

Merge main into kderusso/rerank-snippet-poc

15bdcef

github-actions bot deployed to docs-preview July 10, 2025 18:38 View deployment

Fix tests

75568a9

github-actions bot deployed to docs-preview July 10, 2025 21:07 View deployment

jimczi reviewed Jul 11, 2025

View reviewed changes

Addressed PR feedback moving snippet generation down into text simila…

143ed4a

…rity reranker only

github-actions bot deployed to docs-preview July 15, 2025 18:18 View deployment

elasticsearchmachine and others added 2 commits July 15, 2025 18:25

[CI] Auto commit changes from spotless

2418e41

Minor cleanup

ed08549

github-actions bot deployed to docs-preview July 15, 2025 18:26 View deployment

Merge main into kderusso/rerank-snippet-poc

ab8916b

github-actions bot deployed to docs-preview July 15, 2025 18:28 View deployment

Fix CI test compilation failures

b641438

github-actions bot deployed to docs-preview July 15, 2025 19:07 View deployment

Fix tests

a835902

github-actions bot deployed to docs-preview July 15, 2025 20:16 View deployment

kderusso added 2 commits July 16, 2025 09:35

Minor cleanup

1a79987

Merge branch 'main' into kderusso/rerank-snippet-poc

d330d5e

kderusso requested a review from jimczi July 16, 2025 13:42

github-actions bot deployed to docs-preview July 16, 2025 13:43 View deployment

Merge main into kderusso/rerank-snippet-poc

9e53b17

github-actions bot deployed to docs-preview July 17, 2025 18:15 View deployment

kderusso added 2 commits July 21, 2025 13:58

Increase highlighter fragment size for snippets

f88d678

Add feature flag for snippet reranking

3b08e12

github-actions bot deployed to docs-preview July 21, 2025 18:57 View deployment

Merge main into kderusso/rerank-snippet-poc

55277a0

github-actions bot deployed to docs-preview July 21, 2025 20:34 View deployment

jimczi reviewed Jul 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support semantic reranking using contextual snippets instead of entire field text #129369

Support semantic reranking using contextual snippets instead of entire field text #129369

kderusso commented Jun 12, 2025 •

edited

Loading

Uh oh!

jimczi left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cla-checker-service bot commented Jul 21, 2025

Uh oh!

kderusso commented Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Uh oh!

Uh oh!


		import static org.elasticsearch.search.rank.feature.RerankSnippetConfig.DEFAULT_NUM_SNIPPETS;

		public class TextSimilarityRerankingRankFeaturePhaseRankShardContext extends RerankingRankFeaturePhaseRankShardContext {

Support semantic reranking using contextual snippets instead of entire field text #129369

Are you sure you want to change the base?

Support semantic reranking using contextual snippets instead of entire field text #129369

Conversation

kderusso commented Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jimczi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cla-checker-service bot commented Jul 21, 2025

Uh oh!

kderusso commented Jul 21, 2025

Uh oh!

jimczi Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

jimczi Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

jimczi Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

jimczi Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

jimczi Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kderusso commented Jun 12, 2025 •

edited

Loading