Add missing internal triples for UPDATEs by RobinTF · Pull Request #2674 · ad-freiburg/qlever

RobinTF · 2026-01-28T14:03:02Z

This PR adds code to add the missing "object"@language ql:langtag <@language> internal triples to the delta triples on insertions. This way all kinds of language filters now work with update. A caveat is that these triples are never removed again, so the memory requirement will simply increase more and more, but the behaviour will never be wrong, since these new triples are always joined with a regular index scan before being used.

codecov · 2026-01-28T15:06:56Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 91.59%. Comparing base (8c2d7c0) to head (6b1d3eb).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #2674   +/-   ##
=======================================
  Coverage   91.58%   91.59%           
=======================================
  Files         480      480           
  Lines       41357    41368   +11     
  Branches     5494     5496    +2     
=======================================
+ Hits        37877    37889   +12     
+ Misses       1901     1900    -1     
  Partials     1579     1579

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

joka921

A request for minor improvements.

src/index/DeltaTriples.cpp

src/index/DeltaTriples.h

…ternal-triples

joka921

small suggestions for the caching, let's see how easy they are to implement, but there is a chance to learn something here:)

src/engine/SpatialJoinParser.cpp

src/index/DeltaTriples.cpp

joka921

Definitely improved, I have some minor suggestions, but the next round should be good to go.

joka921 · 2026-01-30T12:10:05Z

src/index/DeltaTriples.h

  ad_utility::util::LRUCache<std::string, Id> languageTagCache_{
      languageTagCacheSize_};

+  // Cache commonly used predicates between calls.


Can be more precise. It caches the IDs of commonly used language tagged predicates like @en@rdfs:label

That's not what it does though. It just caches the predicates without the language tags, because those might be the ones that are most likely expensive to look up.

joka921 · 2026-01-30T12:11:26Z

src/util/LruCache.h

+  CPP_template(typename Key, typename Func)(
      requires ad_utility::InvocableWithConvertibleReturnType<
-          Func, V, const K&>) const V& getOrCompute(const K& key,
+          Func, V, const K&>) const V& getOrCompute(Key&& key,


Should be Func, V, const Key& in the requires clause for a little more precision.

No it shouldn't, because we end up passing a const ref of the actual thing to the function. (I ended up with slightly different code than we discussed)

Agreed, thanks for the explanation.

joka921 · 2026-01-30T12:12:30Z

src/util/LruCache.h

    }
-    auto result = cache_.try_emplace(key, computeFunction(key), keys_.begin());
+    auto result = cache_.try_emplace(
+        AD_FWD(key), computeFunction(keys_.front()), keys_.begin());


yes, that does the trick (the double string creation still is not nice, but that was already there before...)

joka921 · 2026-01-30T12:13:24Z

src/index/DeltaTriples.h

      languageTagCacheSize_};

+  // Cache commonly used predicates between calls.
+  static constexpr size_t predicateCacheSize_ = 50;


I think both of the cache sizes are a little small (it is global for the full index, maybe use a few hundreds or thousands?

For the language tags, the cache size seems reasonable in my opinion, there aren't that many frequently used languages. For the predicates you might have a point. It's heavily dataset dependent though. For Wikidata to get to a predicate that's being used fewer than 1M times, you'd have to cache at least 587 predicates which are used more frequently.
I'll increase it to 100 for now and I'll leave the final decision to @hannahbast

joka921

Thank youy very much, feel free to forward this to @hannahbast

sparql-conformance · 2026-01-30T20:20:58Z

Overview

Number of Tests	Passed ✅	Intended ✅	Failed ❌	Not tested
547	450	73	24	0

Conformance check passed ✅

No test result changes.

Details: https://qlever.dev/sparql-conformance-ui?cur=6b1d3eb6b28b6b08b0c1e0905ad9e0eaf10c1cba&prev=8c2d7c0ae8710cd555004525bedd27ffac060b1b

sonarqubecloud · 2026-01-30T21:55:54Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Add missing internal triples for UPDATEs

d1090bc

RobinTF requested review from hannahbast and joka921 January 28, 2026 14:03

joka921 requested changes Jan 29, 2026

View reviewed changes

src/index/DeltaTriples.cpp Outdated Show resolved Hide resolved

src/index/DeltaTriples.h Outdated Show resolved Hide resolved

RobinTF added 2 commits January 29, 2026 12:19

Address PR comments

e700ff8

Merge remote-tracking branch 'ad-freiburg/master' into add-missing-in…

fb3dafe

…ternal-triples

joka921 requested changes Jan 29, 2026

View reviewed changes

src/engine/SpatialJoinParser.cpp Show resolved Hide resolved

src/index/DeltaTriples.cpp Outdated Show resolved Hide resolved

src/index/DeltaTriples.cpp Show resolved Hide resolved

RobinTF added 2 commits January 30, 2026 12:04

Merge branch 'master' into add-missing-internal-triples

327c6b7

Address PR comments

4874a94

joka921 reviewed Jan 30, 2026

View reviewed changes

RobinTF added 3 commits January 30, 2026 13:44

Fix compilation and address PR comments

11b746b

Fix invalid assertion

4ce3437

Fix unit tests

a7eeba7

joka921 approved these changes Jan 30, 2026

View reviewed changes

Increase cache sizes to 1000

6b1d3eb

Conversation

RobinTF commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

joka921 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

joka921 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

joka921 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

joka921 left a comment

Choose a reason for hiding this comment

Uh oh!

sparql-conformance bot commented Jan 30, 2026

Overview

Conformance check passed ✅

Uh oh!

sonarqubecloud bot commented Jan 30, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RobinTF commented Jan 28, 2026 •

edited

Loading

codecov bot commented Jan 28, 2026 •

edited

Loading