Use sets rather than lists in compiler types #247

ppedrot · 2024-07-24T14:03:48Z

We still have to keep the order for some reason, but at least the filtering process checks for membership in O(log n) rather than O(n). Not sure this matters in practice but at least we are not calling the generic structural equality on potentially arbitrary data.

ppedrot · 2024-07-24T14:05:08Z

@gares the merge_type function is a hotspot of HB for some reason. This PR doesn't solve the underlying efficiency problem, but 1. it makes it algorithmically more reasonable 2. it abstracts aways the implementation and 3. it doesn't rely anymore on structural equality.

ppedrot · 2024-07-24T14:06:15Z

(Full disclaimer: I have no idea what purpose this list of types is supposed to have, I am just messing with the code and hinting at a suspect point.)

gares · 2024-07-24T15:07:05Z

src/compiler.ml

+  if set' == t.set && lst' == t.lst && def' == t.def then t
+  else { set = set'; lst = lst'; def = def' }
+
+let append x t = {


isn't this cons ?

gares · 2024-07-24T15:08:47Z

I'd like to understand why you need ord all over the place.
Also ord for constant is risky, does it do compare x y = x - y or equivalent? I'd rather write that one by hand and be sure it is efficient (no function call)

gares · 2024-07-24T15:10:51Z

merge_type function is a hotspot of HB for some reason. This PR doesn't solve the underlying efficiency problem, but 1. it makes it algorithmically more reasonable 2. it abstracts aways the implementation and 3. it doesn't rely anymore on structural equality.

Thanks your change makes a lot of sense.

At the same time, I don't get why it should be a bottleneck. In which file?
These are the list of types, as declared by the user. If I merge these lists over and over, then I should avoid that. I mean, they should be merged once and forall early in the compilation chain.

TBH, the compilation chain will require some serious scrutiny and speedup, scheduled this fall.

ppedrot · 2024-07-24T20:18:20Z

At the same time, I don't get why it should be a bottleneck. In which file?

The first HB calls in e.g. mathcomp-analysis/lebesgue_* files. There merging types account for ~70% of the runtime of the HB call. But I've seen it in other places. I'm not sure that the problem is the list proper, the merging of map itself seems to be costly as well. The typical call stack is in Compiler.Assemble.assemble, there is a call to ToDBL.merge_types there that is the main root of the issue. Rereading this piece of code, I'm assuming there are many collisions there because of poor algorithmics.

gares · 2024-07-25T11:07:10Z

I'm looking into this, it is very weird this is expensive since most of these maps should be empty

gares · 2024-07-25T11:43:26Z

I did push a commit. I guess you have a setup where you can bench this. If not I will do it myself.

In my local tests the list/sets of types are of size 1 or 2, so anything would do.
What was insane was to use Map.merge instead of Map.union.
I'm still puzzled you got a hot spot in that branch (the one calling Types.merge) since it is called very rarely, as far as I can tell. Could it be the case you saw a merge_type and you optimized the wrong one?

gares · 2024-07-25T11:49:05Z

New make tests ONLY=sepcomp_perf

OK       sepcomp_perf1          0.23   0.00   0.23   55.1M  dune
OK       sepcomp_perf2          0.21   0.00   0.21   54.9M  dune
OK       sepcomp_perf3          0.83   0.00   0.83  275.3M  dune
OK       sepcomp_perf4          1.61   0.00   1.61  486.6M  dune

old

OK       sepcomp_perf1          0.27   0.00   0.27   56.6M  dune
OK       sepcomp_perf2          0.24   0.00   0.24   55.2M  dune
OK       sepcomp_perf3          1.00   0.00   1.00  270.7M  dune
OK       sepcomp_perf4          1.95   0.00   1.95  487.5M  dune

ppedrot · 2024-07-25T12:00:56Z

Could it be the case you saw a merge_type and you optimized the wrong one?

No, I'm confident this is the precise stack call I mentioned before.

In any case, your last commit has indeed solved the issue on mathcomp-analysis.

SkySkimmer · 2024-07-27T14:47:14Z

src/compiler.ml

+  let fold t accu =
+    let t' = f t in
+    if t' == t then accu
+    else Set.add t' (Set.remove t accu)
+  in
+  let set' = Set.fold fold t.set t.set in


ocaml sets have https://github.com/ocaml/ocaml/blob/cea4b6d84161e3d35eb6966bdbfaa225de44832c/stdlib/set.mli#L209-L219 since 4.04

ppedrot added 2 commits July 17, 2024 23:39

Derive ordering functions.

74e1c21

gares reviewed Jul 24, 2024

View reviewed changes

[map] use union instead of merge

1559702

changelog

272b3e5

gares merged commit b0b0d6c into LPCIC:master Jul 25, 2024
7 of 8 checks passed

ppedrot deleted the saner-merge-types branch July 25, 2024 13:28

SkySkimmer reviewed Jul 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use sets rather than lists in compiler types #247

Use sets rather than lists in compiler types #247

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

gares Jul 24, 2024

Uh oh!

gares commented Jul 24, 2024

Uh oh!

gares commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

gares commented Jul 25, 2024 •

edited

Loading

Uh oh!

gares commented Jul 25, 2024

Uh oh!

gares commented Jul 25, 2024

Uh oh!

ppedrot commented Jul 25, 2024

Uh oh!

Uh oh!

SkySkimmer Jul 27, 2024

Uh oh!

Uh oh!

Use sets rather than lists in compiler types #247

Use sets rather than lists in compiler types #247

Uh oh!

Conversation

ppedrot commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

gares Jul 24, 2024

Choose a reason for hiding this comment

Uh oh!

gares commented Jul 24, 2024

Uh oh!

gares commented Jul 24, 2024

Uh oh!

ppedrot commented Jul 24, 2024

Uh oh!

gares commented Jul 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gares commented Jul 25, 2024

Uh oh!

gares commented Jul 25, 2024

Uh oh!

ppedrot commented Jul 25, 2024

Uh oh!

Uh oh!

SkySkimmer Jul 27, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gares commented Jul 25, 2024 •

edited

Loading