Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch log2 #2

Merged
merged 14 commits into from
Apr 30, 2024
Merged

Switch log2 #2

merged 14 commits into from
Apr 30, 2024

Conversation

dlesbre
Copy link
Collaborator

@dlesbre dlesbre commented Apr 26, 2024

Todo: check how this works with negative numbers

@dlesbre dlesbre requested a review from mlemerre April 29, 2024 11:59
@dlesbre
Copy link
Collaborator Author

dlesbre commented Apr 29, 2024

@mlemerre this should be ready to merge following your fix of int_builtins. Some points of note:

  • We now support negative keys, as explained in QuickChecking Patricia Trees by Jan Mitgaard, this requires using an unsigned comparison instead of a signed one.
  • I renamed the min_xxx/max_xxx functions to unsigned_min_xxx to avoid any accidental confusion, but it is an interface breaking change.
  • I've switched the tests to using int as a generator instead of small_nat to test for problems with negatives/large numbers.

README.md Outdated
- Supports generic maps and sets: a `'m map` that maps `'k key` to `('k, 'm) value`.
This is especially useful when using [GADTs](https://v2.ocaml.org/manual/gadts-tutorial.html) for the type of keys. This is also sometimes called a dependent map.
- Allows easy and fast operations across different types of maps and set (e.g.
an intersection between a map and a set), since all sets and maps, no matter their key type, are really positive integer sets or maps.
an intersection between a map and a set), since all sets and maps, no matter their key type, are really integer sets or maps.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not sure what this means (even before the change)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes that wasn't very clear. I rephrased it.

external highest_bit: int -> (int[@untagged]) =
"caml_int_builtin_highest_bit_byte" "caml_int_builtin_highest_bit" [@@noalloc]

let unsigned_lt x y = x - min_int < y - min_int
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you test this one? It is not obvious to me (and it is not the fix of Midtgaard's paper)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll add tests.

I got the idea from NativeInt.unsigned compare in OCaml's standard library.

The fix from Midtgaard's paper only works if we know the arguments are powers of two. This will work for all integer arguments.

Testing in compiler explorer shows OCaml fails to optimize this to an unsigned compare, however using an external C function probably won't improve performance since it prevents inlining.

@mlemerre
Copy link
Contributor

Except for the few commands, sounds good! I don't think the fix would introduces a noticeable slow down (but I am not sure the OCaml compiler can detect that this is just an unsigned comparison. Ideally we would have a primitive for this...). The names are not short but in the spirit of the library!

@dlesbre dlesbre merged commit 5b06e5e into main Apr 30, 2024
3 checks passed
@dlesbre dlesbre deleted the switch_log2 branch April 30, 2024 15:07
@dlesbre dlesbre added the enhancement New feature or request label Apr 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants