Make vendor data more static #32

quininer · 2025-06-17T08:17:06Z

This PR is not yet completed. Its goal is to make browserslist-rs data completely static to reduce resident memory and binary size.

This is a big refactor, so I opened a draft to get early feedback on whether this is a good direction.

Current implemented

use static slice instead of lazy + cell
more use of binary search for .get()
introduction PooledStr to reduce data size (reduce the size of relocation section and compress type from u64*2 to u32*2 )

The refactor of features and region is not yet complete. The data size of these two is quite large, and implement them as code will cause serious compile time regression.
I expected to implement it using include_bytes! and Aligned trick (like https://jack.wrenn.fyi/blog/include-transmute/), which would look a bit ugly, but have good compile time and runtime performance.

generate-data/src/main.rs

g-plane

I just look them at a glance for early feedback.

src/data/mod.rs

src/data/caniuse.rs

generate-data/src/main.rs

quininer · 2025-06-17T13:46:09Z

Also, since the data was converted from json to u32seq binary, our binary size regresses at this point. because json is actually more compact for small numbers. json only uses 3 bytes of extra space for each entry (","). but PooledStr requires 8 bytes per entry for index.

We could use elias-fano encoding as an index to make it smaller than json, but i'm not sure the complexity is worth it. I'm hope that the disadvantage can be offset by more string dedup after implement refactor region mod.

quininer · 2025-06-18T03:38:58Z

By packing PooledStr to 4 bytes, we are now better than json. but this will limit to not allow strings longer than 255. Our strings are mainly browser version numbers, and we are safe as long as no browser publishes version numbers with pi or e.

For reference (without optimizations beyond --release), our .wasm is 1M smaller than before.

commit: ba07c7c
3.5M ../target/wasm32-unknown-unknown/release/browserslist.wasm

commit: 19945b9
4.5M ../target/wasm32-unknown-unknown/release/browserslist.wasm

quininer · 2025-06-18T08:00:22Z

src/data/caniuse.rs


-pub static CANIUSE_GLOBAL_USAGE: &[(&'static str, &'static str, f32)] =
+pub static CANIUSE_GLOBAL_USAGE: &[(PooledStr, PooledStr, f32)] =
    include!("../generated/caniuse-global-usage.rs");

 pub static BROWSER_VERSION_ALIASES: LazyLock<


Well, we are not completely static, we still have two legacy LazyLock. but their data size is not large, so the memory usage is acceptable.

quininer · 2025-06-18T08:29:01Z

I did some simple bench (quininer@e95425a) and I think we have not an order of magnitude regression in performance. I checked that most of our data is retrieval scaled between 10-200, and using binary search at this scale is not significantly slower than hashmap.

"> 0.5%", "last 2 versions", "Firefox ESR", "not dead"
before:
simple query time: [10.756 µs 10.834 µs 10.979 µs]
after:
simple query time: [10.771 µs 10.781 µs 10.791 µs]

"> 0.5%", "last 2 versions", "Firefox ESR", "not dead", "supports objectrtc"
before:
simple query time: [20.729 µs 20.816 µs 20.931 µs]
after
simple query time: [39.584 µs 39.662 µs 39.755 µs]

Currently generate-data uses hashmap data, which cause in a different string order each time. Switch to btreemap will help stabilize this.

g-plane

Thanks!

quininer commented Jun 17, 2025

View reviewed changes

generate-data/src/main.rs Outdated Show resolved Hide resolved

quininer force-pushed the static-map branch from f0f0f45 to 3d9cd30 Compare June 17, 2025 08:34

quininer added 3 commits June 17, 2025 16:37

Update vendor

19945b9

Use static map (basic)

2747d8b

Use static map (caniuse)

4055c5a

quininer force-pushed the static-map branch from 105f74a to 4055c5a Compare June 17, 2025 08:38

g-plane reviewed Jun 17, 2025

View reviewed changes

src/data/mod.rs Outdated Show resolved Hide resolved

src/data/caniuse.rs Outdated Show resolved Hide resolved

Use static map (features)

ee3da24

quininer commented Jun 17, 2025

View reviewed changes

generate-data/src/main.rs Outdated Show resolved Hide resolved

bitpack pooledstr to u32

ba07c7c

Use static map (region)

deb48d2

quininer force-pushed the static-map branch 2 times, most recently from d8500fa to bed014b Compare June 18, 2025 07:44

quininer commented Jun 18, 2025

View reviewed changes

quininer marked this pull request as ready for review June 18, 2025 08:00

quininer added 3 commits June 19, 2025 15:08

Fix clippy

0de564a

Use BTreeMap for generate-data

9a0be6d

Currently generate-data uses hashmap data, which cause in a different string order each time. Switch to btreemap will help stabilize this.

Remove dependencies

cacd833

quininer force-pushed the static-map branch from 517d6c9 to cacd833 Compare June 19, 2025 07:09

g-plane approved these changes Jun 22, 2025

View reviewed changes

g-plane merged commit 526599c into browserslist:main Jun 22, 2025
2 checks passed

quininer deleted the static-map branch June 22, 2025 09:38

quininer changed the title ~~Make vendor data completely static~~ Make vendor data more static Jul 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make vendor data more static #32

Make vendor data more static #32

Uh oh!

quininer commented Jun 17, 2025

Uh oh!

Uh oh!

g-plane left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

quininer commented Jun 17, 2025

Uh oh!

quininer commented Jun 18, 2025

Uh oh!

quininer Jun 18, 2025

Uh oh!

quininer commented Jun 18, 2025 •

edited

Loading

Uh oh!

g-plane left a comment

Uh oh!

Uh oh!

Uh oh!

Make vendor data more static #32

Make vendor data more static #32

Uh oh!

Conversation

quininer commented Jun 17, 2025

Uh oh!

Uh oh!

g-plane left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

quininer commented Jun 17, 2025

Uh oh!

quininer commented Jun 18, 2025

Uh oh!

quininer Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

quininer commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

g-plane left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

quininer commented Jun 18, 2025 •

edited

Loading