Use `NarrowNative` for narrowing operations #116539

xtqqczze · 2025-06-11T15:20:06Z

This PR introduces a new helper method, NarrowNative, to handle narrowing operations more consistently. It replaces prior logic (previously using ExtractAsciiVector) with this unified implementation.

Once #118583 is completed, the conditional compilation blocks can be removed, using the unified implementation for all targets.

https://csharp.godbolt.org/z/dc1zn9jMb

EgorBo · 2025-06-11T20:36:13Z

Wonder if that extra AND is a big deal..

Perhaps, it should be called NarrowUnsafe ? and moved to Vector128 as an internal helper. cc @tannergooding

dotnet-policy-service · 2025-06-28T14:22:54Z

Tagging subscribers to this area: @dotnet/area-system-text-encoding
See info in area-owners.md if you want to be subscribed.

tannergooding · 2025-06-30T17:13:12Z

Wonder if that extra AND is a big deal..

👍

I don't expect this to have any significant impact and we'd overall prefer less private/internal helpers where possible.

I think we should just keep it as is and live with the single extra and instruction; likely moving other APIs to do the same over time.

Perhaps, it should be called NarrowUnsafe

I don't think this is one that would be good for an unsafe/estimate API as we typically only use such APIs for handling things that live as undefined behavior. Narrowing basically comes down to only having well-defined behavior (truncation or saturation). I don't think we'd necessarily want one that does one or the other based on whatever is "fastest".

For many of the cases using ExtractToAsciiVector, we could even extend the existing range support to allow implicit elision of the and (because it's doing something like a if (x >= 0x80) { return; } path already)

jeffhandley

@tannergooding This looks good to me now. Can you re-review please?

tannergooding · 2025-09-17T20:20:08Z

@xtqqczze do you have any perf numbers?

The above diff report is showing no real change to the codegen (before and after are the same, just with after having an additional API now for Vector<T>)

This adds some complexity to the codebase that doesn't seem desirable long term.

xtqqczze · 2025-09-17T23:12:30Z

The above diff report is showing no real change to the codegen (before and after are the same, just with after having an additional API now for Vector<T>)

This is expected as the changes just create a unified implementation rather then having logic in different places doing the same thing.

I think this could be marked no-merge until #118583 is completed as the changes could then be considerably simplified.

tannergooding · 2025-10-24T16:01:27Z

src/libraries/Common/src/System/HexConverter.cs

                    Vector128<ushort> vec2 = Vector128.LoadUnsafe(ref srcRef, offset + (nuint)Vector128<ushort>.Count).AsUInt16();

-                    vec = Ascii.ExtractAsciiVector(vec1, vec2);
+                    vec = Vector128.NarrowNative(vec1, vec2);


Can you get numbers if you just use Narrow instead of creating a new NarrowNative?

xtqqczze · 2025-10-24T16:30:47Z

Wonder if that extra AND is a big deal..

Diff between Vector128.Narrow and Vector128.NarrowNative:

; Emitting BLENDED_CODE for generic X64 + VEX on Unix
+        vmovaps  xmm0, xmmword ptr [rsp+0x08]
+        vpackuswb xmm0, xmm0, xmmword ptr [rsp+0x18]
-        vbroadcastss xmm0, dword ptr [reloc @RWD00]
-        vpand    xmm1, xmm0, xmmword ptr [rsp+0x08]
-        vpand    xmm0, xmm0, xmmword ptr [rsp+0x18]
-        vpackuswb xmm0, xmm1, xmm0
         vmovups  xmmword ptr [rdi], xmm0
         mov      rax, rdi
+ 						;; size=19 bbWeight=1 PerfScore 7.25
- 						;; size=32 bbWeight=1 PerfScore 10.25

tannergooding · 2025-10-24T16:42:11Z

The question is more whether or not the potential perf difference there matters outside of micro-benchmarks

We're speculating that it's unlikely, particularly given newer hardware that has non saturating narrowing operations.

xtqqczze · 2025-10-24T17:13:56Z

Vector128.Narrow codegen looks worse on x86-64-v4:

; Emitting BLENDED_CODE for generic X64 + VEX + EVEX on Unix
+       vmovaps  xmm0, xmmword ptr [rsp+0x08]
+       vpackuswb xmm0, xmm0, xmmword ptr [rsp+0x18]
-       vmovaps  xmm0, xmmword ptr [rsp+0x08]
-       vpmovwb  xmm0, xmm0
-       vmovaps  xmm1, xmmword ptr [rsp+0x18]
-       vpmovwb  xmm1, xmm1
-       vmovlhps xmm0, xmm0, xmm1
        vmovups  xmmword ptr [rdi], xmm0
        mov      rax, rdi
+						;; size=19 bbWeight=1 PerfScore 7.25
-						;; size=35 bbWeight=1 PerfScore 13.25

tannergooding · 2025-10-24T18:39:29Z

That looks like a codegen issue. The two vectors are neighbors so it should be generating vmovups ymm0, ymmwork ptr [rsp+0x08]; vpmovmwb xmm0, ymm0 or similar

But that's also a bit beside the point. "Looks worse" doesn't actually mean "performs worse". What matters here is the real world perf for typical scenarios (not microbenchmarks). Without data showing otherwise, the presumption is that the slightly worse codegen doesn't matter and can be improved over time where it isn't doing the "right thing".

tannergooding · 2025-11-06T18:25:33Z

I'm going to say this is one we don't want to take.

We'd prefer to just use Narrow and then fix the codegen issue that was listed.

Thanks for the PR and locating the other codegen issue that we do want to resolve

Use ExtractAsciiVector instead of Vector128.Narrow

a351adc

github-actions bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Jun 11, 2025

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jun 11, 2025

teo-tsirpanis added area-System.Text.Encoding and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Jun 28, 2025

tarekgh requested a review from tannergooding June 28, 2025 17:11

tarekgh added this to the 10.0.0 milestone Jun 28, 2025

Add vector NarrowNative

79745ef

xtqqczze changed the title ~~Use ExtractAsciiVector for narrowing operations~~ Use NarrowNative for narrowing operations Jul 10, 2025

Merge branch 'main' into use-extractasciivector

a1a3880

build-analysis bot mentioned this pull request Jul 16, 2025

Android tests timing out #117669

Closed

Merge branch 'main' into use-extractasciivector

41c41d9

This was referenced Jul 29, 2025

Crash dump collection fails due to "Error writing data to dump file: No space left on device" dotnet/dnceng#5944

Closed

TimeZoneInfoTests.NoBackwardTimeZones tests are failing on Android #117731

Open

jeffhandley approved these changes Sep 1, 2025

View reviewed changes

jeffhandley assigned tannergooding Sep 1, 2025

jeffhandley modified the milestones: 10.0.0, 11.0.0 Sep 1, 2025

MihuBot mentioned this pull request Sep 1, 2025

[JitDiff X64] [xtqqczze] Use NarrowNative for narrowing operations MihuBot/runtime-utils#1423

Open

Merge branch 'main' into use-extractasciivector

035075e

This was referenced Sep 17, 2025

[JitDiff X64] [xtqqczze] Use NarrowNative for narrowing operations MihuBot/runtime-utils#1502

Open

[JitDiff ARM64] [xtqqczze] Use NarrowNative for narrowing operations MihuBot/runtime-utils#1503

Open

tannergooding reviewed Oct 24, 2025

View reviewed changes

tannergooding closed this Nov 6, 2025

Use NarrowNative for narrowing operations #116539

Use NarrowNative for narrowing operations #116539

Uh oh!

Conversation

xtqqczze commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

EgorBo commented Jun 11, 2025

Uh oh!

dotnet-policy-service bot commented Jun 28, 2025

Uh oh!

tannergooding commented Jun 30, 2025

Uh oh!

jeffhandley left a comment

Choose a reason for hiding this comment

Uh oh!

tannergooding commented Sep 17, 2025

Uh oh!

xtqqczze commented Sep 17, 2025

Uh oh!

tannergooding Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

xtqqczze commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tannergooding commented Oct 24, 2025

Uh oh!

xtqqczze commented Oct 24, 2025

Uh oh!

tannergooding commented Oct 24, 2025

Uh oh!

tannergooding commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Use `NarrowNative` for narrowing operations #116539

Use `NarrowNative` for narrowing operations #116539

xtqqczze commented Jun 11, 2025 •

edited

Loading

xtqqczze commented Oct 24, 2025 •

edited

Loading

tannergooding commented Nov 6, 2025 •

edited

Loading