Skip to content

Add NEON float16 multi-vectors to native aliases #1291

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

stellar-aria
Copy link

Half-precision multi-vector types are available on the VFPv3_fp16 and VFPv4 architectures for ARMv7

@mr-c
Copy link
Collaborator

mr-c commented Apr 1, 2025

Thank you @stellar-aria for this PR. I see nothing objectionable, but it would be better if these new types were used elsewhere in SIMDe, so we could test that the various compilers work with them properly.

Do you have any suggestions?

@stellar-aria
Copy link
Author

The only functions that really use the types are the interleaved load/store intrinsics (i.e.
vld[n]_f16, vld[n]_dup_f16, vld[n]_lane_f16 and vst[n]_f16, where n is the stride between 2 and 4)

How are the other native aliases used/tested? Just attempting to have functions declared that used the types (with the native aliases enabled) was enough to get compiler errors for me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants