-
Notifications
You must be signed in to change notification settings - Fork 97
Replace mfiter loops with the MF parallel for versions #1069
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
be07d0a
to
88f6ea4
Compare
05a9d3f
to
b898303
Compare
e8ae2eb
to
c246dc0
Compare
6ea9655
to
0960107
Compare
a8ae791
to
2100009
Compare
935f54b
to
e586f3e
Compare
858f08f
to
8c85df2
Compare
8c85df2
to
2c3857f
Compare
902aaea
to
ba927e2
Compare
cd8d7d8
to
0abf331
Compare
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days. |
ca660b0
to
c901cf6
Compare
ac6add8
to
c235946
Compare
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days. |
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days. |
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days. |
This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days. |
Summary
Replace mfiter loops with the MF parallel for versions. These take care of a bunch of defaults for use, creates fused parallelfors for the GPU (nominally results in faster code), and generally simplifies the code.
I am looking for volunteers. I have annotated all the mfiter loops that can be simplified. I probably missed some and some might not be doable on a second glance. I did a couple to show how it is done and there are other examples in the code.
Annotations are
// this one
for mfiter loop canditates.Also identified
// needs openmp pragma?
for loops not containing openmp and that maybe should.I am using clang-format 18 to format the code, which causes the formatting check to break. But I am going to leave that alone so we don't waste resources and will fix the formatting later.
Pull request type
Please check the type of change introduced:
Checklist
This PR was tested by running: