Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added support for OpenBLAS #240

Closed
wants to merge 2 commits into from
Closed

Conversation

rdentato
Copy link
Contributor

@rdentato rdentato commented Aug 6, 2023

OpenBLAS performs slightly worse than openmp on my machine.
Nevertheless is around a 5x improvement over the plain CPU run.

@karpathy
Copy link
Owner

karpathy commented Aug 6, 2023

If OpenBLAS is worse then why add it

@xefoci7612
Copy link

OpenBLAS performs slightly worse than openmp on my machine. Nevertheless is around a 5x improvement over the plain CPU run.

@rdentato can you make it work in multi-thread? Does the cblas_sgemv() call start parallel threads to do the work? If not, I'm afraid it's a hard sell...

I don't know BLAS, so I can't help on this one.

@rdentato
Copy link
Contributor Author

rdentato commented Aug 7, 2023

I think this is worthwile to investigate further if anyone confirms that there are cases where openmp is not available and openBLAS is.
Otherwise, I would just rely on OpenMP and discard the PR.

@karpathy
Copy link
Owner

no gain no pain

@karpathy karpathy closed this Aug 10, 2023
@rdentato rdentato deleted the patch-openBLAS branch August 14, 2023 07:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants