Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU Out of memory #27

Open
gaushi opened this issue Nov 26, 2021 · 2 comments
Open

GPU Out of memory #27

gaushi opened this issue Nov 26, 2021 · 2 comments

Comments

@gaushi
Copy link

gaushi commented Nov 26, 2021

Hi CJ and Andy,
I ran into out of memory issue with GPU while running bootstraps.
Here is a portion of the log:
2021-11-26 16:42:28.228503: W tensorflow/core/framework/op_kernel.cc:1745] OP_REQUIRES failed at matmul_op_impl.h:681 : RESOURCE_EXHAUSTED: OOM when allocating tensor with shape[387840,256] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc

Could you please suggest a solution? When I run with different seeds (and no bootstraps) Locator works just fine.

Thanks,
Best,
Gautam

@andrewkern
Copy link
Member

Hi @gaushi -- huh that's strange. it seems to me like a potential memory leak in either our code or tensorflow itself. do you have a minimal working example that you could share so that we can debug this?

@andrewkern
Copy link
Member

also it would be helpful to know the versions of everything you have installed-- pip list should do it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants