-
Notifications
You must be signed in to change notification settings - Fork 398
feat(dataloaders): Custom dataloader registry support #2932
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…try' into ori-2907-custom-dataloader-registry
…module / registry big change
for more information, see https://pre-commit.ci
…un, we will later adjust this file
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #2932 +/- ##
==========================================
+ Coverage 80.12% 80.16% +0.04%
==========================================
Files 196 197 +1
Lines 17570 18156 +586
==========================================
+ Hits 14078 14555 +477
- Misses 3492 3601 +109
🚀 New features to boost your workflow:
|
and fix the test for custom dataloaders
…try' into ori-2907-custom-dataloader-registry
Ori, this is not working for me. When I invoke in the notebook: |
Ori, all the examples that I am listing below are run by removing the code ".on_before_batch_transfer()". The way I posted before.
These led me to believe that we are not loading data into GPU memory fast enough.
|
Hi @marianogabitto ,
But im not following on your code, can you share what you are running exactly so we can compare?
I dont think GPU is not utilized, it just that the data load is much slower in tiledb, as you said 100 times slower in that sense. so while with adata the data loading is 1s we see almost continuous use of the GPU and in the tileDB s3 there's a 100sec gap between the same GPU usage, so we mistakenly see it underuse.
|
Ori, Codefrom scvi.dataloaders import DataSplitter scvi.model.SCVI.setup_anndata(adata, batch_key="batch", categorical_covariate_keys=['cell_type', 'donor']) model = scvi.model.SCVI( ad_manager.adata = adata model.train( |
Hi @marianogabitto , Besides that I made several other updates for this PR, census/lamin custom dataloaders should be working now for scvi/scnavi/scarches/load/save/multiGPU/covariates integration |
#3318) Backport PR #2932: Custom dataloader registry support Co-authored-by: Ori Kronfeld <[email protected]>
No description provided.