Sampling #1465
Unanswered
dharik-arsath
asked this question in
Q&A
Replies: 1 comment
-
There is no such option out of the box at the moment. It is indeed something to think about, also for things like a train/test split. If you really need it, you can do it manually: filter the dataframe for one label, subsample it, then do the same for the next label, and so on. At the end, concatenate all the subsampled dataframes. You can export the final dataframe to improve performance. I will think about how to approach this properly. Ideas/PRs are welcome!
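A minimal sketch of the manual approach, not an official vaex feature. It is a variant of the steps above: instead of building one filtered dataframe per label and concatenating them, it computes the sampled row indices per label with NumPy and takes those rows in one go, so only the label column is evaluated into memory. The function names `stratified_indices` and `stratified_sample`, the `frac` and `seed` parameters, and the "at least one row per label" rule are assumptions made for illustration. The second function assumes `df` is an unfiltered vaex DataFrame.

```python
import numpy as np

def stratified_indices(labels, frac, seed=0):
    """Return sorted row indices holding roughly `frac` of the rows of each label."""
    labels = np.asarray(labels)
    rng = np.random.default_rng(seed)
    picked = []
    for value in np.unique(labels):
        rows = np.flatnonzero(labels == value)    # rows belonging to this label
        n = max(1, int(round(frac * len(rows))))  # assumption: keep at least one row per label
        picked.append(rng.choice(rows, size=n, replace=False))
    return np.sort(np.concatenate(picked))

def stratified_sample(df, label, frac, seed=0):
    """Hypothetical helper: assumes `df` is an unfiltered vaex DataFrame."""
    # Only the label column is evaluated into memory here.
    idx = stratified_indices(df[label].values, frac, seed)
    return df.take(idx)
```

The result of `stratified_sample(df, "label", 0.1)` could then be exported, for example with `export_hdf5`, so later work runs on the smaller file rather than on the original data.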
-
Hi, imagine I have a very big dataset. Is there a way to do stratified sampling in vaex ml, so that I have less data to work with?
Note: the stratified sampling has to be memory efficient.