-
Notifications
You must be signed in to change notification settings - Fork 18
Open
Labels
Description
is it possible to recover some of the goodness of memoization in dplyr.spark? The reason is:
Big data operations can be costly, in $$ and time. When programming interactively, one may run a program, inspect the results, add another step, inspect the results and so on. The computation from the previous steps should not be repeated, but space should also be used carefully.