-
Notifications
You must be signed in to change notification settings - Fork 82
Description
This semester I've been seeing how far I can get in terms of simulation-based inference without using the main part of the infer package. rep_slice_sample()
is all you need to do bootstrapping (and it's also very handy for simulation). I'm curious what y'all think about an analogous function like rep_col_shuffle()
(or rep_shuffle_col()
)?
The motivation here is that the default API for infer is based around the formalism of a NHST. These two functions - rep_slice_sample()
and rep_shuffle_col()
- would allow users (and teachers) to get to through the generate step without the formalism. This helpful for creating a more porous boundary with other forms of simulation; there would be just two fairly generic mechanistically named functions instead of five functions laser focused on the NHST framework.
In terms of implementation, it looks like generate()
takes two paths: rep_slice_sample()
for bootstrapping and permute()
> permute_once()
> permute_col()
> sample()
for permutations. Seems like the easiest approach would be to just wrap permute()
.
Thoughts?