Skip to content

rep_col_shuffle()? #484

@andrewpbray

Description

@andrewpbray

This semester I've been seeing how far I can get in terms of simulation-based inference without using the main part of the infer package. rep_slice_sample() is all you need to do bootstrapping (and it's also very handy for simulation). I'm curious what y'all think about an analogous function like rep_col_shuffle() (or rep_shuffle_col())?

The motivation here is that the default API for infer is based around the formalism of a NHST. These two functions - rep_slice_sample() and rep_shuffle_col() - would allow users (and teachers) to get to through the generate step without the formalism. This helpful for creating a more porous boundary with other forms of simulation; there would be just two fairly generic mechanistically named functions instead of five functions laser focused on the NHST framework.

In terms of implementation, it looks like generate() takes two paths: rep_slice_sample() for bootstrapping and permute() > permute_once() > permute_col() > sample() for permutations. Seems like the easiest approach would be to just wrap permute().

Thoughts?

Metadata

Metadata

Assignees

No one assigned

    Labels

    featurea feature request or enhancement

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions