Dont require unused dataloaders #179

scottcanoe · 2025-02-13T19:15:55Z

This PR removes the need to specify unused data loader classes and args in experiment configs.

Motivation

Monty experiment configs currently require specifying both training and eval data loaders, regardless of whether both are needed. This means we end up with loads of lines like

some_pretraining_config = dict(
    .
    .
    .
    eval_dataloader_class=ED.InformedEnvironmentDataLoader,  # just placeholder
    eval_dataloader_args=get_env_dataloader_per_object_by_idx(start=0, stop=1),
)

or

some_eval_config = dict(
    .
    .
    .
    # required but unused
    train_dataloader_class=ED.InformedEnvironmentDataLoader,
    train_dataloader_args=get_env_dataloader_per_object_by_idx(start=0, stop=10),
)

This adds clutter and may also be misleading, especially to newcomers. This is a small PR that improves quality of life and allows us to write cleaner, less confusing experiment configs.

Changes

MontyExperiment.load_dataset_and_dataloaders was modified so that a train or eval dataloader is only initialized if the experiment config requires it.
This change was tested by removing unused data loaders from benchmark configs. As such, eval dataloader configs have been removed from benchmark pretraining experiments, and train dataloader configs have been removed from benchmark eval experiments. I reran all pretraining experiments, a couple of experiments from each of the long and short YCB experiment lists, all unsupervised experiments, and a few monty-meets-world experiments. As expected, all configs run as normal.
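The conditional initialization described above can be illustrated with a minimal sketch. This is not the actual body of MontyExperiment.load_dataset_and_dataloaders: the function name `build_dataloaders`, the `ToyDataLoader` stand-in, and the `do_train` / `do_eval` flags under `experiment_args` are assumptions made for illustration only.

```python
from typing import Any, Dict, Optional, Tuple


class ToyDataLoader:
    """Hypothetical stand-in for a real dataloader class."""

    def __init__(self, **kwargs: Any) -> None:
        self.kwargs = kwargs


def build_dataloaders(
    config: Dict[str, Any],
) -> Tuple[Optional[Any], Optional[Any]]:
    """Create only the dataloaders the experiment will actually use.

    Assumes (hypothetically) that the config carries `do_train` and
    `do_eval` flags under `experiment_args`.
    """
    exp_args = config.get("experiment_args", {})
    train_loader = None
    eval_loader = None

    # The train dataloader keys are only read when training is requested,
    # so an eval-only config can omit them entirely.
    if exp_args.get("do_train", False):
        train_loader = config["train_dataloader_class"](
            **config["train_dataloader_args"]
        )

    # Likewise, the eval dataloader keys are only read when evaluating.
    if exp_args.get("do_eval", False):
        eval_loader = config["eval_dataloader_class"](
            **config["eval_dataloader_args"]
        )

    return train_loader, eval_loader


# A pretraining-style config with no eval dataloader placeholder.
pretraining_config = dict(
    experiment_args=dict(do_train=True, do_eval=False),
    train_dataloader_class=ToyDataLoader,
    train_dataloader_args=dict(start=0, stop=10),
)
train_loader, eval_loader = build_dataloaders(pretraining_config)
```

In this sketch a config that requests evaluation but omits the eval dataloader keys still fails with a KeyError, so a dataloader that is genuinely needed cannot be silently skipped.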


tristanls

I'm not a reviewer, but I was curious, and this looks great, thank you.

suggestion: I suggest this should be a feat: commit (which introduces a new feature to the codebase, the feature of not having to specify extraneous things; you can make it a feat: commit by prefixing the commit message with feat: ; more on this suggested commit prefix here).

scottcanoe · 2025-02-13T21:57:16Z

> suggestion: I suggest this should be a feat: commit (which introduces a new feature to the codebase, the feature of not having to specify extraneous things; you can make it a feat: commit by prefixing the commit message with feat: ; more on this suggested commit prefix here).

Thanks @tristanls! I wasn't aware of this convention. If I'm understanding it correctly, only the commit that introduced the change (in MontyExperiment.load_dataset_and_dataloaders) would get the feat: prefix, while the commits that make use of it (i.e., changes to benchmark configs) would be left without a prefix. Is that right?

nielsleadholm

Beautiful, thanks for making this change!

tristanls · 2025-02-14T15:19:39Z

> Thanks @tristanls! I wasn't aware of this convention. If I'm understanding it correctly, only the commit that introduced the change (in MontyExperiment.load_dataset_and_dataloaders) would get the feat: prefix, while the commits that make use of it (i.e., changes to benchmark configs) would be left without a prefix. Is that right?

@scottcanoe, the convention is a suggestion. We don't enforce it here. But, if we were to follow it, then every commit would get a prefix. For a commit like this, out of fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test: I think the most likely candidates would be: feat:, chore:, or refactor:. chore: is along the lines of maintaining dependency versions, updating tooling functionality. That doesn't seem to fit here, so we'd be left with feat: or refactor:. refactor: is typically reserved for a code change that neither fixes a bug nor adds a feature. Now, the reason I don't think it is refactor: is because this pull request adds a feature. I don't think of features as code to be written. I think of features from the user's perspective; the user's experience changed due to this code. So, this fits into a feature because a previously required parameter is now optional (the unused config). If we were making announcements, we would announce that this parameter is no longer required for our users. This is why I suggested the feat: prefix. There's also a notion of breaking changes, which adds ! to the prefix, but this is not a breaking change.
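To make the prefix convention concrete, here is a small sketch that parses a commit header of the form `type(scope)!: description`. The regular expression, the `CommitHeader` tuple, and the `parse_header` helper are hypothetical names written for this illustration; they are not part of this repository or of any official tooling, and the pattern covers only the header line, not commit bodies or footers.

```python
import re
from typing import NamedTuple, Optional

# type, optional (scope), optional "!" for a breaking change, then ": description"
HEADER_RE = re.compile(
    r"^(?P<type>[a-z]+)"
    r"(?:\((?P<scope>[^)]+)\))?"
    r"(?P<breaking>!)?"
    r": (?P<description>.+)$"
)


class CommitHeader(NamedTuple):
    type: str
    scope: Optional[str]
    breaking: bool
    description: str


def parse_header(header: str) -> Optional[CommitHeader]:
    """Return the parsed header, or None if it has no recognizable prefix."""
    match = HEADER_RE.match(header)
    if match is None:
        return None
    return CommitHeader(
        type=match["type"],
        scope=match["scope"],
        breaking=match["breaking"] is not None,
        description=match["description"],
    )


# A prefixed header parses into its parts; an unprefixed one yields None.
prefixed = parse_header("feat: don't require unused dataloaders")
unprefixed = parse_header("Update ycb_experiments.py")
```

Under this sketch, a header such as `feat!: ...` would be reported with `breaking=True`, matching the note above that `!` marks a breaking change.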

vkakerbeck · 2025-02-14T16:22:44Z

I know it's already been merged but just wanted to say this is a really nice change! Thanks for adding :)

scottcanoe added 8 commits February 12, 2025 16:13

Logic for conditional dataloader initialization (b4c60b1)
Update pretraining_experiments.py (d7a65dd): Remove eval dataloaders from configs
Update ycb_experiments.py (2c0b909): Remove train dataloaders (except for unsupervised experiments).
Update pretraining_experiments.py (2f2a983): Remove unneeded import
Update ycb_experiments.py (42992b5): Remove unneeded import
Update monty_world_habitat_experiments.py (1bf1640): Remove train dataloader and unneeded import
Update monty_world_experiments.py (58e2afd): Remove train dataloaders
Update ycb_experiments.py (4dcedba)

scottcanoe added the enhancement ("New feature or request") label Feb 13, 2025

scottcanoe requested review from vkakerbeck and nielsleadholm February 13, 2025 19:15

scottcanoe assigned nielsleadholm Feb 13, 2025

scottcanoe added the triaged ("This issue or pull request was triaged") label and removed the enhancement label Feb 13, 2025

tristanls approved these changes Feb 13, 2025

nielsleadholm approved these changes Feb 14, 2025

Merge branch 'main' into dont_require_unused_dataloaders

ce18140

scottcanoe merged commit bb942fc into thousandbrainsproject:main Feb 14, 2025
13 checks passed

scottcanoe deleted the dont_require_unused_dataloaders branch February 14, 2025 15:06
