Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suggested improvements to example scripts involving ML model training #559

Open
bwohlberg opened this issue Oct 7, 2024 · 0 comments
Open
Assignees
Labels
examples Pertaining to SCICO examples improvement Improvement of existing code, including addressing of omissions or inconsistencies

Comments

@bwohlberg
Copy link
Collaborator

The ML model example scripts could be improved by addressing the following two issues:

  1. When updating a notebook from an example script using makenotebooks.py, the generated notebook will not include the training run details if the script has previously been run (unless the cached trained model is manually deleted, the need for which is easy to overlook). Some mechanism is needed for signalling to the script/notebook that the cached trained model should be ignored. Perhaps the simplest approach would be to check an environment variable, e.g. SCICO_ML_RETRAIN.
  2. The example scripts that use cached training data fail when any of the fixed parameters involved in the data generation are changed. It would be better to include all such parameters in the cache filename (or to include a hash generated from these parameters in the filename) to avoid attempts to load cached data that doesn't match the requirements for a specific run.
@bwohlberg bwohlberg added examples Pertaining to SCICO examples improvement Improvement of existing code, including addressing of omissions or inconsistencies labels Oct 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
examples Pertaining to SCICO examples improvement Improvement of existing code, including addressing of omissions or inconsistencies
Projects
None yet
Development

No branches or pull requests

2 participants