Add ability to utilize checkpoints from local run #1361

@ilongin

Users should be able to use the new checkpoints feature with the basic datachain query command (or a similar one).

If we go with datachain query, here is the implementation plan.
Currently this command creates a Job instance whose name is the name of the script (without the path).
We should change this behavior to:

  1. Use the full path of the script as the Job name
  2. Try to find an existing Job with the same name and without a child (this means the script is being re-run) and, if found, set it as the parent of the newly created Job
  3. Add a new flag called --reset (maybe there is a better name) that ignores existing checkpoints and runs the script as if it were being run for the first time
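
The three steps above could be sketched roughly as follows. This is only an illustration of the lookup logic, not DataChain's actual code: the Job dataclass, InMemoryJobStore, find_last_childless and create_job are hypothetical names, and a real implementation would query the metastore instead of a Python list.

```python
from __future__ import annotations

import os
import uuid
from dataclasses import dataclass, field


@dataclass
class Job:
    # Hypothetical stand-in for the real Job model; the fields are assumptions.
    name: str
    parent_id: str | None = None
    id: str = field(default_factory=lambda: uuid.uuid4().hex)


class InMemoryJobStore:
    """Toy job store used only to illustrate the plan (not the real metastore)."""

    def __init__(self) -> None:
        self.jobs: list[Job] = []

    def find_last_childless(self, name: str) -> Job | None:
        # A job "has a child" if some other job points to it as its parent.
        parent_ids = {job.parent_id for job in self.jobs if job.parent_id}
        # Walk from newest to oldest so the most recent run wins.
        for job in reversed(self.jobs):
            if job.name == name and job.id not in parent_ids:
                return job
        return None

    def create_job(self, script: str, reset: bool = False) -> Job:
        # Step 1: use the full path of the script as the job name.
        name = os.path.abspath(script)
        # Steps 2 and 3: link to the previous run unless --reset was given.
        parent = None if reset else self.find_last_childless(name)
        job = Job(name=name, parent_id=parent.id if parent else None)
        self.jobs.append(job)
        return job
```

In this sketch, a second create_job("run.py") call gets the first job as its parent, while create_job("run.py", reset=True) starts a fresh chain with no parent.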

Example usage

datachain query run.py          # some error happens
datachain query run.py          # continues from where the last run stopped, using that run's checkpoints if they exist
datachain query run.py --reset  # runs the script from scratch, ignoring existing checkpoints
