Open
Description
Requirements:
- we want to create a snapshot of the TEI corpus for the palaeographic annotation purpose
- the original branch will keep being edited over time
- the snapshot will remain the same, ensuring that existing annotations are not invalidated
- ideally we'd like to only snapshot a subset of the corpus, then gradually add more files to it from the main branch. This matches the batch upload currently done on a regular basis to the annotating environment.
- we may want to apply corrections from the main branch to the snapshot, selectively to some files
- we may also want to update the entire snapshot at some point int the future to realign it with the main branch; this may introduce breaking changes to the annotations (e.g. change in token ID or text structure)
- important assumption that all changes to a snapshot always come from the main branch. There is no direct manual edit to the snapshot (e.g. applying correction manually or adding more information to the file). The snapshot is read-only.