-
Notifications
You must be signed in to change notification settings - Fork 257
Description
Summary:
Our CI should detect PRs that modify Delta Lake modules and automatically run the relevant Delta Lake tests before merge. This minor change would close a significant coverage gap, improving project stability and developer experience.
Problem Statement:
Currently, our CI does not automatically run Delta Lake tests before a pull request is merged, even when the PR includes changes to Delta Lake–related components (such as files in delta-xxx modules). This adds risk of bugs or regressions being merged, reduces confidence in PR validation, and creates additional overhead if failures are only discovered after the merge by the nightly jobs failing.
Suggested Solution:
We can significantly improve our test coverage and reliability by updating our CI configuration to:
Automatically detect PRs that affect any delta-xxx module
This can be done by checking the file paths of modified files in the PR for patterns like delta-*. We are already doing something similar when triggering Databricks pipelines when we detect a Databricks change in a PR
Trigger Delta Lake tests in CI for the relevant Spark/Delta version(s) if such changes are detected.
Benefits:
Prevents regressions: Always verifies Delta Lake compatibility before merge.
Increases confidence: Contributors can see test results in their PR.
Reduces follow-up work: Avoids post-merge fixes and revert cycles.