
[FEATURE] Delta Lake Tests Not Triggered in CI for Relevant PRs #13134

@razajafri


Summary:
Our CI should detect PRs that modify Delta Lake modules and automatically run the relevant Delta Lake tests before merge. This minor change would close a significant coverage gap, improving project stability and developer experience.

Problem Statement:
Currently, our CI does not automatically run Delta Lake tests before a pull request is merged, even when the PR changes Delta Lake–related components (such as files in delta-xxx modules). This increases the risk of merging bugs or regressions, reduces confidence in PR validation, and creates extra work when failures are only discovered after the merge, when the nightly jobs fail.

Suggested Solution:
We can significantly improve our test coverage and reliability by updating our CI configuration to:

Automatically detect PRs that affect any delta-xxx module

This can be done by checking the paths of the files modified in the PR against patterns like delta-*. We already do something similar for Databricks: the Databricks pipelines are triggered when a Databricks change is detected in a PR.

Trigger Delta Lake tests in CI for the relevant Spark/Delta version(s) if such changes are detected.
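
As a rough illustration of the detection step, here is a minimal Python sketch. It assumes the CI job can obtain the list of changed file paths for the PR, and that a "Delta Lake change" means any path with a directory component matching delta-*. The function name touches_delta_module and the constant DELTA_MODULE_PATTERN are hypothetical and not part of the existing CI scripts; the real module layout may call for a different pattern.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Assumed pattern, taken from the "delta-*" wording in this issue.
# Adjust it to match the real module directory names.
DELTA_MODULE_PATTERN = "delta-*"


def touches_delta_module(changed_files, pattern=DELTA_MODULE_PATTERN):
    """Return True if any changed path lies under a directory matching the pattern."""
    for path in changed_files:
        # Only directory components are checked; the file name itself is ignored,
        # so a file such as docs/delta-notes.md does not count as a module change.
        directories = PurePosixPath(path).parts[:-1]
        if any(fnmatchcase(part, pattern) for part in directories):
            return True
    return False
```

The list of changed files could come from something like `git diff --name-only` against the PR's base branch. The CI job would then run the Delta Lake tests only when this check returns True.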

Benefits:
Prevents regressions: Always verifies Delta Lake compatibility before merge.

Increases confidence: Contributors can see test results in their PR.

Reduces follow-up work: Avoids post-merge fixes and revert cycles.
