Implement Automatic Bucket Creation and Data Transfer in with_data API #4255

Open
andylizf opened this issue Nov 4, 2024 · 0 comments

andylizf commented Nov 4, 2024

  1. Automatic Bucket Creation in with_data API

    • Building on #4254 (Implement with_data API for Edge-Based Data Flow in Task DAGs), implement logic within the with_data API to automatically create a storage bucket if one does not already exist.
    • Set up mappings so that the upstream task’s output path is automatically linked to this new bucket, simplifying the data transfer setup for users.
  2. Data Transfer to Bucket and Downstream Access

    • Implement the mechanism to:
      • Transfer data from the upstream task’s output path to the automatically created bucket.
      • Configure the downstream task’s input path to retrieve data directly from this bucket, ensuring seamless access without manual setup.
  3. Documentation and Examples

    • Update documentation to explain the automatic bucket creation and data transfer process:
      • Provide examples showing how with_data can be configured to handle both bucket creation and data flow.
      • Outline any requirements or configurations for customizing bucket settings (e.g., region, permissions).
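
To make the intended flow concrete, here is a minimal, in-memory sketch of the three steps above: create the bucket if it is missing, map the upstream output path to it, and expose the contents at the downstream input path. Everything in it is an assumption for illustration: `BucketStore`, `DataEdge`, `bucket_name_for`, `transfer`, `downstream_view`, and the `with_data` signature are hypothetical and do not reflect the actual API from #4254 or any real cloud storage client.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class BucketStore:
    """In-memory stand-in for a cloud object store (hypothetical)."""
    buckets: dict = field(default_factory=dict)

    def ensure_bucket(self, name: str) -> bool:
        """Create the bucket if missing; return True only when newly created."""
        if name in self.buckets:
            return False
        self.buckets[name] = {}
        return True


@dataclass
class DataEdge:
    """Hypothetical record of one upstream -> downstream data dependency."""
    upstream: str
    downstream: str
    source_path: str  # output path on the upstream task
    target_path: str  # input path on the downstream task
    bucket: str = ""


def bucket_name_for(upstream: str, downstream: str, source_path: str) -> str:
    """Derive a stable bucket name so repeated calls reuse the same bucket."""
    key = f"{upstream}:{downstream}:{source_path}".encode()
    return "with-data-" + hashlib.sha256(key).hexdigest()[:8]


def with_data(store: BucketStore, upstream: str, downstream: str,
              source_path: str, target_path: str) -> DataEdge:
    """Assumed shape of with_data: record the edge and auto-create its bucket."""
    edge = DataEdge(upstream, downstream, source_path, target_path)
    edge.bucket = bucket_name_for(upstream, downstream, source_path)
    store.ensure_bucket(edge.bucket)  # no-op if the bucket already exists
    return edge


def transfer(store: BucketStore, edge: DataEdge, upstream_files: dict) -> None:
    """Copy files under source_path into the bucket, keyed by relative path."""
    prefix = edge.source_path.rstrip("/") + "/"
    for path, content in upstream_files.items():
        if path.startswith(prefix):
            store.buckets[edge.bucket][path[len(prefix):]] = content


def downstream_view(store: BucketStore, edge: DataEdge) -> dict:
    """Return the bucket contents as the downstream task would see them."""
    base = edge.target_path.rstrip("/")
    return {f"{base}/{key}": content
            for key, content in store.buckets[edge.bucket].items()}
```

For example, `edge = with_data(store, "train", "eval", "/outputs", "/inputs")` followed by `transfer(store, edge, files)` would make a file written to `/outputs/model.pt` by the upstream task appear at `/inputs/model.pt` for the downstream task. Deriving the bucket name from the edge keeps creation idempotent; a real implementation would also need to take the region and permission settings mentioned in item 3.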