Skip to content

Question about the # Copied from system #12650

@delmalih

Description

@delmalih

Hi team! 👋

While working on improving docstrings and type hints across scheduler files (issue #9567), I've noticed the # Copied from pattern used extensively throughout the codebase.

Examples:

  • Functions like betas_for_alpha_bar are duplicated across multiple schedulers
  • Output classes like DDPMSchedulerOutput are copied with name replacements (e.g., DDPM->EulerDiscrete)

My question: What's the rationale behind this duplication system instead of:

  1. Using a shared utils.py or common.py file for common functions
  2. Using class inheritance for similar Output classes

I understand there might be good architectural reasons (module independence, API stability, avoiding circular dependencies, etc.), but this isn't documented anywhere that I could find.

Suggested action: Regardless of the answer, I think we should either:

  • Option A: Refactor to use inheritance/shared utilities (if the current system is legacy)
  • Option B: Document this design decision in:
      - A CONTRIBUTING.md or architecture doc
      - Comments in the utils/check_copies.py script itself
      - Another README in the diffusers directory

This would help future contributors (like me! 😅) understand why this pattern exists and how to work with it properly when improving documentation. What do you think?

Thanks for maintaining such a great library! 🚀

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions