
Implement lazy loading for traceable models #1105

Open

wants to merge 9 commits into base: main
Conversation

kylesayrs
Collaborator

@kylesayrs commented Jan 28, 2025

Purpose

  • Some models may import libraries that are not part of the base installation. Users who want to use one traceable model should not be forced to install the libraries required by another traceable model.
  • Model definitions are large, so it is better to load them only when they are needed.
  • Adding new model definitions that rely on newer transformers versions currently affects all other model definitions. For example, importing the tracing module with transformers<4.47.0 raises an error because idefics had not been implemented at that point. While we encourage users to update to the latest transformers version, this PR allows users to keep their existing environment rather than forcing an unnecessary upgrade.

For an example of this lazy module pattern, see transformers/models/llama/__init__.py.

from typing import TYPE_CHECKING
from ...utils import _LazyModule
from ...utils.import_utils import define_import_structure

if TYPE_CHECKING:
    from .configuration_llama import *
    from .modeling_flax_llama import *
    from .modeling_llama import *
    from .tokenization_llama import *
    from .tokenization_llama_fast import *
else:
    import sys

    _file = globals()["__file__"]
    sys.modules[__name__] = _LazyModule(__name__, _file, define_import_structure(_file), module_spec=__spec__)

Changes

  • Implemented _AliasableLazyModule, which extends _LazyModule to support aliases.
  • Dynamically replace llmcompressor.transformers.tracing with an instance of _AliasableLazyModule, which lazily loads submodules as they are needed.
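The core idea can be sketched as follows. This is a minimal illustration of an aliasable lazy module, not the PR's exact code: the class name, constructor arguments, and the stand-in import structure are assumptions for demonstration purposes.

```python
import importlib
import types


class AliasableLazyModule(types.ModuleType):
    """Sketch of a lazy module proxy that also resolves aliases.

    Attributes are imported from their defining submodule only on
    first access, then cached on the module instance.
    """

    def __init__(self, name, import_structure, aliases=None):
        super().__init__(name)
        # maps attribute name -> module path that defines it
        self._import_structure = import_structure
        # maps alias -> canonical attribute name
        self._aliases = aliases or {}

    def __getattr__(self, name):
        # __getattr__ only fires when normal lookup fails, so cached
        # attributes never re-trigger an import
        name = self._aliases.get(name, name)
        if name not in self._import_structure:
            raise AttributeError(
                f"module {self.__name__!r} has no attribute {name!r}"
            )
        submodule = importlib.import_module(self._import_structure[name])
        value = getattr(submodule, name)
        setattr(self, name, value)  # cache under the canonical name
        return value


# Demo with a stdlib stand-in: "sqrt" lives in math, and "square_root"
# is an alias for it. Nothing is imported until first attribute access.
demo = AliasableLazyModule(
    "demo", {"sqrt": "math"}, aliases={"square_root": "sqrt"}
)
```

In the PR itself, an instance like this replaces the real module via `sys.modules[__name__] = ...`, mirroring the transformers pattern quoted above, so heavyweight model definitions are only imported when actually referenced.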

Testing

  • Added passing tests in tests/llmcompressor/transformers/tracing/test_init.py
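For reference, the laziness property such a test can assert is that a submodule is absent from sys.modules until the first attribute access triggers its import. This is a hypothetical sketch of that check using the stdlib "json" module as a stand-in; the actual test file asserts against the tracing submodules.

```python
import sys


def check_lazy_import():
    # Ensure a clean slate for the demo, then confirm the module is
    # not loaded until it is actually imported/accessed.
    sys.modules.pop("json", None)
    assert "json" not in sys.modules
    import json  # first access performs the real import
    assert "json" in sys.modules
```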

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

@kylesayrs kylesayrs self-assigned this Jan 28, 2025
@kylesayrs kylesayrs added the ready When a PR is ready for review label Jan 28, 2025
@kylesayrs kylesayrs marked this pull request as ready for review January 28, 2025 17:37
2 participants