
Conversation


@Markus28 Markus28 commented Mar 12, 2024

This PR implements a prototype of proposal #2538: it allows arbitrary keyword arguments to be passed through SentenceTransformer.encode to the model.

Users may now specify in modules.json which keyword arguments each module expects. The SentenceTransformer class no longer inherits from nn.Sequential, but instead from nn.ModuleDict. We implement the forward method ourselves and distribute the keyword arguments to the modules.

A model may then provide a modules.json file like this:

[
  {
    "idx": 0,
    "name": "0",
    "path": "",
    "type": "sentence_transformers.models.Transformer",
    "kwargs": ["task", "embedding_dim", "foobar"]
  },
  {
    "idx": 1,
    "name": "1",
    "path": "1_Pooling",
    "type": "sentence_transformers.models.Pooling"
  }
]

and users may then use the SentenceTransformer model like this:

model = SentenceTransformer('<MODEL>', trust_remote_code=True)
model.encode(['Hello world'], task='sts', embedding_dim=32, foobar=0)
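The kwargs routing described above can be sketched as follows. This is a hypothetical, dependency-free illustration of the idea (not the actual sentence-transformers implementation): each module's accepted_kwargs mirrors the "kwargs" list from modules.json, and the forward pass only forwards the keys a module declared.

```python
# Hypothetical sketch of routing encode() kwargs to modules, assuming each
# module exposes the "kwargs" list it declared in modules.json.

class Transformer:
    # mirrors "kwargs": ["task", "embedding_dim", "foobar"] in modules.json
    accepted_kwargs = ["task", "embedding_dim", "foobar"]

    def __call__(self, features, **kwargs):
        # record which kwargs actually reached this module
        features["seen_kwargs"] = sorted(kwargs)
        return features

class Pooling:
    # no "kwargs" entry in modules.json, so it receives none
    accepted_kwargs = []

    def __call__(self, features, **kwargs):
        assert not kwargs  # routing must not leak undeclared kwargs
        return features

def forward(modules, features, **kwargs):
    """Run modules in order, passing each only the kwargs it declared."""
    for module in modules.values():
        allowed = {k: v for k, v in kwargs.items() if k in module.accepted_kwargs}
        features = module(features, **allowed)
    return features

modules = {"0": Transformer(), "1": Pooling()}
out = forward(modules, {"input": ["Hello world"]},
              task="sts", embedding_dim=32, foobar=0)
print(out["seen_kwargs"])  # only the Transformer's declared kwargs arrive
```

Unrecognized keys are simply not forwarded here; a real implementation might instead raise an error for kwargs no module declared.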

Todo

  • These changes are breaking for anyone using the .append method, which was previously inherited from nn.Sequential. However, we could also implement it here
  • It would also make sense to support arbitrary kwargs for the tokenizer

@bwanglzu

@tomaarsen please let us know what you think about this PR :) thanks

Bo
