Skip to content

Conversation

@janpf
Copy link

@janpf janpf commented Sep 11, 2025

This PR should load the LLäMmlein2Vec models using the LLM2Vec library as mentioned in the readme: https://huggingface.co/LSX-UniWue/LLaMmlein2Vec_1B, and also i couldn't resist but add our other models as well ;)

and sorry, my autoformatter ran, before i noticed...

hope it works 🤞🏻 let us know (also about the results 😇)

Best

- Implement load_encoder_model() function in util.py for LLM2Vec integration
- Add llm2vec dependency to pyproject.toml
- Update all model classes to use the new encoder loading function
- Add new model configurations:
  - LLämmlein2Vec 120M, 1B, and 7B models
  - ModernGBERT 134M model configuration
- Update existing model configs with proper HuggingFace model names
- Rename model config files to use correct German umlaut spelling
@janpf
Copy link
Author

janpf commented Sep 24, 2025

@fynnos I managed to ask my AI friend to remove the unnecessary whitespace formatting. This should make this PR much simpler. The PR is updated

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant