The best way to evaluate your embedding models!
IMPORTANT: rankwise is in an early stage of development; the interface may change without further notice.
rankwise is a benchmark CLI tool for evaluating the quality of embedding models with your own dataset.
It also facilitates the generation of datasets using LLM models.
- Install dependencies:
poetry install
- Activate the Poetry Shell:
poetry shell
- Run the
calculate-entropy iso-score
command to calculate the isotropy score of a dataset.
rankwise calculate-entropy iso-score --input dataset.jsonl -E "huggingface.HuggingFaceEmbedding('sentence-transformers/all-MiniLM-L12-v2')" -p
- Run the
generate
command to create a dataset containing questions and their related documents in JSONL format from adata.jsonl
file with one document (a json encoded string) per line.
API_KEY=xxx rankwise generate --model "azure_openai.AzureOpenAI(model='gpt-4o',deployment_name='gpt-4o',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" --questions-count 3 --input data.jsonl > dataset.jsonl
This command uses the given LLM model to generate the specified number of questions for every document in the input file.
-
Run the
classify
command to classify the generated questions as good/bad using a given classification model.- If you want to use a cross-encoder model, you can use the following command:
API_KEY=xxx rankwise classify cross-encoder --input dataset.jsonl --cross-encoder-model "cross_encoder.CrossEncoder('BAAI/bge-reranker-v2-m3', default_activation_function=torch.nn.Sigmoid())" --output-file classified_dataset.jsonl
- If you want to use an embedding model, you can use the following command:
API_KEY=xxx rankwise classify cosine-similarity --input dataset.jsonl --embedding-model "azure_openai.AzureOpenAIEmbedding(model='text-embedding-3-large',deployment_name='azure-text-embedding-ada-002',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" --output-file classified_dataset.jsonl
-
Run the
evaluate classification
command to evaluate your classification (from the classify command) against a ground truth dataset.
rankwise evaluate classification --ground-truth classify_ground_truth.jsonl --classification classified_dataset.jsonl
- Run the
evaluate embedding-model
command to assess your dataset and obtain quality metrics for the specified embedding model.
API_KEY=xxx rankwise evaluate embedding-models -E "azure_openai.AzureOpenAIEmbedding(model='text-embedding-3-large',deployment_name='azure-text-embedding-ada-002',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" -m hit_rate -m mrr --input dataset.jsonl
This command uses the given embedding model to evaluate the input dataset and calculate quality metrics.
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
Please read the CONTRIBUTING
file for details on our code of conduct and the process for submitting pull requests.
- Roberto Abdelkader Martínez Pérez
- Pedro Ruiz Pareja
- 0.1.0
- Initial Release
This project is licensed under the Apache v2.0 License - see the LICENSE
file for details
- Thanks to the open-source community for continuous inspiration.
- Special mentions to contributors and collaborators.
If you have any questions or suggestions, feel free to contact the authors.