Skip to content

14th of June Updates #3

@alebjanes

Description

@alebjanes

Evaluation metrics

  1. Embeddings

1.1 Question ID = 1 (6,231 questions)

Model Question ID Correct matches Accuracy (%)
Mixtral 1 333 5.3
Llama3 1 398 6.4
all-mpnet-base-v2 1 4425 71.0
multi-qa-MiniLM-L6-cos-v1 1 4260 68.4
multi-qa-mpnet-base-cos-v1 1 4858 78.0
all-MiniLM-L12-v2 1 4027 64.6

1.2 Question ID = 2 (20,000 out of 46,872 questions)

Model Question ID Correct matches Accuracy (%)
Mixtral 2 ~1.5
Llama3 2 ~6.2
all-mpnet-base-v2 2 19734 98.7
multi-qa-MiniLM-L6-cos-v1 2 19639 98.2
multi-qa-mpnet-base-cos-v1 2 19782 98.9
all-MiniLM-L12-v2 2 19700 98.5

1.3 Question ID = 3 (15,000 out of 34,955 questions)

Model Question ID Correct matches Accuracy (%)
Mixtral 2 ~5.2
Llama3 2 ~7.5
all-mpnet-base-v2 2 13499 90.0
multi-qa-MiniLM-L6-cos-v1 2 13862 92.4
multi-qa-mpnet-base-cos-v1 2 14435 96.2
all-MiniLM-L12-v2 2 12860 85.7
  1. Fine-tuned Model

  2. Wrapper + ApiBuilder

  3. ApiBuilder

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions