Issues: uw-ssec/llmaven

Issues list

Create evaluation suite for benchmarking various models
Label: SL1: Evaluation - We expect to use Trulens for measuring performance of different RAG metrics
#15 opened Feb 25, 2025 by nikiburggraf
3 tasks
chore: GitHub action to validate a successful, error-free end-to-end run of the package
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#14 opened Feb 25, 2025 by nikiburggraf
2 tasks
chore: Create evaluation for context relevance: embedding model choice, context length, and chunking/windowing strategy
Label: SL1: Evaluation - We expect to use Trulens for measuring performance of different RAG metrics
#9 opened Dec 17, 2024 by vanitech
chore: Implement better overlapping chunk for the vector DB creation step
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#8 opened Dec 17, 2024 by vanitech
chore: Research for ssh access to VM
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#7 opened Dec 12, 2024 by lsetiawan
chore: Add cloud configuration for infrastructure
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#6 opened Dec 11, 2024 by lsetiawan
Separate inference endpoint from the backend infrastructure
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#4 opened Dec 3, 2024 by vanitech
Explore evaluation framework to measure accuracy and relevance
Label: SL1: Evaluation - We expect to use Trulens for measuring performance of different RAG metrics
#3 opened Dec 3, 2024 by vanitech
chore: Create benchmarks using different models
Label: SL3: Orchestration - Creating an orchestration framework for easily replacing different embedding approaches (vector DB
#2 opened Dec 3, 2024 by vanitech
chore: Create validation dataset from LSST community forum
Label: SL1: Evaluation - We expect to use Trulens for measuring performance of different RAG metrics
Label: SL2: Multimodal RAG - In addition to text, include images in the RAG pipeline
#1 opened Dec 3, 2024 by vanitech