Change the repository type filter
All
Repositories list
37 repositories
inspect_k8s_sandbox
Publictask-assets
Publicvivaria
PublicVivaria is METR's tool for running evaluations and conducting agent elicitation research.inspect-tasks-public
Publicinspect_ai
Public- Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity: https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/
cross-domain-horizon
Publicautonomy-evals-guide
Publictriframe_inspect
Publicmcp-reward-hack
Publicterraform-aws-lambda
PublicRE-Bench
Publiceval-analysis-public
Publicinspect_evals
Publictask-protected-scoring
Publicuplift_clone_hypothesis
Publichcast-public
Publicpublic-tasks
Publicagent-prs-on-vivaria
PublicSWE-bench-fork
PublicKernelBenchFiltered
Publicviv-task-dev
Publictask-standard
Publicagent-fork-of-eval-analysis-public
Public archivellm-foundry
Publicmetr-task-boilerplate
Publictask-artifacts
Public