vLLM not computing correct log probs when using GRPO with =/=1 temperature #702

name: "Hugging Face Issue Labeler"
on:
  issues:
    types: opened
jobs:
  triage:
    runs-on: ubuntu-latest
    permissions:
      issues: write
    steps:
      - uses: actions/checkout@v3
      - uses: August-murr/auto-labeler@main
        with:
          hf-api-key: ${{ secrets.CI_HF_API_TOKEN }}