SkyFishMoon

Pinned

  1. volcengine/verl (Public)

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python, 14.5k stars, 2.3k forks

  2. hiyouga/EasyR1 (Public)

    EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

    Python, 3.8k stars, 289 forks

  3. langfengQ/verl-agent (Public)

    verl-agent is an extension of veRL designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

    Python, 1k stars, 89 forks

  4. mll-lab-nu/VAGEN (Public)

    Python, 233 stars, 33 forks

  5. UKPLab/acl2025-rupta (Public)

    Official code for the paper "Robust Utility-Preserving Text Anonymization Based on Large Language Models".

    Python, 7 stars, 1 fork