SkyFishMoon

Tianyu Yang SkyFishMoon

世上无难事，只要肯登攀

Pinned Loading

volcengine/verl volcengine/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18.4k 3k
hiyouga/EasyR1 hiyouga/EasyR1 Public

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4.4k 339
langfengQ/verl-agent langfengQ/verl-agent Public

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1.4k 121
mll-lab-nu/VAGEN mll-lab-nu/VAGEN Public

Training VLM agents with multi-turn reinforcement learning

Python 375 43
UKPLab/acl2025-rupta UKPLab/acl2025-rupta Public

This is the official code for the paper: Robust Utility-Preserving Text Anonymization Based on Large Language Models

Python 10 2