Learning in LLMs and MLSys, recently focused on RL training
- 💬 Personal Website: https://yushengsu-thu.github.io/
- Google Scholar: https://scholar.google.com/citations?user=xwy6Va4AAAAJ
- 📫 E-mail: [email protected]
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications. It primarily provides two frameworks: task-solving and simulation.
On Transferability of Prompt Tuning for Natural Language Processing
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python 1
Forked from fzyzcjy/torch_memory_saver
Allow torch tensor memory to be released and resumed later
Python 2
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM training.
Forked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.
Python 1