-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Labels
backlogNot planned for nowNot planned for nowtrainingFine tuning related featuresFine tuning related features
Description
Some library such as ART supports training AI agents with RL efficiently using Unsloth: https://docs.unsloth.ai/basics/reinforcement-learning-rl-guide/training-ai-agents-with-rl
Currently we support most dataset-based use cases with GRPO and DPO, but this would be also useful!
Metadata
Metadata
Assignees
Labels
backlogNot planned for nowNot planned for nowtrainingFine tuning related featuresFine tuning related features