Skip to content

Conversation

@KANGslay
Copy link

在examples 文件中提交关于文本生成图像任务中的强化学习框架调研报告,基于UnifiedReward-qwen作为reward model完成优化的实验报告,及相关代码

address #9

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant