I’m implementing a multi-stage reinforcement learning (RL) pipeline for reasoning tasks using GRPO, and I’d like to load a pre-trained LoRA adapter and continue training it.

Setup:

In standard Hugging Face + PEFT workflows, I can load a pre-trained LoRA adapter like this:

```python
from peft import PeftModel, PeftConfig
from transformers import AutoModelForCausalLM

config = PeftConfig.from_pretrained("path_to_trained_lora_adapter")
base_model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path)
lora_model = PeftModel.from_pretrained(base_model, "path_to_trained_lora_adapter")
```

However, in my current GRPO trainer config, LoRA is initialized from scratch via these parameters:

```
actor_rollout_ref.model.lora_rank=32 \
actor_rollout_ref.model.lora_alpha=32 \
actor_rollout_ref.model.target_modules=all-linear \
```

There doesn’t appear to be a config option (e.g., …) for pointing the trainer at an existing adapter.

Question: Is there a way to load a pre-trained LoRA adapter and continue GRPO training from it, rather than initializing LoRA from scratch?
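For reference, in plain PEFT terms the two initialization paths I’m contrasting look roughly like this (a minimal sketch; the paths are placeholders, not my actual setup):

```python
from peft import LoraConfig, PeftModel, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("path_to_base_model")  # placeholder path

# (a) What the trainer config above effectively does: a fresh, randomly
#     initialized LoRA adapter of rank 32 on all linear layers.
model = get_peft_model(
    base_model,
    LoraConfig(r=32, lora_alpha=32, target_modules="all-linear"),
)

# (b) What I want instead (on a freshly loaded base model, not on top of (a)):
#     resume from an already-trained adapter, keeping its weights trainable.
model = PeftModel.from_pretrained(
    base_model, "path_to_trained_lora_adapter", is_trainable=True
)
```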
Currently verl can't load and continue training from a pre-trained LoRA adapter; after PR #3523 it can.
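Until that change is available in your version, one interim approach is to merge the trained adapter into the base weights and point the GRPO trainer at the merged checkpoint, so the next LoRA stage starts from those weights. This is a minimal sketch using only standard PEFT/transformers APIs; the paths are placeholders:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder paths; adjust to your setup.
base = AutoModelForCausalLM.from_pretrained("path_to_base_model")
lora = PeftModel.from_pretrained(base, "path_to_trained_lora_adapter")

# Fold the adapter weights into the base model and save a standalone checkpoint.
merged = lora.merge_and_unload()
merged.save_pretrained("path_to_merged_model")
AutoTokenizer.from_pretrained("path_to_base_model").save_pretrained("path_to_merged_model")

# Point the trainer's base-model path at "path_to_merged_model"; the LoRA
# parameters in the GRPO config then create a fresh adapter on top of the
# merged weights.
```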