Low Success Rate When Using Diffusion Policy on the StackCube-v1 Task


I’ve noticed that the RGB-based diffusion policy fails on many tasks, which is similar to what the authors reported: https://wandb.ai/stonet2000/ManiSkill/workspace?nw=3d49kizu62e
.

I’m sorry to raise this, but I think the very low success rate is due to an issue in the code rather than an insufficient dataset or other factors. From the training logs you provided, for the diffusion policy, the state-based results achieve very high success rates, whereas the RGB-based results are consistently low across many tasks, which is counterintuitive. For example, why is the success rate for PickCube significantly higher than for StackCube? I don’t think the two tasks should differ that much. I’ve combed through almost all the issues and PRs, and StackCube seems to be a nightmare for the diffusion policy. 

<img width="1338" height="903" alt="Image" src="https://github.com/user-attachments/assets/c5edc10f-cb58-463b-a939-2541cda315b8" />


In particular, after reviewing many inference videos, I noticed it often fails to localize the correct position, consistently deviating to the left or to the right.

<img width="743" height="672" alt="Image" src="https://github.com/user-attachments/assets/f76a4034-8a05-42d2-99ba-e41fe88dc312" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Low Success Rate When Using Diffusion Policy on the StackCube-v1 Task #1325

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Low Success Rate When Using Diffusion Policy on the StackCube-v1 Task #1325

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions