Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Proposal] Probability Mask for sampling a space #1255

Open
1 task done
mariojerez opened this issue Nov 18, 2024 · 2 comments
Open
1 task done

[Proposal] Probability Mask for sampling a space #1255

mariojerez opened this issue Nov 18, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@mariojerez
Copy link

Proposal

Currently, in the gymnasium/spaces/discrete module, the sample() method of the Discrete class only allows you to choose randomly between valid integers. It would be nice to extend it so that it can also accept a probability mask, where it gives the probability that a specific number can be returned. It could be used as an alternative to masking, because if a move is invalid, we can assign it a probability of zero.

Motivation

This would be helpful for me, since I'm implementing an Ant Colony Optimization algorithm, and each action needs to be assigned a probability of being chosen.

Pitch

Add a probability_mask parameter to the sample function, so that the user can choose to either provide a regular mask, probability mask, or neither. A verification should be added to make sure that they only provided one of the mask options, not both. We also need to verify that the probabilities add up to 1.

Alternatives

Rather than adding a new argument to the sample function, we could also create a new function called probability_sample, that takes the arguments n, mask. Both arguments could be required, since if you're not going to add a mask, you might as well use the original sample method.

Additional context

If the maintainers of Gymnasium think that this is a good idea, I would love to implement it myself and make a pull request.

Checklist

  • I have checked that there is no similar issue in the repo
@mariojerez mariojerez added the enhancement New feature or request label Nov 18, 2024
@pseudo-rnd-thoughts
Copy link
Member

Thanks for suggestion, sorry for not replying previously.
If you want to make this, we would be happy for you to add this.

I would suggest a new parameter in space.sample(mask=None, probability=None) where users can either use mask or probability.

@mariojerez
Copy link
Author

Awesome, thanks for getting back to me! I'll go ahead and make it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants