You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
According to the paper "our policies are parameterized by a two-layer ReLU MLP with 64 units per layer. To support discrete communication messages, we use the Gumbel-Softmax estimator [14]." However, I could not find it in the code!
The policy is hardcoded (policy.py )based on the keyboard input, so what if my environment does not require input from the user
Appreciate explaining that point
The text was updated successfully, but these errors were encountered:
According to the paper "our policies are parameterized by a two-layer ReLU MLP with 64 units per layer. To support discrete communication messages, we use the Gumbel-Softmax estimator [14]." However, I could not find it in the code!
The policy is hardcoded (policy.py )based on the keyboard input, so what if my environment does not require input from the user
Appreciate explaining that point
The text was updated successfully, but these errors were encountered: