Skip to content

Separate Model and Environment with GI Interface #220

Open
@lukasgppl

Description

@lukasgppl

Hi all,
I'm playing around with this package, exploring possible options to train an AZ model for my bachelors thesis. My use-case is far beyond games - I've built an environment that simulates a production facility with multiple machines and orders that need to be scheduled in a way to minimize maximize a certain reward. This scheduling problem is far too large to model exhaustively. State and action spaces are practically continous and the scheduling process is stochastic. Nonetheless, research approaches this by building a deterministic sub-problem based on the current environment's state. This subproblem is the one that needs to be solved by alphazero. After fully solving this problem, an action can be derived for the simulation environment and from the resulting state, a new sub-problem can be built and so on.
The current implementation uses the Environment directly as a model. This means I am only able to solve one sub-problem whilst training at the moment by defining the GI Interface to the Model with a sub-problem built from the initial state of the simulation environment. Is there a possibility to split the Model and the Environment for my use-case?
Thanks in advance.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions