This is a replication package for the paper Interpreting Workflow Architectures by LLMs by Michal Töpfer, Tomáš Bureš, František Plášil and Petr Hnětynka.
- (Optional): create virtual environment:
python3 -m venv .venvand activate it:source .venv/bin/activate - install requirements:
pip install -r requirements.txt
To use OpenAI (paid API):
- create API key: https://platform.openai.com/api-keys
- rename
.env.exampleto.envand save the API key there
The agent_evaluation folder contains sample test instances for evaluating a LLM-based Agent. The instances are manually created based on the test patterns. The evaluation can be run via the main.py file.
In the agent_evaluation/README.md file, the experimental results are summarized and discussed.
The xxp_agent folder contains code for running a LLM-based agent that chats with the user. Based on the configuration (see examples in the examples folder) the available tools are selected (so the agent can read workflow specifications, ...). To start the agent, run the main.py file.