nextmv-io
diff --git a/‎docs/examples/multifile-workflow.md‎
Lines changed: 186 additions & 0 deletions b/‎docs/examples/multifile-workflow.md‎
Lines changed: 186 additions & 0 deletions
diff --git a/‎docs/tutorials/echo-multi.md‎
Lines changed: 154 additions & 0 deletions b/‎docs/tutorials/echo-multi.md‎
Lines changed: 154 additions & 0 deletions
diff --git a/‎mkdocs.yml‎
Lines changed: 1 addition & 0 deletions b/‎mkdocs.yml‎
Lines changed: 1 addition & 0 deletions
@@ -0,0 +1,186 @@
+# multi-file Workflow Example
+
+!!! tip
+
+    This example uses the [`echo` app](../tutorials/echo.md), make sure to complete
+    that tutorial first.
+
+This example showcases how to use **multi-file** applications _within_ a
+Nextpipe workflow. A multi-file application differs from a JSON-based
+application in that it accepts a directory of files as input and produces a
+directory of files as output. Note that the workflow itself is also a multi-file
+application, however, this is a user choice (i.e., you could also create a
+JSON-based workflow that uses multi-file sub-applications).
+
+For demonstration purposes, we will use the simple [echo-multi application] as
+the sub-application, which echoes the input files as output files.
+
+Find the workflow code below (mind the comments explaining each step):
+
+```python
+import os
+import shutil
+
+import nextmv
+import nextmv.cloud
+
+from nextpipe import FlowSpec, app, log, needs, step
+
+options = nextmv.Options(
+    nextmv.Option("input", str, "inputs/", "Path to input file.", False),
+    nextmv.Option("output", str, "outputs/", "Path to output file.", False),
+)
+
+
+# >>> Workflow definition
+class Flow(FlowSpec):
+    # The first step receives the path to the input files directly (see main()) and
+    # automatically zips the directory and passes it to the 'echo-multi' sub-app.
+    @app(app_id="echo-multi")
+    @step
+    def solve1():
+        """Runs a multi-file model."""
+        pass
+
+    # The second step receives the path to the output files from the first step. This path
+    # will point to a temporary directory containing the output files from the first step.
+    @needs(predecessors=[solve1])
+    @step
+    def transform(result_path: str):
+        """Transforms the result for the next step."""
+        # Just list the content of the result directory.
+        log(f"Contents of result directory {result_path}:")
+        for file_name in os.listdir(result_path):
+            full_file_name = os.path.join(result_path, file_name)
+            if os.path.isfile(full_file_name):
+                log(f"- {file_name}")
+
+        # Add a new file to the result for demonstration purposes.
+        new_file_path = os.path.join(result_path, "additional_file.txt")
+        with open(new_file_path, "w") as f:
+            f.write("This is an additional file added in the transform step.\n")
+        log(f"Added new file: {new_file_path}")
+
+        return result_path
+
+    # The third step receives the (modified) directory from the transform step and runs
+    # another multi-file app on it.
+    @app(
+        app_id="echo-multi",
+        # We specify the content type explicitly here. This is normally done via the app's
+        # manifest, but we can do it explicitly like this too.
+        run_configuration=nextmv.RunConfiguration(
+            format=nextmv.Format(
+                format_input=nextmv.FormatInput(input_type=nextmv.InputFormat.MULTI_FILE),
+                format_output=nextmv.FormatOutput(output_type=nextmv.OutputFormat.MULTI_FILE),
+            )
+        ),
+        full_result=True,
+    )
+    @needs(predecessors=[transform])
+    @step
+    def solve2(result: nextmv.cloud.RunResult):
+        """Runs another multi-file model."""
+        pass
+
+    # The final step receives the output from 'solve2' as a full result object (see
+    # 'full_result=True' above). In this case, the path to the output files is available
+    # via 'result.output'.
+    @needs(predecessors=[solve2])
+    @step
+    def prepare_output(result: nextmv.cloud.RunResult):
+        """Transforms the result for the next step."""
+        # Extract the path to the output files.
+        result_path = result.output
+        # Simply copy the files from the given directory to the expected output directory.
+        os.makedirs(options.output, exist_ok=True)
+        for file_name in os.listdir(result_path):
+            full_file_name = os.path.join(result_path, file_name)
+            if os.path.isfile(full_file_name):
+                shutil.copy(full_file_name, options.output)
+
+
+def main():
+    # Run workflow (simply provide the path to the multi-file input)
+    flow = Flow("DecisionFlow", options.input)
+    flow.run()
+    # The last step of the flow already prepares the output in the requested directory,
+    # so no need to do anything here anymore.
+
+
+if __name__ == "__main__":
+    main()
+```
+
+Run the example:
+
+```bash
+$ python main.py
+[nextpipe] No application ID or run ID found, uplink is inactive.
+[nextpipe] Flow: Flow
+[nextpipe] nextpipe: v0.3.5
+[nextpipe] nextmv: 0.33.0
+[nextpipe] Flow graph steps:
+[nextpipe] Step:
+[nextpipe]   Definition: Step(solve1, StepRun(echo-multi, , {}, False))
+[nextpipe]   Docstring: Runs a multi-file model.
+[nextpipe] Step:
+[nextpipe]   Definition: Step(transform, StepNeeds(solve1))
+[nextpipe]   Docstring: Transforms the result for the next step.
+[nextpipe] Step:
+[nextpipe]   Definition: Step(solve2, StepNeeds(transform), StepRun(echo-multi, , {}, True))
+[nextpipe]   Docstring: Runs another multi-file model.
+[nextpipe] Step:
+[nextpipe]   Definition: Step(prepare_output, StepNeeds(solve2))
+[nextpipe]   Docstring: Transforms the result for the next step.
+[nextpipe] Mermaid diagram:
+[nextpipe] graph LR
+  solve1(solve1)
+  solve1 --> transform
+  transform(transform)
+  transform --> solve2
+  solve2(solve2)
+  solve2 --> prepare_output
+  prepare_output(prepare_output)
+
+[nextpipe] Mermaid URL: https://mermaid.ink/svg/Z3JhcGggTFIKICBzb2x2ZTEoc29sdmUxKQogIHNvbHZlMSAtLT4gdHJhbnNmb3JtCiAgdHJhbnNmb3JtKHRyYW5zZm9ybSkKICB0cmFuc2Zvcm0gLS0+IHNvbHZlMgogIHNvbHZlMihzb2x2ZTIpCiAgc29sdmUyIC0tPiBwcmVwYXJlX291dHB1dAogIHByZXBhcmVfb3V0cHV0KHByZXBhcmVfb3V0cHV0KQo=?theme=dark
+[nextpipe] Running node solve1_0
+[nextpipe] Started app step solve1_0 run, find it at https://cloud.nextmv.io/app/echo-multi/run/latest-a-JAvuFgDR?view=details
+/home/marius/.asdf/installs/python/3.13.7/lib/python3.13/shutil.py:1281: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior.
+  tarobj.extractall(extract_dir, filter=filter)
+[nextpipe] Running node transform_0
+[transform_0] Contents of result directory /tmp/nextpipe_output_igqsibzm:
+[transform_0] - input.xlsx
+[transform_0] - data.csv
+[transform_0] Added new file: /tmp/nextpipe_output_igqsibzm/additional_file.txt
+[nextpipe] Running node solve2_0
+[nextpipe] Started app step solve2_0 run, find it at https://cloud.nextmv.io/app/echo-multi/run/latest-HIwvuFgDg?view=details
+[nextpipe] Running node prepare_output_0
+```
+
+Content of the output directory:
+
+```bash
+tree outputs/
+outputs/
+├── additional_file.txt
+├── data.csv
+└── input.xlsx
+
+1 directory, 3 files
+```
+
+The resulting Mermaid diagram for this flow looks like this:
+
+```mermaid
+graph LR
+  solve1(solve1)
+  solve1 --> transform
+  transform(transform)
+  transform --> solve2
+  solve2(solve2)
+  solve2 --> prepare_output
+  prepare_output(prepare_output)
+```
+
+[echo-multi application]: ../tutorials/echo-multi.md
@@ -0,0 +1,154 @@
+# The `echo-multi` app
+
+Several examples assume you have a Nextmv application called `echo-multi`. This
+is just a simple application created for demonstration purposes. It takes the
+input files and echoes them as output files.
+
+Let's get set up with the `echo-multi` application. Before starting:
+
+1. [Sign up][signup] for a Nextmv account.
+2. Get your API key. Go to [Team > API Key][api-key].
+
+Make sure that you have your API key set as an environment variable:
+
+```bash
+export NEXTMV_API_KEY="<YOUR-API-KEY>"
+```
+
+Now that you have a valid Nextmv account and API key, let's create the
+`echo-multi` Nextmv app (start in an empty directory).
+
+1. Create a folder `inputs/` and add some sample input files to it. For example,
+   you can create two text files `input.csv` and `input.txt` with some sample
+   content.
+1. In a new directory, create a file called `main.py` with the code for the
+   basic app that echoes the input.
+
+    ```python
+    import glob
+
+    import os
+    import time
+
+    import nextmv
+
+    def main():
+        options = nextmv.Options(
+            nextmv.Option("input", str, "inputs/", "Path to input file.", False),
+            nextmv.Option("output", str, "outputs/", "Path to output file.", False),
+            nextmv.Option("duration", float, 1.0, "Runtime duration (in seconds).", False),
+        )
+
+        # Read and prepare the input data.
+        input_data = read_input(options.input)
+
+        # Log information about the input files.
+        nextmv.log(f"Size of input files (count: {len(input_data)}):")
+        for file_path, content in input_data.items():
+            nextmv.log(f"  {file_path}: {len(content)} bytes")
+
+        # Sleep for the specified duration.
+        nextmv.log(f"Sleeping for {options.duration} seconds...")
+        time.sleep(options.duration)
+        nextmv.log("Woke up from sleep.")
+
+        # Write the output.
+        write_output(options.output, input_data)
+
+    def read_input(input_path: str) -> dict[str, bytes]:
+        """Reads the input files to memory."""
+        input_files = glob.glob(os.path.join(input_path, "**/*"), recursive=True)
+        content = {}
+        for file_path in input_files:
+            if os.path.isfile(file_path):
+                with open(file_path, "rb") as file:
+                    nextmv.log(f"Reading file: {file_path}")
+                    content[file_path] = file.read()
+        return content
+
+    def write_output(output_path: str, content: dict[str, bytes]) -> None:
+        """Writes the given output files."""
+        if not os.path.exists(output_path):
+            os.makedirs(output_path)
+
+        for file_path, data in content.items():
+            output_file_path = os.path.join(output_path, os.path.basename(file_path))
+            with open(output_file_path, "wb") as file:
+                nextmv.log(f"Writing file: {output_file_path}")
+                file.write(data)
+
+    if __name__ == "__main__":
+        main()
+    ```
+
+    Note that the application uses the [`nextmv`][nextmv-docs] library. This
+    library is a dependency of `nextpipe` and should be installed automatically
+    when you install `nextpipe`.
+
+    You may run the app locally to test it:
+
+    ```bash
+    python main.py
+    ```
+
+1. Create a `requirements.txt` file with the following requirements for running
+   the app:
+
+    ```requirements.txt
+    nextmv
+    ```
+
+1. Create an `app.yaml` file (the app manifest) with the following instructions:
+
+    ```yaml
+    type: python
+    runtime: ghcr.io/nextmv-io/runtime/python:3.11
+    files:
+        - main.py
+    python:
+        pip-requirements: requirements.txt
+    ```
+
+1. Push the application to your Nextmv account. Create a `push.py` script in
+   the same directory with the following code:
+
+    ```python
+    import os
+
+    from nextmv import cloud
+
+    client = cloud.Client(api_key=os.getenv("NEXTMV_API_KEY"))
+    app = cloud.Application.new(client=client, name="echo-multi", id="echo-multi", description="Sample echo multi-file app.", exist_ok=True)
+    app.push(verbose=True)
+    ```
+
+1. Execute the `push.py` script to push the app to your Nextmv account:
+
+    ```bash
+    $ python push.py
+    💽 Starting build for Nextmv application.
+    🐍 Bundling Python dependencies.
+    📋 Copied files listed in "app.yaml" manifest.
+    📦 Packaged application (588 files, 5.39 MiB).
+    🌟 Pushing to application: "echo-multi".
+    💥️ Successfully pushed to application: "echo-multi".
+    {
+    "app_id": "echo-multi",
+    "endpoint": "https://api.cloud.nextmv.io",
+    "instance_url": "v1/applications/echo-multi/runs?instance_id=devint"
+    }
+    ```
+
+    Alternatively, you can use the [Nextmv CLI][nextmv-cli] to create and push the app:
+
+    ```bash
+    nextmv app create -a echo-multi -n echo-multi -d "Sample echo multi-file app."
+    nextmv app push -a echo-multi
+    ```
+
+Now you are ready to run the examples.
+
+[signup]: https://cloud.nextmv.io
+[api-key]: https://cloud.nextmv.io/team/api-keys
+[nextmv-docs]: https://nextmv-py.readthedocs.io/en/latest/nextmv/
+[nextmv-cli]: https://docs.nextmv.io/docs/using-nextmv/reference/cli
@@ -12,6 +12,7 @@ nav:
           - The echo app: tutorials/echo.md
       - Examples:
           - Basic chained workflow: examples/basic-chained-workflow.md
+          - Multi-file workflow: examples/multifile-workflow.md
           - Fanout workflow: examples/fanout-workflow.md
           - Ensemble workflow: examples/ensemble-workflow.md
           - Complex workflow: examples/complex-workflow.md