Commit ad9a043

Updated docs (#286)

Authored by JSabadin and kozlov721
Co-authored-by: Martin Kozlovský <[email protected]>

1 parent a16ae24 commit ad9a043
File tree

5 files changed: +208 −26 lines changed

luxonis_ml/data/README.md

Lines changed: 78 additions & 26 deletions
@@ -50,7 +50,7 @@ Each of these steps will be explained in more detail in the following examples.

We will be using our toy dataset `parking_lot` in all examples. The dataset consists of images of cars and motorcycles in a parking lot. Each image has a corresponding annotation in the form of a bounding box, keypoints and several segmentation masks.

-**Dataset Annotations:**
+**Dataset Information:**

| Task | Annotation Type | Classes |
| --------------------------- | ----------------- | ------------------------------------------------------------------------------------------------------ |
@@ -74,12 +74,15 @@ You can create as many datasets as you want, each with a unique name.

Datasets can be stored locally or in one of the supported cloud storage providers.

+> \[!NOTE\]
+> 📚 For a complete list of all parameters and methods of the `LuxonisDataset` class, see the [datasets README.md](datasets/README.md).
+
### Dataset Creation

First we import `LuxonisDataset` and create a dataset with the name `"parking_lot"`.

```python
-from luxonisml.data import LuxonisDataset
+from luxonis_ml.data import LuxonisDataset

dataset_name = "parking_lot"

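A minimal completion sketch (not part of this diff): constructing the dataset from the name above, using the `delete_existing` flag documented in the datasets README.

```python
# Re-create the dataset from scratch if one with this name already exists.
dataset = LuxonisDataset(dataset_name, delete_existing=True)
```
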
@@ -104,58 +107,93 @@ Each data entry should be a dictionary with the following structure:
```python
{
    "file": str, # path to the image file
-    "annotation": Optional[dict] # annotation of the file
+    "task_name": Optional[str], # task type for this annotation
+    "annotation": Optional[dict] # annotation of the instance in the file
}
```

-The content of the `"annotation"` field depends on the task type and follows the Annotation Format described later in this document.
+Luxonis Data Format supports **annotations optionally structured into different tasks** for improved organization. Tasks can be explicitly named or left unset - if none are specified, all annotations will be grouped under a single `task_name` set by default to `""`. The [example below](#adding-data-with-a-generator-function) demonstrates this with instance keypoints and segmentation tasks.
+
+The content of the `"annotation"` field depends on the task type and follows the [Annotation Format](#annotation-format) described later in this document.

#### Adding Data with a Generator Function

The recommended approach for adding data is to create a generator function that yields data entries one by one.

-Here's an example that loads object detection annotations:
+The following example demonstrates how to load **bounding box annotations** along with their corresponding **keypoints annotations**, which are linked via `"instance_id"`.
+
+Additionally, we yield **segmentation masks** while ensuring a clear separation between task groups. To achieve this, we use the `"task_name"` field, assigning `"instance_keypoints_car"` and `"instance_keypoints_motorbike"` for instance-keypoint-related annotations, and `"segmentation"` for the semantic segmentation task.

```python
import json
from pathlib import Path
+import cv2
+import numpy as np

# path to the dataset, replace it with the actual path on your system
dataset_root = Path("data/parking_lot")

def generator():
    for annotation_dir in dataset_root.iterdir():
-        with open(annotation_dir / "annotations.json") as f:
-            data = json.load(f)
-
-        # get the width and height of the image
-        W = data["dimensions"]["width"]
-        H = data["dimensions"]["height"]
+        annotation_file = annotation_dir / "annotations.json"
+        if not annotation_file.exists():
+            continue

-        image_path = annotation_dir / data["filename"]
+        with open(annotation_file) as f:
+            data = json.load(f)

-        for instance_id, bbox in data["BoundingBoxAnnotation"].items():
+        W, H = data.get("dimensions", {}).get("width", 1), data.get("dimensions", {}).get("height", 1)
+        image_path = str(annotation_dir / data.get("filename", ""))

-            # get unnormalized bounding box coordinates
+        # Process Bounding Box Annotations
+        for instance_id, bbox in data.get("BoundingBoxAnnotation", {}).items():
            x, y = bbox["origin"]
            w, h = bbox["dimension"]
-
-            # get the class name of the bounding box
-            class_name = bbox["labelName"]
            yield {
                "file": image_path,
+                "task_name": "instance_keypoints" + "_" + bbox["labelName"],
                "annotation": {
-                    "class": class_name,
-
+                    "class": bbox["labelName"],
+                    "instance_id": instance_id,
                    "boundingbox": {
-                        # normalized bounding box
-                        "x": x / W,
-                        "y": y / H,
-                        "w": w / W,
-                        "h": h / H,
+                        "x": x / W, "y": y / H, "w": w / W, "h": h / H
                    }
                },
            }
+
+        # Process Keypoints Annotations
+        for instance_id, keypoints_data in data.get("KeypointsAnnotation", {}).items():
+            keypoints = [
+                (kp["location"][0] / W, kp["location"][1] / H, kp["visibility"])
+                for kp in keypoints_data["keypoints"]
+            ]
+            yield {
+                "file": image_path,
+                "task_name": "instance_keypoints" + "_" + keypoints_data["labelName"],
+                "annotation": {
+                    "instance_id": instance_id,
+                    "keypoints": {"keypoints": keypoints},
+                },
+            }
+
+        # Process Segmentation Annotations
+        segmentation_data = data.get("VehicleTypeSegmentation", {})
+        if "filename" in segmentation_data:
+            mask_path = annotation_dir / segmentation_data["filename"]
+            mask_bgr = cv2.imread(str(mask_path))  # returns None if the file cannot be read
+            if mask_bgr is not None:
+                mask_rgb = cv2.cvtColor(mask_bgr, cv2.COLOR_BGR2RGB)
+                for instance in segmentation_data.get("instances", []):
+                    label = instance["labelName"]
+                    color = np.array(instance["pixelValue"], dtype=np.uint8)
+                    binary_mask = (mask_rgb == color).all(axis=-1).astype(np.uint8)
+                    yield {
+                        "file": image_path,
+                        "task_name": "segmentation",
+                        "annotation": {
+                            "class": label,
+                            "segmentation": {"mask": binary_mask},
+                        },
+                    }
```

The generator is then passed to the `add` method of the dataset.
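
A rough sketch of that step (not part of this diff), together with split creation via the `make_splits()` method documented in the datasets README; the 80/10/10 ratios are illustrative:

```python
# Ingest all records yielded by the generator.
dataset.add(generator())

# Divide the new data into train/val/test splits.
dataset.make_splits((0.8, 0.1, 0.1))
```
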
@@ -264,6 +302,9 @@ This guide covers the loading of datasets using the `LuxonisLoader` class.

The `LuxonisLoader` class can also take care of data augmentation; for more info see [Augmentation](#augmentation).

+> \[!NOTE\]
+> 📚 For a complete list of all parameters of the `LuxonisLoader` class, see the [loaders README.md](loaders/README.md).
+
### Dataset Loading

To load a dataset with `LuxonisLoader`, we need an instance of `LuxonisDataset`, and we need to specify what view of the dataset we want to load.
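
For illustration, a minimal loading sketch (not part of this diff), assuming `LuxonisLoader` is exported from `luxonis_ml.data` and that iterating the loader yields image/labels pairs:

```python
from luxonis_ml.data import LuxonisDataset, LuxonisLoader

dataset = LuxonisDataset("parking_lot")
loader = LuxonisLoader(dataset, view="train")

for image, labels in loader:
    ...  # consume one sample at a time, e.g. in a training loop
```
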
@@ -299,6 +340,7 @@ The supported formats are:
- [**MT YOLOv6**](https://roboflow.com/formats/mt-yolov6)
- [**CreateML JSON**](https://roboflow.com/formats/createml-json)
- [**TensorFlow Object Detection CSV**](https://roboflow.com/formats/tensorflow-object-detection-csv)
+- [**SOLO**](https://docs.unity3d.com/Packages/[email protected]/manual/Schema/SoloSchema.html)
- **Classification Directory** - A directory with subdirectories for each class

```plaintext
@@ -348,6 +390,12 @@ The dataset directory can either be a local directory or a directory in one of t

The directory can also be a zip file containing the dataset.

+The `task_name` argument can be specified as a single string or as a dictionary. If a string is provided, it will be used as the task name for all records.
+Alternatively, you can provide a dictionary that maps class names to task names for better dataset organization. See the example below.
+
+> \[!NOTE\]
+> 📚 For a complete list of all parameters of the `LuxonisParser` class, see the [parsers README.md](parsers/README.md).
+
```python
from luxonis_ml.data import LuxonisParser
from luxonis_ml.enums import DatasetType
@@ -357,8 +405,12 @@ dataset_dir = "path/to/dataset"
parser = LuxonisParser(
    dataset_dir=dataset_dir,
    dataset_name="my_dataset",
-    dataset_type=DatasetType.COCO
-)
+    dataset_type=DatasetType.COCO,
+    task_name={
+        "TorsoLimbs": "semantic_segmentation",
+        "HeadNeck": "semantic_segmentation",
+        "FullPersonBody": "instance_keypoints",
+    },
+)
```

After initializing the parser, you can parse the dataset to create a `LuxonisDataset` instance. The `LuxonisDataset` instance will contain the data from the dataset with splits for training, validation, and testing based on the dataset directory structure.
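
As a rough follow-up sketch (not part of this diff), the parse step itself, which returns the created dataset:

```python
# Convert the source directory into a LuxonisDataset.
dataset = parser.parse()
```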

luxonis_ml/data/datasets/README.md

Lines changed: 73 additions & 0 deletions
@@ -0,0 +1,73 @@
# LuxonisML Dataset

The `LuxonisDataset` class provides functionality for creating, managing, and interacting with datasets.

## Table of Contents

- [LuxonisML Dataset](#luxonisml-dataset)
  - [Parameters](#parameters)
  - [Core Methods](#core-methods)
    - [Adding Data](#adding-data)
    - [Creating Splits](#creating-splits)
    - [Merging Datasets](#merging-datasets)
    - [Cloning the Dataset](#cloning-the-dataset)

## Parameters

### LuxonisDataset Constructor Parameters

| Parameter         | Type            | Default               | Description                                              |
| ----------------- | --------------- | --------------------- | -------------------------------------------------------- |
| `dataset_name`    | `str`           | Required              | The unique name for the dataset                          |
| `team_id`         | `Optional[str]` | `None`                | Optional team identifier for the cloud                   |
| `bucket_type`     | `BucketType`    | `BucketType.INTERNAL` | Whether to use external cloud buckets                    |
| `bucket_storage`  | `BucketStorage` | `BucketStorage.LOCAL` | Underlying storage (local, GCS, S3, Azure)               |
| `delete_existing` | `bool`          | `False`               | Whether to delete an existing dataset with the same name |
| `delete_remote`   | `bool`          | `False`               | Whether to delete remote data when deleting the dataset  |

## Core Methods

### Adding Data

The `add()` method is used to add data to a dataset.

#### Parameters

| Parameter    | Type              | Default     | Description                                            |
| ------------ | ----------------- | ----------- | ------------------------------------------------------ |
| `generator`  | `DatasetIterator` | Required    | Generator yielding dataset records                     |
| `batch_size` | `int`             | `1_000_000` | Number of annotation records to process in each batch  |

### Creating Splits

The `make_splits()` method divides the dataset into separate splits (train/val/test) for machine learning workflows.

#### Parameters

| Parameter            | Type                                                                                          | Default | Description                                    |
| -------------------- | --------------------------------------------------------------------------------------------- | ------- | ---------------------------------------------- |
| `splits`             | `Mapping[str, float]` or<br>`Tuple[float, float, float]` or<br>`Mapping[str, List[PathType]]` | `None`  | Proportions or explicit file paths for splits  |
| `replace_old_splits` | `bool`                                                                                        | `False` | Whether to replace existing splits             |

### Merging Datasets

The `merge_with()` method combines data from another dataset into the current one.

#### Parameters

| Parameter          | Type             | Default  | Description                                                |
| ------------------ | ---------------- | -------- | ---------------------------------------------------------- |
| `other`            | `LuxonisDataset` | Required | Dataset to merge with                                      |
| `inplace`          | `bool`           | `True`   | Whether to modify the current dataset or create a new one  |
| `new_dataset_name` | `str`            | `None`   | Name for the new dataset if `inplace=False`                |

### Cloning the Dataset

The `clone()` method creates a complete copy of a dataset with a new name. It copies all data, metadata, and splits from the original dataset.

#### Parameters

| Parameter          | Type   | Default  | Description                                                                                 |
| ------------------ | ------ | -------- | ------------------------------------------------------------------------------------------- |
| `new_dataset_name` | `str`  | Required | Name for the cloned dataset                                                                 |
| `push_to_cloud`    | `bool` | `True`   | Whether to push the cloned dataset to cloud storage. Only if the current dataset is remote  |
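
For illustration (not in the diff), a short sketch of `merge_with()` and `clone()`, assuming `dataset` and `other_dataset` are existing `LuxonisDataset` instances:

```python
# Full copy of the dataset under a new name.
backup = dataset.clone(new_dataset_name="parking_lot_backup")

# Merge the other dataset's data into this one in place.
dataset.merge_with(other_dataset, inplace=True)
```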

luxonis_ml/data/loaders/README.md

Lines changed: 26 additions & 0 deletions
@@ -0,0 +1,26 @@
# LuxonisML Loader

The `LuxonisLoader` class provides efficient access to dataset samples with configurable preprocessing options.

## Table of Contents

- [LuxonisML Loader](#luxonisml-loader)
  - [Parameters](#parameters)

## Parameters

### LuxonisLoader Constructor Parameters

| Parameter                     | Type                                      | Default             | Description                                                     |
| ----------------------------- | ----------------------------------------- | ------------------- | --------------------------------------------------------------- |
| `dataset`                     | `LuxonisDataset`                          | Required            | The dataset to load data from                                   |
| `view`                        | `Union[str, List[str]]`                   | `"train"`           | Dataset split to use ("train", "val", "test")                   |
| `augmentation_engine`         | `str`                                     | `"albumentations"`  | Augmentation engine to use                                      |
| `augmentation_config`         | `Optional[Union[List[Params], PathType]]` | `None`              | Configuration for the augmentations                             |
| `height`                      | `Optional[int]`                           | `None`              | Height of the output images                                     |
| `width`                       | `Optional[int]`                           | `None`              | Width of the output images                                      |
| `keep_aspect_ratio`           | `bool`                                    | `True`              | Whether to keep image aspect ratio                              |
| `exclude_empty_annotations`   | `bool`                                    | `False`             | Whether to exclude empty annotations                            |
| `color_space`                 | `Literal["RGB", "BGR"]`                   | `"RGB"`             | Color space of output images                                    |
| `keep_categorical_as_strings` | `bool`                                    | `False`             | Whether to keep categorical metadata as strings                 |
| `update_mode`                 | `UpdateMode`                              | `UpdateMode.ALWAYS` | Whether to always download dataset from cloud or only if empty  |
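
For illustration (not in the diff), a constructor sketch using only parameters from the table above; the size values are arbitrary:

```python
from luxonis_ml.data import LuxonisDataset, LuxonisLoader

dataset = LuxonisDataset("parking_lot")
loader = LuxonisLoader(
    dataset,
    view="val",
    height=256,
    width=320,
    keep_aspect_ratio=False,  # resize exactly to width x height
    color_space="BGR",
)
```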

luxonis_ml/data/parsers/README.md

Lines changed: 30 additions & 0 deletions
@@ -0,0 +1,30 @@
# LuxonisML Parsers

The `LuxonisParser` class provides functionality for converting various dataset formats to the Luxonis Dataset Format (LDF).

## Table of Contents

- [LuxonisML Parsers](#luxonisml-parsers)
  - [Parameters](#parameters)
    - [Parse Method Parameters](#parse-method-parameters)

## Parameters

### LuxonisParser Constructor Parameters

| Parameter        | Type                                   | Default  | Description                                                                                                                      |
| ---------------- | -------------------------------------- | -------- | --------------------------------------------------------------------------------------------------------------------------------- |
| `dataset_dir`    | `str`                                  | Required | Path or URL to the dataset directory (local path, `gcs://`, `s3://`, or `roboflow://`)                                            |
| `dataset_name`   | `Optional[str]`                        | `None`   | Name for the dataset (if `None`, derived from the directory name)                                                                  |
| `save_dir`       | `Optional[Union[Path, str]]`           | `None`   | Where to save downloaded datasets if a remote URL is provided (if `None`, uses the current directory)                              |
| `dataset_plugin` | `Optional[str]`                        | `None`   | Dataset plugin to use (if `None`, uses `LuxonisDataset`)                                                                           |
| `dataset_type`   | `Optional[DatasetType]`                | `None`   | Force a specific dataset format type instead of auto-detection                                                                     |
| `task_name`      | `Optional[Union[str, Dict[str, str]]]` | `None`   | Task name(s) for the dataset. Used to link the classes to the desired tasks, with class names as keys and task names as values    |

### Parse Method Parameters

| Parameter      | Type                         | Default | Description                                                                                                                    |
| -------------- | ---------------------------- | ------- | ------------------------------------------------------------------------------------------------------------------------------- |
| `split`        | `Optional[str]`              | `None`  | Split name if parsing a single split                                                                                             |
| `random_split` | `bool`                       | `True`  | Whether to create random splits                                                                                                  |
| `split_ratios` | `Optional[Dict[str, float]]` | `None`  | Ratios for the train/validation/test splits. If set to `None`, the default behavior of `LuxonisDataset`'s `make_splits` is used  |
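
A usage sketch (not in the diff) combining the constructor and `parse()` with the split parameters from the tables above; the ratio values and split keys are illustrative:

```python
from luxonis_ml.data import LuxonisParser

parser = LuxonisParser(
    dataset_dir="path/to/dataset",
    dataset_name="my_dataset",
)

# Random 80/10/10 split; the ratios are forwarded to make_splits().
dataset = parser.parse(
    random_split=True,
    split_ratios={"train": 0.8, "val": 0.1, "test": 0.1},
)
```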

luxonis_ml/data/requirements.txt

Lines changed: 1 addition & 0 deletions
@@ -10,3 +10,4 @@ filelock~=3.0
bidict~=0.21
gdown~=4.7
defusedxml~=0.7
+pillow-heif<0.22.0
