Skip to content

Releases: open-edge-platform/datumaro

Release v1.12.0

25 Sep 09:19
ad204a8

Choose a tag to compare

This release streamlines Datumaro by removing a number of lesser-used features, helping to simplify the tool and reduce its dependencies. These changes are part of an effort to keep Datumaro focused on its core strengths: dataset management and integration with machine learning frameworks. As part of this update, inference-related features have been removed. For inference tasks, we recommend using the OpenVINO model API. If you rely on a specific feature that is no longer available, you can still access it from the previous version of Datumaro.

Removed features

  • CLI commmands:
    • explain, explore, generate, prune
      (#1812, #1813, #1814, #1815, #1880)
    • model: add, remove, run, info
      (#1816)
    • project: add, create, export, import, remove, checkout, commit, log, info, status
      (#1816)
    • source: import, add, remove
      (#1816)
  • API features:
  • SAM Docker image
    (#1830)

New features

Enhancements

  • Mark several dependencies as optional
    (#1849, #1862)
  • Removal of unneeded dependencies
    (#1837)
  • Documentation tidy-up
    (#1840)
  • DCO introduction; readme, PR template, and contribution guide tidy-up
    (#1844, #1860, #1856, #1847)
  • Fix code coverage upload to Codecov in the CI
    (#1861)
  • Fix crashes with certain datasets in the compare command
    (#1892)
  • Added Semgrep security scan in the CI
    (#1883)

Release v1.11.1

28 Jul 12:02
c2242c7

Choose a tag to compare

Enhancements

  • Bump version of NumPy and OpenVINO
    (#1803)
  • Support for Python 3.13 and MacOS
    (#1804)

Release v1.11.0

18 Jul 08:06
35a319c

Choose a tag to compare

This release includes a significant number of deprecations in the CLI and API.
This is a one-off action to remove unused features as well as features
such as inference which do not fit well in Datumaro. We intend to remove those
features in Datumaro 1.12.0.

New features

  • Convert Cuboid2D annotation to/from 3D data
    (#1639)
  • Add label groups for hierarchical classification in ImageNet
    (#1645)

Enhancements

  • Add non-strict mode to JsonPageMapper in rust API and enable it for COCO
    (#1753)
  • Enhance 'id_from_image_name' transform to ensure each identifier is unique
    (#1635)
  • Optimize path assignment to handle point cloud in JSON without images
    (#1643)
  • Add documentation for framework conversion
    (#1659)

Bug fixes

  • Fix assertion to compare hashkeys against expected value
    (#1641)
  • Mark pyemd as optional since it does not support Python 3.12
    (#1770)

Deprecations

  • Added deprecation to the following CLI commmands:
    • explain, explore, generate, prune
    • model: add, remove, run, info
    • project: add, create, export, import, remove, checkout, commit, log, info, status
    • source: import, add, remove
      (#1792)
  • Added deprecation notices to the following features that will soon be removed:
    • Model inference
    • Model-based transformations
    • Crypter
    • Synthetic dataset generation
    • Data exploration
    • BBox to mask using SAM
    • Telemetry
    • Anchor generation
    • Missing annotation detection
    • Model inference explanation
    • Near-duplicate removal
    • Pruning
    • Pseudo-labels
      (#1776, #1780, #1792, #1795)
  • Deprecation of the SAM Docker image
    (#1783)
  • Deprecation of Project and related features (#1793)

Release v1.10.0

14 Mar 13:08
22c3121

Choose a tag to compare

New features

  • Add default position information to PointsCategories class
    (#1702)
  • Support KITTI 3D format
    (#1619)
    (#1621)
  • Add PseudoLabeling transform for unlabeled dataset
    (#1594)

Enhancements

  • Raise an appropriate error when exporting a datumaro dataset if its subset name contains path separators.
    (#1615)
  • Update docs for transform plugins
    (#1599)
  • Update ov ir model for explorer openvino launcher with CLIP ViT-L/14@336px model
    (#1603)
  • Optimize path assignment to handle point cloud in JSON without images
    (#1643)
  • Set TabularTransform to process clean transform in parallel
    (#1648)
  • Add support for Python 3.12
    (#1701)

Bug fixes

  • Fix datumaro format to load visibility information from Points annotations
    (#1644)

Release v1.10.0rc1

23 Oct 05:09
0a4abc4

Choose a tag to compare

Release v1.10.0rc1 Pre-release
Pre-release

What's Changed

  • Update ov ir model for explorer openvino launcher with CLIP ViT-L/14@336px model by @sooahleex in #1603
  • Fix datumaro keypoint loading by @jihyeonyi in #1644
  • Update assets for explorer by @sooahleex in #1647
  • Optimize path assignment to handle point cloud in JSON without images by @sooahleex in #1649
  • Fix to get image of tabular data in FrameConverter by @sooahleex in #1650
  • Set TabularTransform to process clean transform in parallel by @sooahleex in #1648

Full Changelog: v1.10.0rc0...v1.10.0rc1

Release v1.10.0rc0

29 Sep 23:58
fcdb37d

Choose a tag to compare

Release v1.10.0rc0 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v1.9.1...v1.10.0rc0

Release v1.9.1

29 Sep 23:57
4619baf

Choose a tag to compare

What's Changed - Brief version

Enhancements

  • Support multiple labels for kaggle format
    (#1607)
  • Use DataFrame.map instead of DataFrame.applymap
    (#1613)

Bug fixes

  • Fix StreamDataset merging when importing in eager mode
    (#1609)

What's Changed - Full version

Full Changelog: v1.9.0...v1.9.1

Release 1.9.0

11 Sep 07:34
6ec38e5

Choose a tag to compare

What's Changed - Brief version

New features

  • Add a new CLI command: datum format
    (#1570)
  • Support language dataset for DmTorchDataset
    (#1592)

Enhancements

  • Change _Shape to Shape and add comments for subclasses of Shape
    (#1568)

Bug fixes

  • Fix KITTI-3D importer and exporter
    (#1596)

What's Changed - Full version

Full Changelog: v1.8.0...v1.9.0

Release 1.8.0

12 Jul 06:14
1669c4f

Choose a tag to compare

What's Changed - Brief version

New features

  • Add TabularValidator
    (#1498)
  • Add Clean Transform for tabular data type
    (#1520)

Enhancements

  • Set label name with parents to avoid duplicates for AstypeAnnotations
    (#1492)
  • Pass Keyword Argument to TabularDataBase
    (#1522)
  • Support hierarchical structure for ImageNet dataset format
    (#1528)
  • Enable dtype argument when calling media.data
    (#1546)

Bug fixes

  • Preserve end_frame information of a video when it is zero.
    (#1541)
  • Changed the Datumaro format to ensure exported videos have relative paths and to prevent the same video from being overwritten.
    (#1547)

What's Changed - Full version

Full Changelog: v1.7.0...v1.8.0

Release 1.7.0

10 Jun 06:58
4d3b307

Choose a tag to compare

What's Changed - Brief Version

New features

  • Support 'Video' media type in datumaro format
    (#1491)
  • Add ann_types property for dataset
    (#1422, #1479)
  • Add AnnotationType.rotated_bbox for oriented object detection
    (#1459)
  • Add DOTA data format for oriented object detection task
    (#1475)
  • Add AstypeAnnotations Transform
    (#1484)
  • Enhance DatasetItem annotations for semantic segmentation model training use case
    (#1503)

Enhancements

  • Fix ambiguous COCO format detector
    (#1442)
  • Get target information for tabular dataset
    (#1471)
  • Add ExtractedMask and update importers who can use it to use it
    (#1480)
  • Improve PIL and COLOR_BGR context image decode performance
    (#1501)
  • Improve get_area() of Polygon through Shoelace formula
    (#1507)
  • Improve _Shape point converter
    (#1508)

Bug fixes

  • Split the video directory into subsets to avoid overwriting
    (#1485)

What's Changed - Full Version

Full Changelog: v1.6.1...v1.7.0