Releases: google-ai-edge/LiteRT
v2.1.0rc1
Release 2.1.0rc1
Major Features and Improvements
- NPU: Added support for Qualcomm Snapdragon Gen5
- NPU: Added support for MediaTek Dimensity 9500
- NPU: Added support for NPU JIT mode on Qualcomm and MediaTek
Bug Fixes and Other Changes
- Fixes Android min SDK version to 23.
- NPU: Fixes partition algorithm when the full model cannot be offloaded to NPU.
Breaking Changes
- Removed direct C headers usage. Users no longer need to include C headers.
- TensorBuffer::CreateManaged() always requires an Environment.
- All TensorBuffer creation requires an Environment, except for HostMemory types.
- LiteRT C++ constructors are hidden. All LiteRT C++ objects should be created by Create() methods.
- Moved internal-only C++ APIs (such as litert_logging.h) to litert/cc/internal
- Removed Tensor, Subgraph, and Signature access from litert::Model. Users can instead access SimpleTensor and SimpleSignature from CompiledModel.
- The CompiledModel::Create() API no longer needs a litert::Model. Compiled models can be created directly from a filename or from model buffers.
- Annotation and Metrics APIs are removed from CompiledModel.
- Removed individual OpaqueOptions creation. These OpaqueOptions objects are obtained from Options directly.
  - Options::GetCpuOptions()
  - Options::GetGpuOptions()
  - Options::GetRuntimeOptions()
  - …
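The hidden-constructor and "requires Environment" changes above follow a common C++ factory pattern. The sketch below illustrates that pattern only. Everything in the `demo` namespace is a hypothetical stand-in, not the LiteRT API, and the use of `std::optional` for failure reporting is an assumption made for brevity.

```cpp
#include <cassert>
#include <cstddef>
#include <optional>
#include <vector>

namespace demo {  // hypothetical names for illustration, not the LiteRT API

// Stand-in for a runtime environment handle.
class Environment {
 public:
  // The only way to obtain an Environment is through this factory.
  static std::optional<Environment> Create() { return Environment(); }

 private:
  Environment() = default;  // hidden: callers cannot construct directly
};

// Stand-in for a buffer whose allocation is tied to an environment.
class TensorBuffer {
 public:
  // Requires an Environment; returns std::nullopt on failure.
  static std::optional<TensorBuffer> CreateManaged(const Environment& env,
                                                   std::size_t num_bytes) {
    (void)env;  // a real runtime would allocate through the environment
    if (num_bytes == 0) return std::nullopt;  // reject empty buffers
    return TensorBuffer(num_bytes);
  }

  std::size_t size() const { return data_.size(); }

 private:
  explicit TensorBuffer(std::size_t num_bytes) : data_(num_bytes) {}
  std::vector<unsigned char> data_;
};

}  // namespace demo
```

With this shape, a caller writes `auto env = demo::Environment::Create();` followed by `demo::TensorBuffer::CreateManaged(*env, 16)`, and a direct `demo::TensorBuffer buf(16);` fails to compile because the constructor is private.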
v1.4.1
v2.0.3
Release 2.0.3
Major Features and Improvements
- Add Python backend for Google Tensor. The backend doesn't yet register itself, so it isn't available by default.
- Change manufacturer to Google and SoC models to include the Tensor_ prefix for Google Tensor.
- Minor naming changes to some flags for the Google Tensor compiler plugin.
Bug Fixes and Other Changes
- N/A
v2.0.2
Release 2.0.2
Major Features and Improvements
LiteRT GPU Accelerator
- Added an option to control GPU inference priority.
LiteRT API Refactoring
- Introduced target litert/cc:litert_api_with_dynamic_runtime. This is a convenience Bazel target containing the LiteRT C++ and C APIs. Users of this library are responsible for bundling the LiteRT C API Runtime, libLiteRtRuntimeCApi.so.
- C++ APIs that need the LiteRT C API Runtime are moved to litert/cc/dynamic_runtime/
  Note: This is for internal usage. If you want to use the dynamic API, use litert/cc:litert_api_with_dynamic_runtime.
- All static public C++ APIs (including litert/cc/internal) are moved to litert/cc/
  Note: You shouldn't mix static API targets with dynamic API targets.
Bug Fixes and Other Changes
- Fixed a segmentation fault error on //litert/tools:apply_plugin_test
- Refactored example backend compiler plugin and dispatch implementation.
- Improved LiteRT op coverage for Qualcomm and MediaTek backends.
v2.0.2a1
Release 2.0.2a1
LiteRT
Major Features and Improvements
Breaking Changes
- com.google.ai.edge.litert.TensorBufferRequirements
  - It becomes a data class, so all fields can be accessed directly without getter methods.
  - The type of field strides changes from IntArray to List<Int> to be immutable.
- com.google.ai.edge.litert.Layout
  - The type of fields dimensions and strides changes from IntArray to List<Int> to be immutable.
- Rename GPU option NoImmutableExternalTensorsMode to NoExternalTensorsMode
Known Caveats
Major Features and Improvements
- [tflite] Add error detection in TfLiteRegistration::init(). When a Delegate kernel returns TfLiteKernelInitFailed(), it is treated as a critical failure on the Delegate. This error is detected in SubGraph::ReplaceNodeSubsetsWithDelegateKernels() and causes Delegate::Prepare() to fail, ultimately leading InterpreterBuilder::operator() or Interpreter::ModifyGraphWithDelegate() to return an error.
- Added Profiler API in Compiled Model: source.
- Added Error reporter API in Compiled Model: source.
- Added resize input tensor API in Compiled Model: source.
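The failure chain described in the [tflite] item above can be sketched as a small model of how a sentinel returned from a kernel's init step bubbles up to the caller. This is an illustration under stated assumptions: all names in the `demo` namespace are hypothetical stand-ins, the sentinel's representation is invented, and none of this is the actual TFLite implementation.

```cpp
#include <cassert>
#include <functional>
#include <vector>

namespace demo {  // hypothetical stand-ins, not the TFLite types

enum class Status { kOk, kError };

// Sentinel that a kernel's init step returns to signal a critical failure.
// The address of a function-local static is used so it never collides with
// a legitimate user-data pointer (an assumption of this sketch).
inline void* KernelInitFailed() {
  static char sentinel;
  return &sentinel;
}

// An init step returns opaque kernel state, or the failure sentinel.
using InitFn = std::function<void*()>;

// Plays the role of replacing node subsets with delegate kernels: any
// kernel whose init step returns the sentinel fails the whole step.
inline Status ReplaceNodesWithDelegateKernels(
    const std::vector<InitFn>& inits) {
  for (const auto& init : inits) {
    if (init() == KernelInitFailed()) return Status::kError;
  }
  return Status::kOk;
}

// Delegate preparation fails when kernel replacement fails.
inline Status PrepareDelegate(const std::vector<InitFn>& inits) {
  return ReplaceNodesWithDelegateKernels(inits);
}

// The top-level call surfaces the error to the application.
inline Status ModifyGraphWithDelegate(const std::vector<InitFn>& inits) {
  return PrepareDelegate(inits);
}

}  // namespace demo
```

The point of the sketch is that the error is not swallowed at any layer: one failing kernel is enough for the outermost call to report an error, so the application can fall back to another execution path.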
Bug Fixes and Other Changes
- The Android minSdkVersion has increased to 23.
- Update tests to provide kLiteRtHwAcceleratorNpu for fully AOT compiled models.
LiteRT v1.4.0 release
Release 1.4.0
Bug Fixes and Other Changes
- Fixed support for 16 KB page sizes.
v2.0.0-alpha
Full Changelog: https://github.com/google-ai-edge/LiteRT/commits/v2.0.0-alpha
v1.2.0
Full Changelog: https://github.com/google-ai-edge/LiteRT/commits/v1.2.0
v1.0.1
This is the first release of LiteRT, the new name for TensorFlow Lite. Please see more details in this blog post.
In its current state, the LiteRT repository is not intended for open source development because it is pulling in existing TensorFlow code via a git submodule. We intend to evolve this repo to a point where developers can directly build and contribute here, at which time we will make a separate announcement.
This LiteRT release is pinned to TF commit 2adc36c and is compatible with the following packages:
- PyTorch support:
  - ai-edge-torch-0.2.0
  - ai-edge-torch-nightly releases prior to 0.3.0.dev20240828
- TensorFlow & JAX support:
  - TensorFlow 2.17.0
  - tf-nightly releases prior to 2.18.0.dev20240828
Prebuilt artifacts for this release: