If you're looking to use SHARK, check out our User Guide. Developers, read on.
The shortfin sub-project is SHARK's high-performance inference library and serving engine.
- API documentation for shortfin is available on readthedocs.
The SHARK Tank sub-project contains a collection of model recipes and conversion tools to produce inference-optimized programs.
- See the SHARK Tank Programming Guide for information about core concepts, the development model, dataset management, and more.
- See Direct Quantization with SHARK Tank for information about quantization support.
The SHARK Tuner sub-project assists with tuning program performance by searching for optimal parameter configurations to use during model compilation. Check out the readme for more details.
The SHARK Fuser sub-project is home to Fusilli, a C++ Graph API and Frontend to the IREE compiler and runtime stack for JIT compilation and execution of training and inference graphs. It allows us to expose cuDNN-like primitives backed by IREE code-generated kernels. Check out the readme for more details.
If you're looking to develop SHARK, check out our Developer Guide.