MLOps Tasks & Experiments:
- Inference Engine - Async/Sync Generator 👨🏻💻
- Embeddings Engine - Sparse, Dense, etc. 👨🏻💻
- Retriever Engine - Hybrid, etc. 👨🏻💻
- Hyperparameter fine-tuner [Axolotl, Unsloth, etc.]
- Evaluations (SFT, PEFT, etc.) [TODO]
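
For the Inference Engine item, the sync/async generator split could look roughly like the sketch below. Everything in it is an assumption made for illustration: `fake_model_tokens`, `generate_sync`, `generate_async`, and `collect` are hypothetical names, and the "model" simply echoes the prompt word by word rather than calling a real backend.

```python
import asyncio
from typing import AsyncIterator, Iterator


def fake_model_tokens(prompt: str) -> list[str]:
    # Hypothetical stand-in for a real model call: echoes the prompt word by word.
    return prompt.split()


def generate_sync(prompt: str) -> Iterator[str]:
    """Yield tokens one at a time; the caller blocks while each token is produced."""
    for token in fake_model_tokens(prompt):
        yield token


async def generate_async(prompt: str) -> AsyncIterator[str]:
    """Yield the same tokens, handing control back to the event loop between them."""
    for token in fake_model_tokens(prompt):
        await asyncio.sleep(0)  # placeholder for awaiting a real backend
        yield token


async def collect(prompt: str) -> list[str]:
    # Drain the async generator into a list, e.g. for a non-streaming response.
    return [token async for token in generate_async(prompt)]
```

The sync variant suits scripts and batch jobs, while the async variant lets a server interleave many streaming requests on one event loop.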
MLOps Providers:
- BentoML 👨🏻💻
- Ray 👨🏻💻
- ...
Quantization techniques:
- GPTQ 👨🏻💻
- AWQ 👨🏻💻
- GGUF (local inference, smaller devices, etc.) []
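
As a reference point for the techniques above, here is a minimal sketch of plain symmetric round-to-nearest quantization on a single weight group. It is not GPTQ or AWQ, both of which choose quantized values more carefully than this. It only illustrates the basic "scale, round, clamp" step that weight quantization builds on. The function names `quantize_rtn` and `dequantize` are hypothetical.

```python
def quantize_rtn(weights: list[float], bits: int = 4) -> tuple[list[int], float]:
    """Symmetric round-to-nearest quantization of one weight group (baseline only)."""
    qmax = 2 ** (bits - 1) - 1  # largest representable magnitude, e.g. 7 for 4 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0  # avoid dividing by zero
    # Scale each weight, round to the nearest integer, and clamp to [-qmax, qmax].
    quantized = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized: list[int], scale: float) -> list[float]:
    """Map integer codes back to approximate floating-point weights."""
    return [q * scale for q in quantized]
```

With this scheme each restored weight differs from the original by at most half a quantization step (`scale / 2`), which is the error the more advanced methods try to spend more wisely.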