Highlights
- Pro
Lists (29)
Sort Name ascending (A-Z)
analytics
ANN
blogs
cloud
contrastive
cpplib
crypto
dl
frontend
golib
interesting-reads
k8s
kv
learned-index
llm
mec
mllib
mlsys
python
rag
rdb
recsys
replication
serverless
sgx
tools
tsdb
vectordb
WebAssembly
Stars
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’
A working example of a NestJS project using PassportJWT
A pytorch quantization backend for optimum
High-performance Go package to read and write Parquet files
Goavro is a library that encodes and decodes Avro data.
An implementation of a deep learning recommendation model (DLRM)
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
DSPy: The framework for programming—not prompting—language models
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …
FlashInfer: Kernel Library for LLM Serving
Fast and memory-efficient exact attention
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Performance-portable, length-agnostic SIMD with runtime dispatch
NUS CS5284 Graph Machine Learning course, Xavier Bresson, 2024
A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)