"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,566 1,769 Updated Mar 7, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,663 193 Updated Mar 4, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,723 2,387 Updated Aug 12, 2024

Official Repo for Paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"

Python 340 46 Updated Mar 8, 2025
C++ 13 6 Updated Jan 12, 2024

A working example of a NestJS project using PassportJWT

TypeScript 26 5 Updated Jan 15, 2024

A pytorch quantization backend for optimum

Python 892 71 Updated Mar 6, 2025

High-performance Go package to read and write Parquet files

Go 396 60 Updated Mar 2, 2025

Goavro is a library that encodes and decodes Avro data.

Go 1,010 223 Updated Jan 20, 2025

Official Go implementation of Apache Arrow

Assembly 112 18 Updated Mar 7, 2025
C++ 37 8 Updated Jun 10, 2023

An implementation of a deep learning recommendation model (DLRM)

Python 3,837 851 Updated Oct 11, 2024

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,598 252 Updated Mar 4, 2025

🙌 OpenHands: Code Less, Make More

Python 49,355 5,431 Updated Mar 7, 2025

DSPy: The framework for programming—not prompting—language models

Python 22,328 1,707 Updated Mar 7, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,454 169 Updated Jun 25, 2024

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …

C 1,269 73 Updated Feb 26, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,310 239 Updated Mar 8, 2025

Uniform Manifold Approximation and Projection

Python 7,650 821 Updated Feb 28, 2025

Fast and memory-efficient exact attention

Python 16,152 1,528 Updated Mar 7, 2025

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 8,166 638 Updated Mar 8, 2025

Gorse open source recommender system engine

Go 8,812 803 Updated Mar 7, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 131,702 10,827 Updated Mar 8, 2025
Python 4 Updated Dec 7, 2022

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

11,238 2,352 Updated Mar 4, 2025

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 4,444 333 Updated Mar 7, 2025

NUS CS5284 Graph Machine Learning course, Xavier Bresson, 2024

Jupyter Notebook 65 9 Updated Nov 11, 2024

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

C++ 19,142 3,088 Updated Jan 5, 2025

Expressive Vector Engine - SIMD in C++ Goes Brrrr

C++ 1,159 61 Updated Mar 7, 2025