    Repositories list

    • nyuntam

      Public
      Python
      1267873 Updated Jan 22, 2025
    • This is the official documentation for nyuntam
      Python
      0300 Updated Jan 21, 2025
    • Python
      0221 Updated Jan 15, 2025
    • Python
      21012 Updated Dec 21, 2024
    • Python
      1732 Updated Oct 28, 2024
    • lmquant

      Public
      Python
      0301 Updated Oct 25, 2024
    • Python
      0800 Updated Oct 25, 2024
    • PatchGD

      Public
      Python
      1400 Updated Sep 5, 2024
    • C++
      0000 Updated Aug 22, 2024
    • qserve

      Public
      QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
      Python
      52000 Updated Aug 2, 2024
    • FLAP

      Public
      Patch for Grouped Query Attention
      Python
      17201 Updated Aug 2, 2024
    • AQLM

      Public
      Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
      Python
      188000 Updated Aug 1, 2024
    • Python
      0000 Updated Jul 1, 2024
    • SFSD-LLM

      Public
      Python
      1600 Updated May 31, 2024
    • PruneGPT

      Public
      Python
      35100 Updated May 31, 2024
    • Python
      64230 Updated Apr 23, 2024