Skip to content
Change the repository type filter

All

    Repositories list

    • datahub-gma

      Public
      General Metadata Architecture
      Java
      601341320Updated Nov 28, 2025Nov 28, 2025
    • Efficient Triton Kernels for LLM Training
      Python
      4385.9k8033Updated Nov 28, 2025Nov 28, 2025
    • helix

      Public
      Mirror of Apache Helix
      Java
      2421011Updated Nov 27, 2025Nov 27, 2025
    • openhouse

      Public
      Open Control Plane for Tables in Data Lakehouse
      Java
      633721023Updated Nov 26, 2025Nov 26, 2025
    • venice

      Public
      Venice, Derived Data Platform for Planet-Scale Workloads.
      Java
      1085761723Updated Nov 26, 2025Nov 26, 2025
    • ambry

      Public
      Distributed object store
      Java
      2831.8k13118Updated Nov 25, 2025Nov 25, 2025
    • This is a read-only mirror of apache/gobblin
      Java
      4600Updated Nov 24, 2025Nov 24, 2025
    • zookeeper

      Public
      Mirror of Apache Hadoop ZooKeeper
      Java
      7.3k637Updated Nov 24, 2025Nov 24, 2025
    • ignite-3

      Public
      Apache Ignite 3
      Java
      133000Updated Nov 22, 2025Nov 22, 2025
    • The official AWS SDK for Java - Version 2
      Java
      964002Updated Nov 21, 2025Nov 21, 2025
    • coral

      Public
      Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
      Java
      2008725932Updated Nov 21, 2025Nov 21, 2025
    • iceberg

      Public
      A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
      Java
      3563025Updated Nov 20, 2025Nov 20, 2025
    • Multi-hop declarative data pipelines
      Java
      1412210Updated Nov 20, 2025Nov 20, 2025
    • Burrow

      Public
      Kafka Consumer Lag Checking
      Go
      8133.9k22519Updated Nov 20, 2025Nov 20, 2025
    • LiTr

      Public
      Lightweight hardware accelerated video/audio transcoder for Android.
      Java
      89638560Updated Nov 19, 2025Nov 19, 2025
    • Shake to send feedback for Android.
      Java
      54161105Updated Nov 13, 2025Nov 13, 2025
    • A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform inference.
      Scala
      5224930Updated Nov 13, 2025Nov 13, 2025
    • brooklin

      Public
      An extensible distributed system for reliable nearline data streaming at scale
      Java
      1409511716Updated Nov 11, 2025Nov 11, 2025
    • cruise-control

      Public
      Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
      Java
      6433k21237Updated Nov 6, 2025Nov 6, 2025
    • rest.li

      Public
      Rest.li is a REST+JSON framework for building robust, scalable service architectures using dynamic discovery and simple asynchronous APIs.
      Java
      5582.5k5157Updated Nov 4, 2025Nov 4, 2025
    • This repo is specifically for the Grace Hopper 2025 DS Workshop
      Jupyter Notebook
      3700Updated Nov 1, 2025Nov 1, 2025
    • Listing of all our public GitHub projects.
      JavaScript
      4664162Updated Nov 1, 2025Nov 1, 2025
    • transport

      Public
      A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
      Java
      743022411Updated Oct 30, 2025Oct 30, 2025
    • avro-util

      Public
      Collection of utilities to allow writing java code that operates across a wide range of avro versions.
      Java
      67855714Updated Oct 29, 2025Oct 29, 2025
    • Repo for talent-solutions-java-sdk project
      Java
      1100Updated Oct 27, 2025Oct 27, 2025
    • fmchisel

      Public
      fmchisel: Efficient Compression and Training Algorithms for Foundation Models
      Python
      87400Updated Oct 23, 2025Oct 23, 2025
    • goavro

      Public
      Goavro is a library that encodes and decodes Avro data.
      Go
      2291k6121Updated Oct 22, 2025Oct 22, 2025
    • diderot

      Public
      A fast and flexible implementation of the xDS protocol
      Go
      31800Updated Sep 17, 2025Sep 17, 2025
    • forthic

      Public
      Python
      72700Updated Sep 16, 2025Sep 16, 2025
    • DuaLip

      Public
      DuaLip: Dual Decomposition based Linear Program Solver
      Scala
      106510Updated Sep 8, 2025Sep 8, 2025