Skip to content
Change the repository type filter

All

    Repositories list

    • This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.
      Scala
      31600Updated Oct 3, 2025Oct 3, 2025
    • Python DataSource for Apache Spark 4 to read ROOT files (High Energy Physics) as DataFrames, powered by uproot, awkward, and PyArrow.
      Python
      0100Updated Oct 2, 2025Oct 2, 2025
    • Grafana Mimir dashboards used for cardinality exploration
      95710Updated Sep 17, 2025Sep 17, 2025
    • Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
      Dockerfile
      2012900Updated Aug 29, 2025Aug 29, 2025
    • Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
      Jupyter Notebook
      61700Updated May 13, 2025May 13, 2025
    • Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.
      Scala
      149330Updated May 9, 2025May 9, 2025
    • Spark Executor Plugins Examples for Spark 2.4
      Java
      2600Updated May 7, 2025May 7, 2025
    • Contrib repository for the OpenTelemetry Collector
      Go
      3.1k000Updated Apr 12, 2025Apr 12, 2025
    • Mirror of CERN db/hadoop-xrootd. Hadoop-XRootD Filesystem Connector
      Java
      3631Updated Sep 25, 2024Sep 25, 2024
    • Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
      Jupyter Notebook
      143100Updated Jun 11, 2024Jun 11, 2024
    • argo-helm

      Public
      ArgoProj Helm Charts
      Mustache
      2k000Updated May 28, 2024May 28, 2024
    • This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery
      Jupyter Notebook
      1100Updated May 16, 2024May 16, 2024
    • Aiven's JDBC Sink and Source Connectors for Apache Kafka®
      Java
      58000Updated Nov 8, 2023Nov 8, 2023
    • zkpolicy

      Public
      Zookeeper Policy Audit Tool (aka zkPolicy) for checking and enforcing ACLs on ZNodes.
      Java
      1710Updated Oct 25, 2023Oct 25, 2023
    • dbod-api

      Public
      DB On Demand API
      Python
      3492Updated Aug 14, 2023Aug 14, 2023
    • TF-Spawner is an experimental tool for running TensorFlow distributed training on Kubernetes clusters.
      Python
      2800Updated Mar 22, 2023Mar 22, 2023
    • Unified RESTful interface for managing CERNs data storage back-ends
      Python
      2712Updated Jan 31, 2022Jan 31, 2022
    • Python Re-implementation of the cern-get-sso-cookie functionality
      Python
      61110Updated Jan 11, 2022Jan 11, 2022
    • Analyzes network traffic of HBase RegionServers
      Clojure
      5100Updated Nov 5, 2021Nov 5, 2021
    • A re-implementation of (parts of) NetApp's ZAPI in idiomatic Python using Requests
      Python
      1300Updated Sep 13, 2021Sep 13, 2021
    • binderhub

      Public
      Run your code in the cloud, with technology so advanced, it feels like magic!
      Python
      398000Updated Aug 19, 2021Aug 19, 2021
    • Java
      0350Updated Mar 12, 2021Mar 12, 2021
    • Set of valves classes that helps CERN applications with the integration in the CERN Authentication
      Java
      0200Updated Oct 22, 2020Oct 22, 2020
    • This image generates configuration and war files for Oracle Rest DataServices based on data provided by dadEdit3 database.
      Python
      0000Updated Sep 4, 2020Sep 4, 2020
    • dbod-web

      Public
      Future DB On Demand Web Interface implementation
      TypeScript
      35152Updated Aug 28, 2020Aug 28, 2020
    • dbod-core

      Public
      DB On Demand management infrastructure core library
      Perl
      05200Updated Apr 1, 2020Apr 1, 2020
    • Java
      0001Updated Mar 5, 2020Mar 5, 2020
    • TypeScript
      1000Updated Feb 20, 2020Feb 20, 2020
    • HDFS Connector for Oracle Cloud Infrastructure
      Java
      27000Updated Jan 20, 2020Jan 20, 2020
    • Rundeck plugin running jobs on Nomad cluster.
      Java
      6000Updated Aug 9, 2019Aug 9, 2019