Skip to content
Change the repository type filter

All

    Repositories list

    • Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
      Dockerfile
      Apache License 2.0
      2311110Updated Nov 15, 2024Nov 15, 2024
    • Contrib repository for the OpenTelemetry Collector
      Go
      Apache License 2.0
      2.4k000Updated Oct 19, 2024Oct 19, 2024
    • Mirror of CERN db/hadoop-xrootd. Hadoop-XRootD Filesystem Connector
      Java
      Apache License 2.0
      3631Updated Sep 25, 2024Sep 25, 2024
    • Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
      Jupyter Notebook
      Apache License 2.0
      132900Updated Jun 11, 2024Jun 11, 2024
    • argo-helm

      Public
      ArgoProj Helm Charts
      Mustache
      Apache License 2.0
      1.9k000Updated May 28, 2024May 28, 2024
    • Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
      Jupyter Notebook
      Creative Commons Attribution 4.0 International
      51200Updated May 23, 2024May 23, 2024
    • This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery
      Jupyter Notebook
      Apache License 2.0
      0100Updated May 16, 2024May 16, 2024
    • Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.
      Scala
      Apache License 2.0
      158531Updated Apr 2, 2024Apr 2, 2024
    • This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.
      Scala
      Apache License 2.0
      31400Updated Mar 11, 2024Mar 11, 2024
    • Aiven's JDBC Sink and Source Connectors for Apache Kafka®
      Java
      Apache License 2.0
      57000Updated Nov 8, 2023Nov 8, 2023
    • zkpolicy

      Public
      Zookeeper Policy Audit Tool (aka zkPolicy) for checking and enforcing ACLs on ZNodes.
      Java
      MIT License
      1710Updated Oct 25, 2023Oct 25, 2023
    • Grafana Mimir dashboards used for cardinality exploration
      Apache License 2.0
      52620Updated Oct 10, 2023Oct 10, 2023
    • dbod-api

      Public
      DB On Demand API
      Python
      GNU General Public License v3.0
      3492Updated Aug 14, 2023Aug 14, 2023
    • TF-Spawner is an experimental tool for running TensorFlow distributed training on Kubernetes clusters.
      Python
      Apache License 2.0
      2800Updated Mar 22, 2023Mar 22, 2023
    • Unified RESTful interface for managing CERNs data storage back-ends
      Python
      GNU General Public License v3.0
      2712Updated Jan 31, 2022Jan 31, 2022
    • Python Re-implementation of the cern-get-sso-cookie functionality
      Python
      61110Updated Jan 11, 2022Jan 11, 2022
    • Analyzes network traffic of HBase RegionServers
      Clojure
      Apache License 2.0
      5100Updated Nov 5, 2021Nov 5, 2021
    • A re-implementation of (parts of) NetApp's ZAPI in idiomatic Python using Requests
      Python
      GNU General Public License v3.0
      1300Updated Sep 13, 2021Sep 13, 2021
    • binderhub

      Public
      Run your code in the cloud, with technology so advanced, it feels like magic!
      Python
      BSD 3-Clause "New" or "Revised" License
      390000Updated Aug 19, 2021Aug 19, 2021
    • Java
      GNU General Public License v3.0
      0350Updated Mar 12, 2021Mar 12, 2021
    • Set of valves classes that helps CERN applications with the integration in the CERN Authentication
      Java
      GNU General Public License v3.0
      0100Updated Oct 22, 2020Oct 22, 2020
    • This image generates configuration and war files for Oracle Rest DataServices based on data provided by dadEdit3 database.
      Python
      GNU General Public License v3.0
      0000Updated Sep 4, 2020Sep 4, 2020
    • dbod-web

      Public
      Future DB On Demand Web Interface implementation
      TypeScript
      MIT License
      35152Updated Aug 28, 2020Aug 28, 2020
    • dbod-core

      Public
      DB On Demand management infrastructure core library
      Perl
      GNU General Public License v3.0
      15200Updated Apr 1, 2020Apr 1, 2020
    • Java
      GNU General Public License v3.0
      0001Updated Mar 5, 2020Mar 5, 2020
    • TypeScript
      GNU General Public License v3.0
      1000Updated Feb 20, 2020Feb 20, 2020
    • HDFS Connector for Oracle Cloud Infrastructure
      Java
      Other
      26000Updated Jan 20, 2020Jan 20, 2020
    • Spark Executor Plugins Examples for Spark 2.4
      Java
      Apache License 2.0
      1610Updated Sep 9, 2019Sep 9, 2019
    • Rundeck plugin running jobs on Nomad cluster.
      Java
      MIT License
      6000Updated Aug 9, 2019Aug 9, 2019
    • Python
      Apache License 2.0
      0201Updated Aug 5, 2019Aug 5, 2019