Must be developed on dsmlp-login, as it is coupled to that system.

start-cluster.sh and stop-cluster.sh are meant to run as Kubernetes lifecycle hooks, starting and stopping the cluster as the pod starts and stops.
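As a sketch, wiring the two scripts in as lifecycle hooks would look roughly like the pod-spec fragment below (the script paths and shell invocation are illustrative assumptions; the actual spec is produced by the Helm chart):

```yaml
lifecycle:
  postStart:
    exec:
      # Assumed path: bring the Spark cluster up when the container starts.
      command: ["/bin/sh", "-c", "start-cluster.sh"]
  preStop:
    exec:
      # Assumed path: tear the Spark cluster down before the container stops.
      command: ["/bin/sh", "-c", "stop-cluster.sh"]
```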
The cluster and master/worker nodes may be configured via the following environment variables.
SPARK_CHART_NAME
: Helm chart used to instantiate the Spark cluster
: Default =
SPARK_CLUSTER_IMAGE_REGISTRY
: Container registry hosting the cluster node image
: Default = ghcr.io
SPARK_CLUSTER_IMAGE_REPO
: Image repository for the cluster node image
: Default = ucsd-ets/spark-node
SPARK_CLUSTER_IMAGE_TAG
: Image tag for the cluster node image
: Default = fa22-3
SPARK_CLUSTER_MASTER_CPU
: Number of CPU cores assigned to the Master node (sets the Kubernetes request and limit)
: Default = 2
SPARK_CLUSTER_MASTER_MEM
: Memory assigned to the Master node (sets the Kubernetes request and limit)
: Default = 8G
SPARK_CLUSTER_WORKER_CPU
: Number of CPU cores assigned to each Worker node (sets the Kubernetes request and limit)
: Default = 2
SPARK_CLUSTER_WORKER_MEM
: Memory assigned to each Worker node (sets the Kubernetes request and limit)
: Default = 20G
SPARK_CLUSTER_WORKER_APP_MEM
: Spark application memory limit (should be ~2G less than SPARK_CLUSTER_WORKER_MEM)
: Default = 18G
SPARK_CLUSTER_REPLICAS
: Number of worker nodes to start up
: Default = 3
SPARK_CLUSTER_RUNASGROUP
: Primary Unix group ID assigned to cluster nodes
: Default = 0
SPARK_CLUSTER_FSGROUP
: Supplemental Unix group ID assigned to cluster nodes
: Default = 0