These tutorials implement an end-to-end XGBoost application, including:
- Distributed data preprocessing and model training: Ingest and preprocess data at scale using Ray Data, then train a distributed XGBoost model using Ray Train (a minimal sketch follows this list). See Distributed training of an XGBoost model.
- Model validation using offline inference: Evaluate the trained model with Ray Data offline batch inference (sketched below). See Model validation using offline batch inference.
- Online model serving: Deploy the model as a scalable online service using Ray Serve (sketched below). See Scalable online XGBoost inference with Ray Serve.
- Production deployment: Create production batch Jobs for the offline workloads, including data preprocessing, training, and batch prediction, and optionally deploy online Services.
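
The following is a minimal sketch of the preprocessing-and-training step using Ray Data and the Ray Train `XGBoostTrainer`. The data path, label column name, worker count, and hyperparameters are placeholders, not values from the tutorials.

```python
import ray
from ray.train import ScalingConfig
from ray.train.xgboost import XGBoostTrainer

# Ingest and preprocess data at scale with Ray Data.
# "s3://example-bucket/training-data/" and the "target" column are hypothetical.
dataset = ray.data.read_parquet("s3://example-bucket/training-data/")
train_ds, valid_ds = dataset.train_test_split(test_size=0.2)

# Train a distributed XGBoost model with Ray Train.
trainer = XGBoostTrainer(
    scaling_config=ScalingConfig(num_workers=4, use_gpu=False),
    label_column="target",
    params={"objective": "binary:logistic", "eval_metric": ["logloss", "error"]},
    datasets={"train": train_ds, "valid": valid_ds},
    num_boost_round=100,
)
result = trainer.fit()
print(result.metrics)            # final training/validation metrics
checkpoint = result.checkpoint   # persisted model for validation and serving
```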
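
For the validation step, a sketch of offline batch inference with Ray Data `map_batches` is shown below. It assumes the trained booster was exported to a local file such as `model.ubj` (a placeholder; the tutorial covers extracting the model from the training checkpoint) and that the test data has the same hypothetical `target` column.

```python
import pandas as pd
import xgboost
import ray

class XGBoostPredictor:
    """Callable class that loads the booster once per actor and scores batches."""

    def __init__(self, model_path: str):
        self.model = xgboost.Booster()
        self.model.load_model(model_path)

    def __call__(self, batch: pd.DataFrame) -> pd.DataFrame:
        features = batch.drop(columns=["target"])
        batch["prediction"] = self.model.predict(xgboost.DMatrix(features))
        return batch

test_ds = ray.data.read_parquet("s3://example-bucket/test-data/")  # hypothetical path
predictions = test_ds.map_batches(
    XGBoostPredictor,
    fn_constructor_kwargs={"model_path": "model.ubj"},
    batch_format="pandas",
    concurrency=4,  # number of inference actors
)

# Compute a simple validation metric from the scored batches.
scored = predictions.to_pandas()
accuracy = ((scored["prediction"] > 0.5) == scored["target"]).mean()
print(f"accuracy: {accuracy:.3f}")
```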
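
For the serving step, the sketch below deploys the same booster behind a Ray Serve HTTP deployment. The deployment name, replica count, model path, and request payload shape are assumptions for illustration only.

```python
import pandas as pd
import xgboost
from starlette.requests import Request
from ray import serve

@serve.deployment(num_replicas=2)
class XGBoostService:
    def __init__(self, model_path: str):
        self.model = xgboost.Booster()
        self.model.load_model(model_path)

    async def __call__(self, request: Request) -> dict:
        payload = await request.json()  # e.g. {"features": {"f1": 0.1, "f2": 3}}
        features = pd.DataFrame([payload["features"]])
        prediction = self.model.predict(xgboost.DMatrix(features))
        return {"prediction": float(prediction[0])}

app = XGBoostService.bind(model_path="model.ubj")
# serve.run(app)  # start the service, then POST JSON to http://localhost:8000/
```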
```{toctree}
:hidden:

notebooks/01-Distributed_Training
notebooks/02-Validation
notebooks/03-Serving
```