Welcome to the Apache Sedona Tutorial 🗺️ – a hands-on guide to large-scale spatial data processing using Apache Sedona with Python and Jupyter.
This repo contains a Jupyter Notebook that walks through how to use Sedona for scalable spatial operations.
This tutorial uses UV for package management and sdkman to install Java.
curl -s "https://get.sdkman.io" | bash
source "$HOME/.sdkman/bin/sdkman-init.sh"
sdk install java 17.0.13-zulu
# Confirm install
java -version
echo $JAVA_HOMEpip3 install uv
# Alternatively, via shell script
curl -LsSf https://astral.sh/uv/install.sh | shuv pip install -r pyproject.tomluv run ipython kernel install --user --name=SedonaDemouv run --with jupyter jupyter labThis notebook depends on sample geospatial data. You can download a zip file of all the data and place it in the main directory after download.
Please download and place the data in a local folder before running the notebook.
- 🔹 Coordinate reference system transformations
- 🔹 Spatial joins
- 🔹 Raster + vector handling (with extensions)
- 🔹 Integration with SparkSQL
- Python 3.9+
- Java 17
- Jupyter Lab
- Apache Sedona (via PySpark)
This project is licensed under the MIT License.
Made with ❤️ for spatial data engineers.