Skip to content
This repository has been archived by the owner on Jul 25, 2024. It is now read-only.
/ labelspark Public archive

This library makes it easy to take unstructured data in your Data Lake and prepare it for analysis and AI work in Databricks. The Labelbox Connector for Apache Spark takes in a Spark DataFrame to create a dataset in Labelbox, and it also brings labeled, structured data back into Databricks also as a Spark DataFrame.

License

Notifications You must be signed in to change notification settings

Labelbox/labelspark

Repository files navigation

Data connector libraries

Starting in July 2024, we will begin making all data connector libraries private, including labelspark, labelpandas, labelsnow, and labelbox-bigquery libraries. To import data from remote sources such as Databricks and Snowflake, set up Census integrations directly on the Labelbox platform.

About

This library makes it easy to take unstructured data in your Data Lake and prepare it for analysis and AI work in Databricks. The Labelbox Connector for Apache Spark takes in a Spark DataFrame to create a dataset in Labelbox, and it also brings labeled, structured data back into Databricks also as a Spark DataFrame.

Resources

License

Security policy

Stars

Watchers

Forks

Packages

No packages published