GeoParquet: All batches must have same schema #1199

Open
@kylebarron

from obstore.auth.planetary_computer import PlanetaryComputerCredentialProvider
from obstore.store import AzureStore
from lonboard import viz
import shapely
from geoarrow.rust.io import GeoParquetDataset
from arro3.core import struct_field
from urllib.parse import urlparse

credential_provider = PlanetaryComputerCredentialProvider(
    account_name="pcstacitems",
    container_name="items",
)
store = AzureStore(credential_provider=credential_provider)

files = store.list("naip.parquet").collect()
dataset = GeoParquetDataset(files, store=store)
all_data = dataset.read(parse_to_native=False)

raises:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[25], line 1
----> 1 all_data = dataset.read(parse_to_native=False)

TypeError: All batches must have same schema
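
One way to narrow this down might be to group the files by schema and see which ones differ. The sketch below is not part of `geoarrow.rust.io`; it uses only the standard library and assumes each file's schema has already been collected as an ordered list of `(column_name, type_name)` pairs (for example with something like `pyarrow.parquet.read_schema`, which is an assumption about the tooling at hand). The helper names, file names, and column names are hypothetical.

```python
from collections import defaultdict


def group_by_schema(file_schemas):
    """Group file paths by schema so mismatched files stand out.

    `file_schemas` maps a path to an ordered list of
    (column_name, type_name) pairs. How those pairs are obtained
    is left to the caller.
    """
    groups = defaultdict(list)
    for path, fields in file_schemas.items():
        # A tuple of pairs is hashable, so it can serve as the group key.
        groups[tuple(fields)].append(path)
    return dict(groups)


def describe_differences(reference, other):
    """Report columns missing, added, or retyped in `other` vs `reference`."""
    ref, oth = dict(reference), dict(other)
    return {
        "missing": sorted(ref.keys() - oth.keys()),
        "added": sorted(oth.keys() - ref.keys()),
        "retyped": sorted(
            name for name in ref.keys() & oth.keys() if ref[name] != oth[name]
        ),
    }


# Hypothetical file names and schemas, for illustration only.
example = {
    "a.parquet": [("id", "string"), ("geometry", "binary")],
    "b.parquet": [("id", "string"), ("geometry", "binary")],
    "c.parquet": [("id", "string"), ("geometry", "binary"), ("gsd", "double")],
}

groups = group_by_schema(example)
diff = describe_differences(example["a.parquet"], example["c.parquet"])
```

If more than one group comes back, the smaller groups point at the files whose batches would not concatenate with the rest, and `describe_differences` shows which columns are responsible.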
