Skip to content

Data access from jetstream2 object store is failing #13

@brian-rose

Description

@brian-rose

Builds are currently failing with a data access problem.

The notebook has a cell that accesses a series of netcdf files stored in the Pythia object store. Paths to the data are created as follows:

URL = 'https://js2.jetstream-cloud.org:8001/' #Locate and read a file
path = f'pythia/cesmLME' # specify data location
fs = fsspec.filesystem("s3", anon=True, client_kwargs=dict(endpoint_url=URL)) 
pattern = f's3://{path}/*.nc'
files = sorted(fs.glob(pattern))
base_name = 'pythia/cesmLME/b.ie12.B1850C5CN.f19_g16.LME.002.cam.h0.'
time_period =  '085001-184912'
names = [name for name in files if base_name in name and time_period in name]
fileset = [fs.open(file) for file in names]

and the files are then opened with

for idx,item in enumerate(fileset):
    ds_u = xr.open_dataset(item)

This was working fine until recently. Now it's throwing an xarray error

ValueError: did not find a match in any of xarray's currently installed IO backends ['h5netcdf', 'scipy']. Consider explicitly selecting one of the installed engines via the ``engine`` parameter, or installing additional IO dependencies, see:
https://docs.xarray.dev/en/stable/getting-started-guide/installing.html
https://docs.xarray.dev/en/stable/user-guide/io.html

Directly reading an individual data file with

xr.open_dataset('https://js2.jetstream-cloud.org:8001/pythia/cesmLME/b.ie12.B1850C5CN.f19_g16.LME.002.cam.h0.PRECRC_H216Or.085001-184912.nc')

seems to work fine.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions