concurrent reads with dask and xarray

I've never been clear on whether xarray+dask computations that use threads are hampered by file locks. It's complicated because the whole GDAL-->rasterio-->xarray-->dask stack is involved.

Rasterio recently updated their example of concurrent processing. This example specifically reads files from local disk rather than over a network. 
https://rasterio.readthedocs.io/en/latest/topics/concurrency.html

There is some good discussion about multiple threads reading the same file concurrently here:
https://github.com/pangeo-data/pangeo-example-notebooks/issues/21#issuecomment-432457955 

And finally, the rasterio mailing list has some relevant discussion:
https://rasterio.groups.io/g/main/topic/72528118#468

A simple example that shows timing improvement would go a long way in helping to clarify this for people. This might require a PR to xarray to deal with locks...
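
As a starting point, here is a rough sketch of the kind of timing comparison I have in mind. It does not touch GDAL, rasterio, xarray, or dask at all: `read_chunk` and `timed_reads` are hypothetical helpers, and `time.sleep` stands in for a blocking read that releases the GIL. It only illustrates how much a single shared lock can cost a thread pool; whether real reads through the stack behave like this is exactly the open question.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def read_chunk(lock=None, delay=0.05):
    """Simulate one chunk read (hypothetical stand-in for a raster read).

    time.sleep releases the GIL, much like blocking file I/O would.
    If a lock is given, the whole "read" happens while holding it.
    """
    if lock is not None:
        with lock:
            time.sleep(delay)
    else:
        time.sleep(delay)
    return delay


def timed_reads(n_chunks, n_threads, lock=None):
    """Run n_chunks simulated reads on a thread pool and return elapsed seconds."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        # Consume the iterator so every read finishes before we stop the clock.
        list(pool.map(lambda _: read_chunk(lock), range(n_chunks)))
    return time.perf_counter() - start


# One lock shared by all threads: reads are effectively serialized.
locked = timed_reads(n_chunks=8, n_threads=4, lock=threading.Lock())
# No lock: up to n_threads reads overlap.
unlocked = timed_reads(n_chunks=8, n_threads=4)

print(f"shared lock: {locked:.2f} s")
print(f"no lock:     {unlocked:.2f} s")
```

The real version would swap the simulated read for windowed reads of an actual file through rasterio and xarray, with and without a lock, and compare the same two timings.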
