v1.1.0
GeoParquet 1.1.0 is our second stable release. There are two major additions to the specification:
- #191 introduces the bbox covering encoding which defines a way to add an extra column that represents the bounding box of each geometry as a 'struct' in Parquet. This can accelerate spatial queries by allowing consumers to inspect row group and page index bounding box summary statistics.
- #189 adds an option for Native encodings (based on GeoArrow), which introduces an alternative to Well Known Binary that enables readers to leverage more features of the Parquet format to accelerate geospatial queries (e.g., row group-level min/max statistics).
Both work to better leverage Parquet's native capabilities to enable GeoParquet readers to query spatial data more efficiently. The first is designed to work with the way geometries are encoded in GeoParquet 1.0, and should be easy for any reader or writer to add, while the second is a new alternative encoding that is more efficient and more compatible with the Parquet way of doing things, but will take more work to implement. They could theoretically be used together, but practically only one is necessary in a GeoParquet file. Both features are completely optional, and follow our version compatibility guidelines to be fully backwards compatible changes.
There were also a number of minor clarifications and improvements since 1.0, including recommendations about media type and .parquet as the file extension, along with new test data. The full list of PR's, new contributors and changes follows below.
What's Changed
- Spec: add a note about mapping of Parquet BYTE_ARRAY type to Arrow types by @rouault in #190
- Add missing link, make sure people find stable by @m-mohr in #194
- fix typo on geoparquet.md by @Jap8nted in #195
- Fixes the link from
compatible-parquet.md
to the schema. by @xaviernogueira in #196 - Introduce bounding box column definition by @jwass in #191
- Add GeoArrow encoding as an option to the specification by @paleolimbot in #189
- Require two elements in covering elements by @m-mohr in #210
- Update README.md by @ghobona in #214
- Added file extension recommendation by @cholmes in #212
- Added Media Type by @cholmes in #213
- Added disclosure that 1.0.0 is not yet OGC approved by @jatorre in #205
- Add test data covering different native (geoarrow-based) encodings by @jorisvandenbossche in #204
- Improves Covering Schema #208 by @m-mohr in #209
- Bounding box column updates by @kylebarron in #202
- Clarify null/missing crs key by @kylebarron in #225
- clarified that the bbox must be in the same crs as the geometry by @cholmes in #226
- Updated projjson schema link to 0.7 (and new link) by @cholmes in #227
- Add version compatibility documentation by @TheNeuralBit in #229
New Contributors
- @Jap8nted made their first contribution in #195
- @xaviernogueira made their first contribution in #196
- @jwass made their first contribution in #191
- @paleolimbot made their first contribution in #189
- @ghobona made their first contribution in #206
- @jatorre made their first contribution in #205
- @TheNeuralBit made their first contribution in #229
Full Changelog: v1.0.0...v1.1.0