You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The `polars.GPUEngine` object may be configured in several different ways.
4
+
5
+
## Parquet Reader Options
6
+
Reading large parquet files can use a large amount of memory, especially when the files are compressed. This may lead to out of memory errors for some workflows. To mitigate this, the "chunked" parquet reader may be selected. When enabled, parquet files are read in chunks, limiting the peak memory usage at the cost of a small drop in performance.
7
+
8
+
9
+
To configure the parquet reader, we provide a dictionary of options to the `parquet_options` keyword of the `GPUEngine` object. Valid keys and values are:
10
+
-`chunked` indicates that chunked parquet reading is to be used. By default, chunked reading is turned on.
11
+
-[`chunk_read_limit`](https://docs.rapids.ai/api/libcudf/legacy/classcudf_1_1io_1_1chunked__parquet__reader#aad118178b7536b7966e3325ae1143a1a) controls the maximum size per chunk. By default, the maximum chunk size is unlimited.
12
+
-[`pass_read_limit`](https://docs.rapids.ai/api/libcudf/legacy/classcudf_1_1io_1_1chunked__parquet__reader#aad118178b7536b7966e3325ae1143a1a) controls the maximum memory used for decompression. The default pass read limit is 16GiB.
13
+
14
+
For example, to select the chunked reader with custom values for `pass_read_limit` and `chunk_read_limit`:
15
+
```python
16
+
engine = GPUEngine(
17
+
parquet_options={
18
+
'chunked': True,
19
+
'chunk_read_limit': int(1e9),
20
+
'pass_read_limit': int(4e9)
21
+
}
22
+
)
23
+
result = query.collect(engine=engine)
24
+
```
25
+
Note that passing `chunked: False` disables chunked reading entirely, and thus `chunk_read_limit` and `pass_read_limit` will have no effect.
Try out the GPU engine for Polars in a free GPU notebook environment. Sign in with your Google account and `launch the demo on Colab <https://colab.research.google.com/github/rapidsai-community/showcase/blob/main/accelerated_data_processing_examples/polars_gpu_engine_demo.ipynb>`__.
0 commit comments