Widen type promotion for decimals with larger scale in Parquet Read [databricks] #11727
This PR contributes to #11433 and #11512.
This PR supports additional type promotion to decimals with larger precision and scale.
As long as the precision increases by at least as much as the scale, the decimal values can be promoted without loss of precision.
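Concretely, widening DECIMAL(p1, s1) to DECIMAL(p2, s2) preserves all values when p2 - p1 >= s2 - s1, i.e. the number of integral digits (p - s) never shrinks. A minimal sketch of the rule, with a hypothetical helper name (not code from this PR):

```scala
object DecimalWideningSketch {
  // Widening DECIMAL(p1, s1) to DECIMAL(p2, s2) is lossless when the scale
  // does not shrink and the precision grows by at least as much as the
  // scale, so the integral digit count (p - s) never decreases.
  def isLosslessWidening(p1: Int, s1: Int, p2: Int, s2: Int): Boolean =
    s2 >= s1 && (p2 - p1) >= (s2 - s1)

  def main(args: Array[String]): Unit = {
    // DECIMAL(5, 2) -> DECIMAL(7, 4): precision +2, scale +2, so lossless.
    assert(isLosslessWidening(5, 2, 7, 4))
    // DECIMAL(5, 2) -> DECIMAL(6, 4): precision +1 < scale +2; the integral
    // part would shrink from 3 digits to 2, so a value like 999.99 overflows.
    assert(!isLosslessWidening(5, 2, 6, 4))
  }
}
```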
A similar change was added in Apache Spark 4.0: apache/spark#44513.
Currently, on the CPU, all Spark versions prior to 4.0 throw an exception if the scale of the read schema does not match the scale of the schema that was written. In spark-rapids, this fix is available for all supported Spark versions.
We removed the separate checks for whether a decimal can be read as an int, long, or byte_array and consolidated them into a single function, `canReadAsDecimal` (see the sketch below). An integration test was added to verify that the conditions for the type promotion are met.
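A minimal sketch of what such a consolidated check might look like (hypothetical types and signature, assuming the widening rule above; the actual plugin code differs): whichever physical encoding Parquet used for the decimal column, the same logical precision/scale comparison decides whether it can be read.

```scala
object CanReadAsDecimalSketch {
  // Hypothetical Parquet physical encodings a decimal column may use.
  sealed trait PhysicalType
  case object Int32 extends PhysicalType
  case object Int64 extends PhysicalType
  case object ByteArray extends PhysicalType

  final case class FileDecimal(physical: PhysicalType, precision: Int, scale: Int)

  // One predicate replacing the former per-encoding checks: the physical
  // type no longer changes the decision, only the logical precision and
  // scale relationship does.
  def canReadAsDecimal(file: FileDecimal, readPrecision: Int, readScale: Int): Boolean = {
    val scaleIncrease = readScale - file.scale
    val precisionIncrease = readPrecision - file.precision
    scaleIncrease >= 0 && precisionIncrease >= scaleIncrease
  }
}
```

Under this rule, for example, a decimal stored as INT32 with DECIMAL(7, 2) can be read as DECIMAL(9, 4), since precision and scale each increase by 2, while reading it as DECIMAL(8, 4) would be rejected.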