Unsupported types in DataFrameReader.dbapi (PuPr) are now ingested as StringType.
Improved the error message to list the available columns when a DataFrame cannot resolve a given column name.
Added a new option cacheResult to DataFrameReader.xml that allows users to cache the result of the XML reader to a temporary table after calling xml. It helps improve performance when subsequent operations are performed on the same DataFrame.
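As a rough illustration of the cacheResult option, the sketch below wraps the reader call in a small helper. The helper name read_xml_cached, the stage path, and the row tag are hypothetical, and passing a boolean True for cacheResult is an assumption; only the option name itself comes from the note above.

```python
from typing import Any


def read_xml_cached(session: Any, stage_path: str, row_tag: str) -> Any:
    """Read an XML file with result caching requested (hypothetical helper).

    `session` is expected to be a Snowpark Session; it is typed as Any so the
    sketch does not need Snowpark installed just to be read or imported.
    """
    return (
        session.read
        .option("rowTag", row_tag)    # XML element that delimits one row
        .option("cacheResult", True)  # assumption: the option takes a boolean
        .xml(stage_path)
    )
```

With a live session this might be called as read_xml_cached(session, "@my_stage/books.xml", "book"), after which repeated operations on the returned DataFrame are the case the option is meant to speed up.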
Snowpark pandas API Updates
New Features
Improvements
Downgraded the log message saying that the Snowpark DataFrame reference of an internal DataFrameReference object has changed to level logging.DEBUG - 1.
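To make the effect of the new level concrete: logging.DEBUG is 10 in the Python standard library, so logging.DEBUG - 1 is 9, and a logger configured at plain DEBUG no longer emits the message. The sketch below checks this with the standard logging module only; the logger name is hypothetical and is not the name Snowpark uses.

```python
import logging

# logging.DEBUG is 10, so DEBUG - 1 is the numeric level 9.
VERBOSE_LEVEL = logging.DEBUG - 1


def would_emit(logger_level: int, message_level: int = VERBOSE_LEVEL) -> bool:
    """Return True if a logger set to logger_level processes message_level."""
    logger = logging.getLogger("example.snowpark_pandas")  # hypothetical name
    logger.setLevel(logger_level)
    return logger.isEnabledFor(message_level)


# A logger at DEBUG filters the message out; lowering the threshold to
# DEBUG - 1 lets it through again.
print(would_emit(logging.DEBUG))      # False
print(would_emit(logging.DEBUG - 1))  # True
```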
Eliminated duplicate parameter check queries for casing status when retrieving the session.
Improved performance by retrieving DataFrame row counts through object metadata, which avoids a COUNT(*) query.
Added support for applying Snowflake Cortex function Complete.
Introduced faster pandas: improved performance by deferring row position computation.
The following operations are currently supported and can benefit from the optimization: read_snowflake, repr, loc, reset_index, merge, and binary operations.
If a lazy object (e.g., DataFrame or Series) depends on a mix of supported and unsupported operations, the optimization will not be used.
Updated the error message for when Snowpark pandas is referenced within apply.
Added a session parameter dummy_row_pos_optimization_enabled to enable/disable dummy row position optimization in faster pandas.
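The sketch below chains only operations listed above as supported by faster pandas (read_snowflake, merge, reset_index, repr). It is a non-authoritative illustration: the function name, table names, and key column are hypothetical, and setting dummy_row_pos_optimization_enabled as an attribute on the session is an assumption about how the parameter is exposed.

```python
from typing import Any


def merged_preview(session: Any, left_table: str, right_table: str, key: str) -> str:
    """Join two Snowflake tables with Snowpark pandas and return a text preview."""
    # Local imports so the sketch can be loaded without Snowpark installed.
    import modin.pandas as pd
    import snowflake.snowpark.modin.plugin  # noqa: F401  (registers the backend)

    # Assumption: the session parameter is set as a plain attribute.
    session.dummy_row_pos_optimization_enabled = True

    left = pd.read_snowflake(left_table)
    right = pd.read_snowflake(right_table)

    # merge and reset_index are both listed as supported, so the whole chain
    # stays eligible for the optimization.
    merged = left.merge(right, on=key).reset_index(drop=True)

    # repr triggers evaluation of the lazy DataFrame.
    return repr(merged)
```

Adding an operation outside the supported list anywhere in this chain would, per the note above, cause the optimization not to be used for the resulting object.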
Dependency Updates
Updated the supported modin versions to >=0.35.0 and <0.37.0 (previously >=0.34.0 and <0.36.0).
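As a quick way to read the new range, the helper below checks a plain numeric version string against >=0.35.0 and <0.37.0. It is a minimal sketch using only the standard library; the function name is hypothetical, and pre-release or local version suffixes are not handled.

```python
def is_supported_modin(version: str) -> bool:
    """Return True if a 'major.minor.patch' string is in [0.35.0, 0.37.0)."""
    numbers = [int(part) for part in version.split(".")[:3]]
    numbers += [0] * (3 - len(numbers))  # treat '0.35' as '0.35.0'
    return (0, 35, 0) <= tuple(numbers) < (0, 37, 0)


print(is_supported_modin("0.34.1"))  # False: supported before, not anymore
print(is_supported_modin("0.36.0"))  # True: newly supported
print(is_supported_modin("0.37.0"))  # False: upper bound is exclusive
```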
Bug Fixes
Fixed an issue with drop_duplicates where the same data source could be read multiple times in the same query but in a different order each time, resulting in missing rows in the final result. The fix ensures that the data source is read only once.
Fixed a bug with hybrid execution mode where an AssertionError was unexpectedly raised by certain indexing operations.
Snowpark Local Testing Updates
New Features
Added support for patching functions.ai_complete.