-
I’ve been trying to set up a simple dedup pipeline using Splink (see code below) on databricks, but when I execute I went through some discussion threads and relevant issues identified by others, but still unable to get past this error message. Databricks cluster configs: DB runtime: 9.1 LTS; Spark: 3.1.2 Appreciate any help!
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hey @Mehul903 I think you are missing the unique_id_column_name parameter in your settings object. It defaults to
|
Beta Was this translation helpful? Give feedback.
Hey @Mehul903
I think you are missing the unique_id_column_name parameter in your settings object. It defaults to
unique_id
so I think it should work if you change your settings object to: