Expressing blocking rule where token in left is present in right or vice versa #1032

Answered by adelinor
adelinor asked this question in Q&A
Thanks to @NickCrews's answer on #1053 and the code samples in Splink in Databricks - Expert Answer.py, I can answer my own question:

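  1. Split the values into an array column, say names_array.
  2. For the DuckDB backend, the blocking rule is then (list_unique(l.names_array) + list_unique(r.names_array) - list_unique(list_concat(l.names_array, r.names_array)) > 0). The expression counts the tokens the two arrays share: distinct tokens on the left, plus distinct tokens on the right, minus distinct tokens in the concatenation of both. The rule matches when that count is above zero.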
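
To check the logic outside the database, here is a minimal pure-Python sketch of the same arithmetic. The helpers list_unique and list_concat are stand-ins I wrote to mirror the DuckDB functions of the same name (they are not Splink or DuckDB APIs), and the name strings are made-up examples. NULL handling is not modelled.

```python
def list_unique(values):
    """Stand-in for DuckDB's list_unique: number of distinct elements."""
    return len(set(values))


def list_concat(left, right):
    """Stand-in for DuckDB's list_concat: the two lists joined end to end."""
    return list(left) + list(right)


def shares_token(left_tokens, right_tokens):
    """Same arithmetic as the blocking rule: |L| + |R| - |L union R| > 0."""
    overlap = (
        list_unique(left_tokens)
        + list_unique(right_tokens)
        - list_unique(list_concat(left_tokens, right_tokens))
    )
    return overlap > 0


# Step 1: split (hypothetical) name strings into token arrays.
left = "john michael smith".split()
right = "smith jon".split()
other = "anna lee".split()

print(shares_token(left, right))  # True: both contain "smith"
print(shares_token(left, other))  # False: no token in common
```

Because the counts are taken over distinct tokens, repeated tokens within one array do not inflate the overlap, and the rule is symmetric in l and r, which covers the "or vice versa" part of the question.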

Answer selected by adelinor