Deduplication of researchers #141
Comments
Hi,
Thanks, Jan. We couldn't get hold of Fabian while he was here last week, so we will now have to write to him and ask whether he has any utility or code that can be used for deduplication. In parallel, let's start the discussion with the people at Paderborn University. The more, the merrier. Whichever utility we get first, or whichever works better, we can embed in our solution.
OK. I found out that the person I wanted to speak with (Adrian) left the DICE group this year. I will try to find another expert there.
Thanks again. We discussed this with Junbo almost a week ago. We explained our scenario and asked him how to apply deduplication/entity resolution in our solution. He suggested starting with rule-based filtering/merging of duplicate records to obtain a baseline, and later incorporating an entity resolution plugin, if we find one, and comparing its output with the baseline results.
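To make the rule-based baseline concrete, here is a minimal sketch of what it could look like. Everything in it is an assumption for illustration: the record fields (`name`, `email`, `orcid`, `affiliation`), the three matching rules, and the merge policy (first non-empty value wins) are hypothetical and not taken from our actual data model.

```python
import unicodedata
from collections import defaultdict


def normalize_name(name: str) -> str:
    """Lowercase, strip accents and punctuation, and sort the name tokens."""
    decomposed = unicodedata.normalize("NFKD", name)
    ascii_only = "".join(c for c in decomposed if not unicodedata.combining(c))
    cleaned = "".join(c if c.isalnum() else " " for c in ascii_only.lower())
    return " ".join(sorted(cleaned.split()))


def rule_keys(record: dict) -> list:
    """Hypothetical rules: records sharing any key count as duplicates."""
    keys = []
    if record.get("orcid"):
        keys.append(("orcid", record["orcid"].strip()))
    if record.get("email"):
        keys.append(("email", record["email"].strip().lower()))
    if record.get("name") and record.get("affiliation"):
        keys.append(("name+aff", normalize_name(record["name"]),
                     record["affiliation"].strip().lower()))
    return keys


def deduplicate(records: list) -> list:
    """Group records that share a rule key (union-find) and merge each group."""
    parent = list(range(len(records)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    first_seen = {}
    for idx, rec in enumerate(records):
        for key in rule_keys(rec):
            if key in first_seen:
                parent[find(idx)] = find(first_seen[key])
            else:
                first_seen[key] = idx

    groups = defaultdict(list)
    for idx, rec in enumerate(records):
        groups[find(idx)].append(rec)

    merged = []
    for members in groups.values():
        combined = {}
        for rec in members:
            for field, value in rec.items():
                # Assumed merge policy: keep the first non-empty value.
                if value and not combined.get(field):
                    combined[field] = value
        merged.append(combined)
    return merged
```

Because the grouping is transitive, a record that matches one duplicate by ORCID and another by name and affiliation ends up in the same merged entry. The output of `deduplicate` could then serve as the baseline against which an entity resolution plugin is compared.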