Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deduplication of researchers #141

Open
Meisenburger13 opened this issue Aug 10, 2023 · 4 comments
Open

Deduplication of researchers #141

Meisenburger13 opened this issue Aug 10, 2023 · 4 comments
Assignees

Comments

@Meisenburger13
Copy link
Collaborator

No description provided.

@janreineke
Copy link
Member

Hi,
Is it still needed to get in contact with guys from the Uni Paderborn for the Deduplication? Or are you in contact with Fabian Pause?

@abdullah-rana
Copy link
Collaborator

Thanks Jan. We couldn't get hold of Fabian last week while he was here. We now have to write to him to ask if he has got any such utility/code which can be used for deduplication. In parallel, let's initiate the discussion with the guys from Paderborn university. The more, the merrier. Then, whichever utility we get first or is more optimal, we can embed in our solution.

@janreineke
Copy link
Member

Ok. I found out, that the guy (Adrian) I wanted to speak with, left the DICE-group this year. I will try to find another expert there.
By the way. In Coypu we have a task "Event deduplication" where Junbo is in charge of. Let's talk to him, too.

@abdullah-rana
Copy link
Collaborator

Thanks again. We discussed this with Junbo almost a week back. We explained our scenario and asked him how to apply deduplication/entity resolution in our solution. He suggested starting with rule-based record duplication filtering/merging to obtain some baseline (so to speak), and later we can incorporate any entity resolution plugin, if found, and compare its output with the baseline results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants