Find out if the seed data packages are publicly available, and annotate them respectively 🔗 #5

Open
1 task
sdruskat opened this issue Mar 31, 2021 · 0 comments
Labels
required Something that needs to be done to make the hack successful

Comments

@sdruskat
Collaborator

What do we have?

A seed dataset of n software package mentions.

The issue

The mentions alone aren't useful for most of our research questions.

What do we really need?

How can we achieve this?

  1. Crowdsourcing! Each of us takes a list of mentions and tries to find the public repository on, e.g., GitHub, GitLab, Bitbucket, or elsewhere.
  2. We annotate the dataset with this information.
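
As a rough illustration of the two steps above, here is a minimal Python sketch. It assumes the seed dataset can be read as a plain list of mention strings and that each annotator records the repository URLs they find by hand in a dictionary. The function names (`split_mentions`, `annotate`), the record fields, and the example URL are all hypothetical and not part of the actual dataset.

```python
from urllib.parse import urlparse


def split_mentions(mentions, annotators):
    """Deal mentions out round-robin so each annotator gets a similar share."""
    batches = {name: [] for name in annotators}
    for index, mention in enumerate(mentions):
        batches[annotators[index % len(annotators)]].append(mention)
    return batches


def annotate(mentions, found_urls):
    """Attach repository information to each mention.

    found_urls maps a mention to the URL an annotator found by hand.
    A mention without an entry is recorded as having no repository found,
    which may or may not mean the software is unavailable.
    """
    records = []
    for mention in mentions:
        url = found_urls.get(mention)
        records.append({
            "mention": mention,
            "repository_found": url is not None,
            "repository_url": url,
            # Host name only (e.g. github.com), taken from the URL itself.
            "host": urlparse(url).netloc if url else None,
        })
    return records


# Hypothetical usage: mention names and the URL are made up for illustration.
batches = split_mentions(["tool-a", "tool-b", "tool-c"], ["alice", "bob"])
records = annotate(
    batches["alice"],
    {"tool-a": "https://github.com/example-org/tool-a"},
)
```

Keeping "no repository found" separate from "not publicly available" is a deliberate choice in this sketch: a failed manual search does not prove the software is unpublished.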
@sdruskat sdruskat added the required Something that needs to be done to make the hack successful label Mar 31, 2021
@sdruskat sdruskat added this to the Habeas useful corpus milestone Mar 31, 2021
@sdruskat sdruskat changed the title Find out if the seed data packages are publicly available, and link them 🔗 Find out if the seed data packages are publicly available, and annotate them respectively 🔗 Mar 31, 2021