Implement a lighter format for wikipedia importance tables #3424
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Adds support for the new simpler CSV format for wikipedia importance values. This also comes with a much simplified table structure: redirects and articles are now in the same table and all unnecessary information has been dropped leaving only wikipedia article, wikidata ID and importance.
Support for the old-style wikipedia importance dumps remains in place for now. There will be official CSV dumps once we have removed the last obstacles in the generation process in https://github.com/osm-search/wikipedia-wikidata.