[Parser Fix]: distribution.contentUrl for Zenodo #129
Comments
@jal347 can you double-check the URL you used in the correction? The data on staging has the following URL format: The download URLs actually point to their API:
@jal347 I found a few issues when looking at the data in Staging: The URL to access the record does not work for many Zenodo records:
The content.url is broken for many records even though the base URL is correct. This is linked to the issue above and depends on whether a Zenodo id is a canonical id or a versioned record id. Cause of issue:
Potential solution:
Other observations:
Issue Name
distribution.contentUrl for Zenodo
Issue Description
The Zenodo parser does not currently appear to parse values for the 'distribution' field. Based on a quick review of 10 Zenodo records on their site, Zenodo uses the following URL format to give access to the files available for download: https://zenodo.org/api/records/{canonical id}/files-archive
Although this link corresponds to the 'download all' button on the Zenodo site rather than to each individual file download, it can still be parsed into the 'distribution.contentUrl' field.
Issue Example
Example Zenodo record on prod: https://data.niaid.nih.gov/resources?id=ZENODO_6983398
Same record in Zenodo: https://zenodo.org/records/6983398
File download URL from the record in Zenodo: https://zenodo.org/api/records/6983398/files-archive
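The mapping in the example above could be sketched in Python as follows. This is a minimal illustration, not the parser's actual code: the function names, the `_id` key, and the list-of-dicts shape used for `distribution` are all assumptions made for the example. Only the URL pattern and the `ZENODO_` prefixed id come from this issue.

```python
import re

# URL pattern taken from the issue description.
ZENODO_API_BASE = "https://zenodo.org/api/records"


def zenodo_content_url(record_id):
    """Build the files-archive URL for a Zenodo id.

    Accepts a bare numeric id (6983398) or a prefixed id
    ("ZENODO_6983398"). Hypothetical helper, not part of the parser.
    """
    match = re.fullmatch(r"(?:ZENODO_)?(\d+)", str(record_id).strip(), flags=re.IGNORECASE)
    if match is None:
        raise ValueError(f"Unrecognized Zenodo id: {record_id!r}")
    return f"{ZENODO_API_BASE}/{match.group(1)}/files-archive"


def add_distribution(record):
    """Return a copy of `record` with distribution.contentUrl filled in.

    Assumes the record carries its id under "_id" (an assumption for this
    sketch) and stores distribution as a list of objects.
    """
    updated = dict(record)
    updated["distribution"] = [{"contentUrl": zenodo_content_url(record["_id"])}]
    return updated


if __name__ == "__main__":
    print(add_distribution({"_id": "ZENODO_6983398"})["distribution"][0]["contentUrl"])
```

The sketch only formats the URL from whatever id it is given. It does not address the canonical versus versioned record id problem raised in the comments, so a generated link may still fail if the id is not the one Zenodo expects.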
Related WBS task
For internal use only. Assignee, please select the status of this issue
Status Description
No response