Skip to content

Idea: Support supplementary files in the DwC-A and FrictionlessData output #2109

@timrobertson100

Description

@timrobertson100

I'd like to share an idea to see if this would be of interest to the IPT community. Feedback sought.

I propose the IPT be enhanced to allow a user to upload additional supplementary files that relate to the dataset, and have them included in the DwC-A or FrictionlessData package output.

I expect the uses for this could be many and would make the IPT an even more useful data repository, but concretely we're moving towards the inclusion of Newick files for phylogenies (example) that would be well suited for this. Over the years I'm aware of a desire for the IPT to host some images for a dataset too, although that may require special attention due to size.

I imagine the output for DwC-A could include:

/ 
  - eml.xml
  - occurrence.txt
  - meta.xml
  - supplementary-data
      - images
        - image1.jpg
        - image2.jpg
      - sequences
        - newick1.nwk
      - other
        - procedures.pdf

When uploading supplementary data, the user would select a category from a drop-down (e.g. images) and the IPT could enforce limits on individual file size, total file count, and total size or so.

I'd expect the IPT to be able to serve the files on a URL such as https://ipt.example.com/resource/2021-dwc-updates/supplementary_data/images/image1.jpg or so.

I haven't given much thought to licensing but imagine we might need a license per file to e.g. allow for restrictive licenses on images. We might also take the opportunity to enforce permissive licenses that allow research (e.g. only allow CC0, CC-BY) for this, or even require that it falls under the same license as the dataset as a whole.

Thanks

Metadata

Metadata

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions