Controlled vocabularies such as taxonomies allow an accurate and controlled approach in describing datasets. One of such controlled vocabulary is Wind Energy Taxonomy of Activities. This taxonomy is the result of EERA JP WIND IRPWind Open Data initiative that took place in 2017 in which wind energy experts generated the first version of the taxonomy as an input for defining and structuring wind energy metadata.
The report of this work is available at Zenodo: https://www.zenodo.org/record/1199489#.XSD6haeQ3RY
In 2018, the taxonomy of topics was improved during the internal project of DTU Wind Energy titled 'FAIR Digitalization': https://www.zenodo.org/record/1493874#.XSD7TaeQ3RY
In 2021, the taxonomy was converted into FAIR machine-actionable controlled vocabulary using sheet2rdf. The controlled vocabulary is served to humans and machines using an instace of OntoStack hosted by DTU Wind Energy. sheet2rdf
and OntoStack
are developed by Nikola Vasiljevic.
Additionally, in 2021, the persistent URL purl.org/weave has been registered for the taxonomy. This allows us to:
- If there is a needed to move the taxonomy to another domain (currently under data.windenergy.dtu.dk) URLs will be still resolvable and unchanged
- Use short URls as PIDs for the taxonomy terms and properties such as:
This repository hosts automatic workflow, executed by means of Github actions, and underlying shell and python scripts which:
- Fetches Google Sheet, containing the taxonomy terms and their defitions, from Google Drive and stores is at
xlsx
andcsv
files - Converts fetched sheet to machine-actionable and FAIR RDF vocabulary using xls2rdf
- Tests the resulting RDF vocabulary using qSKOS
- Commits conversion results and tests logs to this repository
- and deploy RDF vocabulary to OntoStack to be served to humans and machines
OntoStack is a set of orchestrated micro-services configured and interfaced such that they can intake vocabularies and resolve their terms and RDF properties upon requests either by humans or machines.
Some of OntoStack micro-services are:
- Jena Fuseki a graph database
- SKOSMOS a web-based SKOS browser acting as a front-end for the vocabularies persisted by the graph database
- Træfik an edge router responsible for proper serving of URL requests
NEAT is served by DTU Wind Energy instance of OntoStack
:
https://data.windenergy.dtu.dk/ontologies/view
The taxonomy is implemented in following services:
- DTU Data
- CEDAR through integration with BioPortal
- NEWA Microscale Atlas data access point
- NEWA Mesoscale Atlas data access point
NEWA: New European Wind Atlas
If you are interested to visualize taxonomy follow steps described in this YouTube video.
The taxonomy is intended to be used and further developed by the community. Therefore, we welcome collaborators willing to take part in the further development of the taxonomy. If you are one of them either request to become one of the taxonomy admins and/or post GitHub issues on what we can improve in the current taxonomy.