-
Notifications
You must be signed in to change notification settings - Fork 1
RDF Serialization - issue with the dct:format property value #3
Comments
the issue seems more complex than expected. The issue involves the format (CKAN metadata) and distribution_format (we introduced for DCAT-AP_IT profile) fields. We have three possible cases:
The code involved should be the following: in ckanext-dcatapit/ckanext/dcatapit/dcat/profiles.py Line 326 distribution_format is never materialized. We need to materialize distribution_format. Alternatively, another solution can be to dynamically derive the distribution_format from CKAN's format every time the serialization and the visualization are to be executed. |
in the RDF serialization of a dataset, the dct:format property may assume the value OP_DATPRO even if the source catalog correctly indicates the format using the EU controlled vocabulary, as requested by the DCAT-AP_IT specs.
This does not happen if the format of the distribution is CSV for instance. It seems happing during the harvesting phase and for specific formats (e.g., all those related to RDF serializations such as RDF_XML, RDF_TURTLE, RDF_N_TRIPLES, etc.)
Example:
Source Catalogue: Linked Data Platform with metadata compliant with DCAT-AP_IT
http://dati.beniculturali.it/resource/Distribuzione/complessoArchivistico-GGASI-nt a dcatapit:Distribution,
dcat:Distribution ;
dct:description "Distribuzione in formato N triples del dataset complessoArchivistico-GGASI " ;
dct:format http://publications.europa.eu/resource/authority/file-type/OP_DATPRO ;
dct:license https://w3id.org/italia/controlled-vocabulary/licences/C1_Unknown,
"https://creativecommons.org/licenses/by-nc/2.5/it/legalcode/" ;
dct:title "Distribuzione in formato N triples del dataset complessoArchivistico-GGASI" ;
dcat:downloadURL http://dati.san.beniculturali.it/dataset/nt/complessoArchivistico-GGASI.nt
In this case the format is OP_DATPRO while in the source catalogue is correctly materialized with the following URI: http://publications.europa.eu/resource/authority/file-type/RDF_N_TRIPLES
It may be a problem of a limited set of format_mapping values https://github.com/geosolutions-it/ckanext-dcatapit/blob/master/ckanext/dcatapit/dcat/profiles.py#L76 ?
In any case, the expected behaviour is that if the source correctly includes the format using the requested controlled vocabulary, no format mapping should be applied. We should simply use what is included in the source catalogue.
The text was updated successfully, but these errors were encountered: