-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DD3 R7. Where possible, add Schema.org “corresponding element” entries to GEMINI elements #41
Comments
The W3C mapping, on which this is largely based, is at https://www.w3.org/2015/spatial/wiki/ISO_19115_-_DCAT_-_Schema.org_mapping |
Andrea Perego’s ISO 19139 - DCAT mapping in GitHub (James’ link) provides more detail e.g. the range of each element, and also maps somethings outside the DCAT namespace(s). https://github.com/GeoCat/iso-19139-to-dcat-ap/blob/master/documentation/Mappings.md (Thanks to James Reid) |
Just been contacted by the CDDO data standards team looking to state how to describe "where" in DCAT metadata to be used in the UK government data marketplace. This will include updating the mapping above for DCAT v3. |
Should we also adapt the GEMINI mapping to DCAT 3 as this now includes better description of dataset series? |
That will be a necessary part of the CDDO work; I'll make sure it is available as an update to this GEMINI change request. It's also being discussed (& likely to happen) in the OGC GeoDCAT SWG. |
Need to annotate this to show how it aligns (or not!) with the UK Cross-Government Metadata Exchange Model which may be re-branded as a UK Application Profile of DCAT |
@PeterParslow to update table, then @archaeogeek to update elements with equivalent mappings, also publish this table as guidance |
We'll also need to include guidance or at least comment on converting GEMINI to DCAT covering how many dcat distributions to create (depending on e.g. GEMINI Use constraints & Resource locators). Revised table, with extra columns for DCAT v3 & UK government metadata exchange model. Note, the UK Gov work is supposed to consider adding spatial & some other things; they also plan to convert it to a full AP of DCAT v3.
|
@PeterParslow what do I need to do next? I can't remember... |
See if what I've come up with in a desk exercise matches what you'd expect from the GeoNetwork implementation of DCAT? |
@archaeogeek do you have a link to where this transformation is mapped in GeoNetwork 4. It is available (in theory) though the OGC API - Records interface, though links aren't working for us |
@nmtoken it's not the mapping. We have it working here: https://spatialdata.gov.scot/geonetwork/api/collections/main/items/fa510351-8e30-4147-b984-862be84a6f90. You need to check the log files- I suspect you're missing the relevant xsl files in https://github.com/geonetwork/geonetwork-microservices/tree/main/modules/services/ogc-api-records/src/main/resources/xslt/ogcapir/formats/copy (which is completely undocumented). Basically you need a gemini one that matches the iso19139 one |
Not the headers then (geonetwork/geonetwork-microservices#114) ? |
@nmtoken the above is all I had to do to get it working, YMMV. |
@archaeogeek Just checking we are not talking at cross purposes, you seem to be saying that in your Tree Preservation Orders - Argyll and Bute example the fact that the For us (for example https://metadata.bgs.ac.uk/geonetwork/api/collections/main/items/a2b1143b-5c5d-23d6-e054-002128a47908) and the EEA geospatial data catalogue (for example https://sdi.eea.europa.eu/catalogue/api/collections/main/items/71c47f78-27b6-4080-acd5-47b306b273d8) these tabs don't give any content (only errors). |
@nmtoken to be precise, what I'm saying is that the only change we ever needed to make to get a full set of working links in the ogc-api service is to add the gemini xsl record, eg adding a iso19139.gemini23 equivalent of the files here: https://github.com/geonetwork/geonetwork-microservices/tree/main/modules/services/ogc-api-records/src/main/resources/xslt/ogcapir/formats/copy, which is identical to the ones already included. This does trigger an error in the ogc-api service logs if you dig deep enough. |
I no longer know what I meant by dcat:keyword.DefinedTerm! It's not something that seems to exist in DCAT (v2 or v3). Conveniently, that makes it clearer (to me) that dcat:keyword if for uncontrolled ones and dcat:theme is for controlled lists (on the assumption they are published as SKOS) |
Finally coming back to this- I have dug into the iso19139.gemini23 code and extracted the mapping of Gemini to schema.org (from https://github.com/AstunTechnology/iso19139.gemini23/blob/4.2.x/src/main/plugin/iso19139.gemini23/formatter/jsonld/iso19139.gemini23-to-jsonld.xsl). I've created this as a google sheet for the moment, which anyone can access and comment on. https://docs.google.com/spreadsheets/d/1uHiH-huQ9VuNeAJ7vRNUlpYtMGZrhGZkVLVS4GaGYs0/edit?usp=sharing. I think this does match the table above (#41 (comment)) mostly! I see two actions:
|
Noting also that there's #42 which refers to a similar thing- just making sure we don't forget it! |
Thoughts on https://docs.google.com/spreadsheets/d/1uHiH-huQ9VuNeAJ7vRNUlpYtMGZrhGZkVLVS4GaGYs0/edit?usp=sharing:
Could someone check my observations? Then we need to discuss what to do about them? e.g. should we (GEMINI WG) concentrate on mapping to DCAT or to Schema.org? Which differences do we see as issues to be fixed in GeoNetwork & where is the GeoNetwork mapping better than the one above? |
Many of my comments above do not apply to the (more recent) XSLTs now in GeoNetwork core: https://github.com/SPW-DIG/schemas/iso19115-3.2018/src/main/plugin/iso19115-3.2018/formatter/dcat
I haven't yet found where/whether Topic category is mapped. I'll see if the authors of those XSLTs have a 'conceptual mapping' spreadsheet/spec that they were working from. |
|
Odd; I'm sure I was reviewing it yesterday! Anyway, meanwhile, here is a spreadsheet mapping that was used to create the XSLTs & might be more useful for us to review: https://docs.google.com/spreadsheets/d/1pJkKgGa655Dv06_UFwzeYoje9hEwzifE/edit?usp=sharing&ouid=115738132131001964515&rtpof=true&sd=true (created by GeoCat staff working on GeoNetwork; shared just now in the OGC Metadata Code Sprint). |
Useful table; are they also looking at any additional mappings related to dcat3 or dcat-AP 3 and other profiles? |
I guess that depends who you meant by "they".
|
Looking at this from our OS GeoNetwork instance (now we have 4.2), I'm surprised by the mapping given to GEMINI "Conformity" (ISO DQ_DataQuality > DQ_Element.result > DQ_ConformanceResult) - it comes out in GeoNetwork API DCAT as prov:wasUsedBy/prov:Activity/prove:qualifiedAssociation/prove:hadPlan/prov:wasDerivedFrom with the 'result' ("gmd:pass") as prov:generated with the correct value from the INSPIRE register* I guess that makes sense, saying that the dataset was used in an activity involving the INSPIRE spec, resulting in "not conformant". But I did expect it to map to dcat's dcterms:conformsTo as described at https://www.w3.org/TR/vocab-dcat-3/#quality-conformance-statement - but now I see that GN's choice kind of follows Example 47 in DCAT, in order to allow for "not compliant". Maybe that's why I didn't include that mapping in the tables above! *clever: we should probably reference these in the recommended GEMINI encoding: https://inspire.ec.europa.eu/metadata-codelist/DegreeOfConformity/notConformant |
Further reading on (my) GeoNetwork instance: XSLTs in 4.2.x branch: https://github.com/geonetwork/core-geonetwork/blob/f4890d56b0a61427ea5c9731cc8315ba7e247125/schemas/iso19139/src/main/plugin/iso19139/convert/dcat.xsl, and/or https://github.com/geonetwork/core-geonetwork/blob/f4890d56b0a61427ea5c9731cc8315ba7e247125/web/src/main/webapp/xslt/services/dcat/rdf.xsl GN 4.4 says it uses iso-19139-to-dcat-ap/iso-19139-to-dcat-ap.xsl at main · SEMICeu/iso-19139-to-dcat-ap They both have the same prov:wasUsedBy approach to Conformity, for example. The various occurences of dct:conformsTo in the 4.4 output are about the metadata record (conforms to http://data.europa.eu/930/ and source conforms to <dct:Standard rdf:about="http://vocab.nerc.ac.uk/collection/M25/current/GEMINI/"/> (from input record) The 4.4 one does populate dct:license, dct:rights, dct:accessRights, all from the input record Why/how does this matter to GEMINI (as opposed to GeoNetwork users!)? In particular, I am working with CDDO to get DCAT harvested (from a GEMINI-populated GeoNetwork) into the Data Marketplace - this GEMINI issue therefore become more urgent (at least for me & other government publishers) |
We have used W3C’s recommendations for mapping from ISO 19115 to Schema.org. This table summarises the Schema.org equivalence statements given for each element below.
Whilst there is no specific DD2 recommendation concerning DCAT, we believe a DCAT2 “equivalent element” for each GEMINI element would be useful, by supporting those whose web publication of GEMINI records uses DCAT as opposed to Schema.org. Where this is easily available from the same W3C source, we have included this below. You will see that the two vocabularies are very similar, but note that:
• some of the DCAT elements sit in the DCAT “distribution” section, not their “dataset”;
• many DCAT properties have structured content, so this is not a complete list of how to implement it; and
• there are many other DCAT properties that should also be used, beyond those that exist in Schema.org (e.g. conformsTo, creator, spatialResolutionInMeters, format).
Use .description for the textual content of the Anchor or CodeList
Use .url for the target of the Anchor
Where the page links on to download
.abstract (with the free text) and .url (with the Anchor target URL)
The text was updated successfully, but these errors were encountered: