-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal to include a property for "type of data" contained in a dataset #1548
Comments
This speaks to the ambiguity in the semantics of dcterms:conformsTo... does it relate to the subject or the resource the subject describes? Any data service is going to have at least 5 different conformance aspects.. access method, data model, data content type and service level. Possibly an aspect oriented qualified relationship is needed. Dimensions of data may be separate aspects.. or a compound aspect. Rdf datacube provides quite powerful starting point for this. |
@svituz , your requirement it is not completely clear to me. Is it about conformity and data structure definition, as per @rob-metalinkage 's comment? Or is it about the classification of a dataset - which in DCAT is done via It would be useful if you could provide a full example, ideally also with its RDF representation. |
@andrea-perego let's say I have a Dataset that collects data about a clinical study regarding a specific disease (the theme). The data you can find in the Dataset contains laboratory exams. I think these are two different concepts.
This RDF would describe a Dataset containing laboratory exams of clinical cases with breast cancer, for example. You can have another one with the same theme (the disease) but a different type of data for example digital x ray
Even in the case that Hope to have clarified my issue and that it makes sense. I looked a lot in dcat (and also outside of dcat) to solve this issue we have but couldn't find a satisfying solution. If you're interested in the context, here you can read about it |
Could you use the PROV Ontology? |
I'll circle back to my comment - there are multiple aspects - do you want a property for every possible one or a flexible mechanism to support documentation. a well known ontology could provide predicates (such as prov:wasGeneratedBy) in this case you need to describe data structure - typically container organisation or custom application schema (and specialised profiles thereof), data dimensions (e.g. RDF datacube), nature of data elements within the containers etc. Profiles of DCAT supporting available descriptive ontologies would be better than half-implementing via a limited set of properties. |
@svituz when reading your case I get the feeling it could be resolved with as @rob-metalinkage mentioned creating a proper DCAT profile. For you specific profiling case, DCAT has 3 options for classifications:
In the DCAT-AP ecosystem we encounter this need regulary that one would like to classify the datasets according to some domain specific needs.
The second option is the safest in case one deals with datasets that must be documented by multiple DCAT profiles. It means that in your domain you can express a specific set of constraints on that one, and that any other user of that metadata can use it as if it was dct:subject.
You can use this pattern for as many classifications you want without loosing compatibility with DCAT (because of the subPropertyOf relationship). |
I think it can be useful for
Dataset
to have a property that indicates the type of data that can be found in it (such asdcat:collectedData
ordcat:typeOfData
).For example, for a Dataset with clinical data, it can be useful to have such a property, whose value may be taken from controlled vocabularies or ontologies (e.g. LOINC, SNOMED) to express concepts such as "Laboratory Exams", "Vital Signs Obervation" etc..
I think it can be useful to increase the findability of datasets, also for other domains.
Any thoughts about this?
The text was updated successfully, but these errors were encountered: