GRSciColl - collection descriptors - vocabulary for objectClassificationName #157
Comments
The SYNTHESYS+ report (https://doi.org/10.3897/rio.10.e118244) you mentioned was the foundation of our data modelling work. However, our schema has evolved a bit since then. I recommend looking at the schema page, particularly the Digital Specimen JSON schema. We've changed the structure by adding three main categories: topicOrigin In the JSON structure, we've added "enum" (enumeration) to serve as a predefined list of acceptable values. Until we have a proper vocabulary server, this approach helps us maintain consistency in how data is categorised. I think the "other" category is still needed to capture the rest. We are calling these Other Biodiversity and Other Geodiversity.
Thanks @sharifX for letting me know, I wasn't aware of that. Does it mean that the categories like If it the type of classification I have seen before in several institutions and I think it would be great to make them searchable (things like Algae collections cannot be searched easily otherwise). Will you use some other controlled value to work with these cases? (I am trying to make these traditional collections more easily discoverable and I am not sure how best to proceed).
@ManonGros yes, good point. We now have it inside our FDO profile (JSON) under topicCategory. We will update the list in the Digital Specimen JSON schema so that the profile and the object data schemas are aligned.
Thanks @sharifX I think the
@ManonGros Yes, that makes sense. It would perhaps be good to have this historical context documented as well. This is a sort of condensed historical context and background. The development of Latimer Core formalised these concepts into The DiSSCo initiative retained the "Discipline" concept in its openDS schema (https://schemas.dissco.tech/), creating terms such as:
To enhance consistency, DiSSCo plans to enforce values through a JSON enum (a controlled vocabulary constraint in the JSON schema) and leverage vocabularies from resources like GBIF and Catalogue of Life. DiSSCo will also establish a vocabulary server to manage terms specific to openDS use cases.
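To illustrate the enum idea described above, here is a minimal sketch of how a JSON schema fragment with an "enum" constraint restricts a field to a predefined list. The schema fragment, the choice of `topicCategory` as the constrained property, and the four listed values are assumptions made for illustration only; they are not copied from the actual openDS schema, and the helper `enum_violations` is hypothetical.

```python
import json

# Hypothetical schema fragment (NOT the real openDS schema): one string
# property whose allowed values are fixed by an "enum" list.
SCHEMA_FRAGMENT = json.loads("""
{
  "type": "object",
  "properties": {
    "topicCategory": {
      "type": "string",
      "enum": ["Algae", "Bryophytes", "Other Biodiversity", "Other Geodiversity"]
    }
  }
}
""")


def enum_violations(record, schema):
    """Return (field, value) pairs whose value is not in the field's enum."""
    violations = []
    for field, spec in schema.get("properties", {}).items():
        allowed = spec.get("enum")
        # Only check fields that declare an enum and are present in the record.
        if allowed is not None and field in record and record[field] not in allowed:
            violations.append((field, record[field]))
    return violations


if __name__ == "__main__":
    print(enum_violations({"topicCategory": "Algae"}, SCHEMA_FRAGMENT))
    print(enum_violations({"topicCategory": "algae"}, SCHEMA_FRAGMENT))
```

Note that an enum match is exact, so "algae" in lower case is rejected. That strictness is part of why a vocabulary server with labels and synonyms would be more flexible than a bare enum.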
I would like to have a controlled vocabulary for interpreting the Latimer Core field objectClassificationName: https://ltc.tdwg.org/quick-reference/#ObjectClassification.objectClassificationName.
The Latimer Core term objectClassificationName is very convenient for describing subsets of collections that do not necessarily have other ways of being grouped. For example, it is helpful for groups of non-monophyletic taxa (such as Algae).
Currently we don't have any vocabulary, but it would make sense to integrate the categories of the DiSSCo discipline vocabulary, which is described here: DOI 10.3897/rio.10.e118244
Human Biology
Archaeology
Other
Algae
Bryophytes
Fungi/Lichens (including Myxomycetes)
Pteridophytes
Seed plants
Collected on Earth
Collected in space
Other
Mineralogy
Petrology
Loose sediment
Other
Bacteria and Archaea
Phages
Plasmids
Protozoa
Virus - animal / human
Virus - plant
Yeast and fungi
Other
Botany & Mycology
Invertebrates
Vertebrates
Trace fossils
Microfossils
Other
Arthropods - insects (Lepidoptera, Diptera, Hymenoptera, Coleoptera)
Arthropods - other insects
Arthropods - arachnids
Arthropods - crustaceans & myriapods
Porifera (sponges)
Mollusca (bivalves, gastropods, cephalopods)
Other
Fishes
Amphibians
Reptiles
Birds
Mammals
Other
Other biological or geological objects which fit into none of the other defined categories
Note that there is some overlap with the GRSciColl discipline vocabulary for institutions (https://registry.gbif.org/vocabulary/Discipline) and the GRSciColl collection content type vocabulary (https://registry.gbif.org/vocabulary/CollectionContentType/concepts). However, I think the DiSSCo list of proposed values is quite practical and reflects a lot of the sub-collection divisions I have encountered.
I am not necessarily suggesting that the DiSSCo vocabulary be the final one used for objectClassificationName, but that it be integrated into the vocabulary used for interpreting the field. Perhaps we could remove the "other" categories there?
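As a rough sketch of what "interpreting" the field against a vocabulary could look like: free-text objectClassificationName values are normalised and matched against the labels of each concept. The concept names come from the list above, but the alternative labels, the `VOCABULARY` structure, and the `interpret` function are all hypothetical and do not reflect how GRSciColl actually implements vocabulary interpretation.

```python
# Hypothetical vocabulary: concept name -> accepted lower-case labels.
# Concept names are taken from the proposed list; the extra labels
# ("mosses", "fish", ...) are invented for illustration.
VOCABULARY = {
    "Algae": {"algae", "alga"},
    "Bryophytes": {"bryophytes", "bryophyte", "mosses"},
    "Fishes": {"fishes", "fish"},
    "Mineralogy": {"mineralogy", "minerals"},
}


def interpret(raw_value):
    """Map a free-text value to a vocabulary concept, or None if unmatched."""
    # Normalise case and collapse runs of whitespace before matching.
    normalised = " ".join(raw_value.lower().split())
    for concept, labels in VOCABULARY.items():
        if normalised in labels:
            return concept
    return None


if __name__ == "__main__":
    for value in ["  ALGAE ", "Mosses", "Herbarium sheets"]:
        print(repr(value), "->", interpret(value))
```

Unmatched values return None instead of falling into an "Other" bucket, which is one way the "other" categories could be left out of the interpreted vocabulary.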