Entry Points for RO-Crate Profiles #371

Open
floWetzels opened this issue Nov 8, 2024 · 6 comments

floWetzels commented Nov 8, 2024

In the current specification, profiles are always defined on a complete RO-Crate. If such a profile specifies requirements on the root data entity, it cannot be used in a modular or composable way, since subdirectories in RO-Crates don't specify a root data entity. An example of such a profile is the Workflow-Run-RO-Crate.

This issue came up at the BioHackathon Europe 2024, project 19 (@elichad @dnlbauer @floWetzels). Since composed research objects seem to be common, and it should be possible to model them in RO-Crates without redundancy in profiles, we propose introducing an Entrypoint mechanism into the RO-Crate specification. See https://github.com/dnlbauer/bh24-ro-crate-extension for details (will be fleshed out in the future).

Subscribe to this issue to stay updated on the development.
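
To make the limitation concrete, here is a minimal Python sketch of how a profile is attached to a crate today. It assumes the usual layout where the metadata descriptor `ro-crate-metadata.json` points at the root data entity via `about`. The profile URI and the `run1/` and `run2/` directories are hypothetical placeholders, and `root_data_entity` and `profiles_of` are illustrative helpers, not part of any RO-Crate library.

```python
# Hypothetical profile identifier, used only for illustration.
PROFILE = "https://example.org/profiles/workflow-run"

metadata = {
    "@context": "https://w3id.org/ro/crate/1.1/context",
    "@graph": [
        {
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
            "about": {"@id": "./"},
        },
        {
            # The root data entity is the only place the profile is declared.
            "@id": "./",
            "@type": "Dataset",
            "conformsTo": {"@id": PROFILE},
            "hasPart": [{"@id": "run1/"}, {"@id": "run2/"}],
        },
        # Subdirectories are plain Datasets with no root of their own.
        {"@id": "run1/", "@type": "Dataset"},
        {"@id": "run2/", "@type": "Dataset"},
    ],
}


def as_list(value):
    """Normalise a JSON-LD value that may be missing, single, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def root_data_entity(metadata):
    """Follow `about` from the metadata descriptor to the root data entity."""
    graph = {entity["@id"]: entity for entity in metadata["@graph"]}
    descriptor = graph["ro-crate-metadata.json"]
    return graph[descriptor["about"]["@id"]]


def profiles_of(entity):
    """Return the identifiers listed under `conformsTo` on an entity."""
    return [ref["@id"] for ref in as_list(entity.get("conformsTo"))]


root = root_data_entity(metadata)
print(profiles_of(root))  # the profile applies to the crate as a whole
```

A profile written against the root data entity has nothing to attach to in `run1/` or `run2/`, which is the gap the proposal is meant to address.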

floWetzels added the use-case label ("A (potential) use-case for ROLite creation, consumption or integration") Nov 8, 2024

elichad commented Nov 12, 2024

To consider:

  • can/should this work recursively? Having an entry point inside another entry point?
  • what should happen if multiple entry points conform to the same profile? e.g. in the case of uploading a crate with multiple workflow entry points to WorkflowHub

@dnlbauer

  • what should happen if multiple entry points conform to the same profile? e.g. in the case of uploading a crate with multiple workflow entry points to WorkflowHub

In this case, the service could decide to decompose the RO-Crate into "atomic" (for lack of a better term) RO-Crates, which can be handled like normal RO-Crates. For example, WorkflowHub could decide to create multiple entries, one for each entry point in the crate. A workflow execution engine executing a workflow from an RO-Crate could present a drop-down selection to the user, or require the user to specify an entry point during workflow submission.
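
A rough Python sketch of what such a service-side decision could look like. This is only an illustration under assumptions: entry points are taken to be any entity (other than the metadata descriptor) that carries `conformsTo`, the profile URI and the `align/` and `qc/` directories are made up, and `entry_points_by_profile` and `choose_entry_point` are hypothetical helpers.

```python
from collections import defaultdict

# Hypothetical profile identifier, used only for illustration.
WORKFLOW_PROFILE = "https://example.org/profiles/workflow"

graph = [
    {"@id": "ro-crate-metadata.json", "@type": "CreativeWork",
     "about": {"@id": "./"}},
    {"@id": "./", "@type": "Dataset",
     "hasPart": [{"@id": "align/"}, {"@id": "qc/"}]},
    {"@id": "align/", "@type": "Dataset",
     "conformsTo": {"@id": WORKFLOW_PROFILE}},
    {"@id": "qc/", "@type": "Dataset",
     "conformsTo": [{"@id": WORKFLOW_PROFILE}]},
]


def as_list(value):
    """Normalise a JSON-LD value that may be missing, single, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def entry_points_by_profile(graph):
    """Group candidate entry points by the profile they claim to conform to."""
    groups = defaultdict(list)
    for entity in graph:
        if entity["@id"] == "ro-crate-metadata.json":
            continue  # the descriptor's conformsTo names the RO-Crate spec itself
        for ref in as_list(entity.get("conformsTo")):
            groups[ref["@id"]].append(entity["@id"])
    return dict(groups)


def choose_entry_point(candidates, requested=None):
    """Pick one entry point, insisting on an explicit choice when ambiguous."""
    if requested is not None:
        if requested not in candidates:
            raise ValueError(f"unknown entry point: {requested}")
        return requested
    if len(candidates) == 1:
        return candidates[0]
    raise ValueError("several entry points match; one must be specified")


candidates = entry_points_by_profile(graph)[WORKFLOW_PROFILE]
print(candidates)                              # one registry entry per item
print(choose_entry_point(candidates, "qc/"))   # engine-style explicit choice
```

A registry would iterate over `candidates` and create one entry each, while an execution engine would call something like `choose_entry_point` and surface the ambiguity to the user.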


elichad commented Nov 13, 2024

Intending to discuss this issue at the RO-Crate community call at 8:00 UTC tomorrow

@marc-portier

Missed the meeting and thus the discussion in the call.
Still sharing my two cents.

From a semantic point of view, nothing prevents you from declaring additional triples with the dcterms:conformsTo predicate attached to any available part (subject) in the graph. And I don't think the RO-Crate spec formulates any restriction on that either. If anything, the JSON-LD context just makes it handy to use conformsTo keys in the JSON-LD to add these.

Its value is expected to contain an identifier (URI) for a standard that

  • just conceptually represents a number of assumptions clients can make about the subject
  • allows those clients to verify if they have the knowledge on board to deal with that

As such, one could use conformsTo in RO-Crates in combination with

  • data entities of type File, to express e.g. that the file is not just a netCDF file but conforms to the CF conventions, or that a CSV file sticks to some layout or schema, ...
  • conceptual entities describing data services that e.g. conform to some web service API standard (like OGC WMS, ERDDAP, ...)
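
As a small illustration of the two points above, here is a Python sketch of File entities carrying `conformsTo`, plus a client-side check of whether the client knows every declared standard. The identifiers are assumptions for the sake of the example: the CF conventions URI is simply the project's website used as an identifier, the CSV schema URI and file names are made up, and `unsupported_standards` is a hypothetical helper.

```python
# Assumed identifiers: the first reuses the CF conventions website as a URI,
# the second is a made-up schema location.
CF_CONVENTIONS = "https://cfconventions.org/"
CSV_LAYOUT = "https://example.org/schemas/station-csv"

entities = [
    {
        "@id": "data/sst.nc",
        "@type": "File",
        "encodingFormat": "application/x-netcdf",
        "conformsTo": {"@id": CF_CONVENTIONS},
    },
    {
        "@id": "data/stations.csv",
        "@type": "File",
        "encodingFormat": "text/csv",
        "conformsTo": [{"@id": CSV_LAYOUT}],
    },
]


def as_list(value):
    """Normalise a JSON-LD value that may be missing, single, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def declared_standards(entity):
    """Collect the identifiers an entity claims to conform to."""
    return {ref["@id"] for ref in as_list(entity.get("conformsTo"))}


def unsupported_standards(entity, known):
    """Return the declared standards that this client has no knowledge of."""
    return declared_standards(entity) - set(known)


known = {CF_CONVENTIONS}
for entity in entities:
    missing = unsupported_standards(entity, known)
    status = "ok" if not missing else f"cannot interpret: {sorted(missing)}"
    print(entity["@id"], status)
```

Nothing here depends on the entity being the crate root, which is the point being made: the same check works on any subject in the graph.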

This way of applying dcterms:conformsTo exists outside the RO-Crate concept and can be applied to any part of it, as far as I can see. The fact that the RO-Crate specification additionally introduced some specific suggestions for expressing the conformity of RO-Crates was considered a useful and clear mechanism to guide people towards a kind of "duck-typed" declaration of valid assumptions about the crate contents. The fact that RO-Crate 1.2 introduces some guidance on this level

  • does not in any way limit other usage of this mechanism (including the suggested nesting)
  • nor should it raise the expectation that because of that the RO-Crate specification suddenly needs to control, document, or worse: forbid any more nested/detailed application of that same mechanism.

If anything, IMHO the RO-Crate spec should state that it deliberately does not want to interfere at that level of detail. And


elichad commented Nov 15, 2024

Summarising discussion from community call 2024-11-14:

  • Using the EntryPoint type is not strictly required since it could be inferred from checking if any conformsTo on an entity is an RO-Crate profile. However it is convenient (for tooling) to make entry points explicit within the crate metadata.
  • Using @type to indicate entry points may not be the best choice, as @type usually describes what the actual thing represented by the entity is (e.g. a File, a Person, a Place), and an entry point is just a construct in the metadata
    • the existing EntryPoint type in schema.org is intended for describing API endpoints and such, which isn't quite the same as our idea; we wouldn't want someone to describe an API using RO-Crate and end up with weird conflicts because of the overloaded type
    • we could find a different property to use (but we haven't found a good one so far)
    • in our example we also included the entry points under "about" in the metadata entity, which is another way to make them easily discoverable by tools
    • ISA profile uses additionalType to indicate Investigation, Study, Assay https://github.com/nfdi4plants/isa-ro-crate-profile/blob/release/profile/isa_ro_crate.md - potentially do something similar to indicate entry points?
    • Do we need a “Crate” type?
  • Highlighting entry points in a GUI: they are possible views of the crate
  • Profiles often talk about the root crate; an entry point would be a mechanism for profiles to talk about their root without that necessarily being the RO-Crate Root
  • If we add EntryPoint, there may only be certain properties that we should follow recursively to scope the “sub-crate”, e.g. hasPart, mentions, mainEntity. But which links should NOT be followed?
  • Alternative of using named @graph { fragments } to isolate the scope? Can get quite complicated.
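
Two of the points above (inferring entry points from conformsTo, and scoping a “sub-crate” by following only selected properties) can be sketched in Python as follows. This is not an agreed design: it assumes profiles appear in the graph as entities whose @type includes "Profile", it follows only hasPart, mentions and mainEntity as floated in the call, and the profile URI, file names and helper functions are hypothetical.

```python
from collections import deque

# Properties to follow when scoping a sub-crate (open question in the call).
FOLLOW = ("hasPart", "mentions", "mainEntity")
# Hypothetical profile identifier, used only for illustration.
PROFILE = "https://example.org/profiles/workflow-run"

graph = [
    {"@id": "ro-crate-metadata.json", "@type": "CreativeWork",
     "about": {"@id": "./"}},
    {"@id": "./", "@type": "Dataset",
     "hasPart": [{"@id": "run1/"}, {"@id": "paper.pdf"}]},
    {"@id": "run1/", "@type": "Dataset",
     "conformsTo": {"@id": PROFILE},
     "hasPart": [{"@id": "run1/out.txt"}],
     "mainEntity": {"@id": "run1/wf.cwl"},
     "author": {"@id": "#alice"}},
    {"@id": "run1/out.txt", "@type": "File"},
    {"@id": "run1/wf.cwl", "@type": "File"},
    {"@id": "paper.pdf", "@type": "File"},
    {"@id": "#alice", "@type": "Person"},
    {"@id": PROFILE, "@type": ["CreativeWork", "Profile"]},
]


def as_list(value):
    """Normalise a JSON-LD value that may be missing, single, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def infer_entry_points(graph):
    """Find entities whose conformsTo targets an entity typed as Profile."""
    by_id = {entity["@id"]: entity for entity in graph}
    found = []
    for entity in graph:
        if entity["@id"] == "ro-crate-metadata.json":
            continue
        for ref in as_list(entity.get("conformsTo")):
            target = by_id.get(ref["@id"], {})
            if "Profile" in as_list(target.get("@type")):
                found.append(entity["@id"])
                break
    return found


def sub_crate_scope(graph, entry_id, follow=FOLLOW):
    """Breadth-first walk from an entry point along the allowed properties."""
    by_id = {entity["@id"]: entity for entity in graph}
    seen = {entry_id}
    queue = deque([entry_id])
    while queue:
        entity = by_id.get(queue.popleft(), {})
        for prop in follow:
            for ref in as_list(entity.get(prop)):
                if ref["@id"] not in seen:
                    seen.add(ref["@id"])
                    queue.append(ref["@id"])
    return seen


print(infer_entry_points(graph))                 # ['run1/']
print(sorted(sub_crate_scope(graph, "run1/")))   # author link is not followed
```

In this sketch `#alice` falls outside the scope because `author` is not in `FOLLOW`, which shows why the choice of properties matters: a Person shared across sub-crates would be dropped unless the list is extended.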

elichad added this to the RO-Crate 2.0 milestone Dec 12, 2024

elichad commented Jan 13, 2025

We'll revisit this after the release of v1.2 when we discuss what will be included in v2 (February at the earliest, I think)
