Guidance on Linking Resources
This working document discusses OSLC guidance on how to represent, manage and interpret links between OSLC resources, and in general any linked data resources.
OSLC version 2.0 provided guidance on linking. This document will provide an updated set of guidance to be included as part of the Core 3.0 set of specifications.
Design Goals and Constraints
- Keep links simple.
- The term “Link” is a relationship between two resources when one resource (representation) refers to the other resource by means of a URI.
- Links are uni-directional.
- Links are represented in RDF with a single statement (triple).
- A link triple may be stored in any graph. It may be stored in the graph of the subject URI, the object URI, or some other arbitrary graph.
- The RDF representation of a resource is a general graph. The graph may have statements about the resource or other resources. The resource graph provides the context (which graph is asserting the statement) of the triple. The decision of which resource graph to insert a triple should be consistent with the normal lifecycle of the resource.
- Avoid creating aliases (multiple names for same meaning) for link predicates (true for properties in general).
- Promote the reuse of existing vocabularies.
- Do not rely on OWL and RDFS inferencing.
- Predicate URIs should be directly resolvable and its properties include an rdfs:label that is used in user interfaces. There may be multiple rdfs:label for various languages, where the format indicates how to represent associated encoding, see RDF plain literal.
- Properties on a link can be stored using reified statements, however do NOT use reified statements to cache information about subject or object resources. They should only represent properties of the link itself.
- Avoid caching resource information outside of the service provider that owns that information. If caching is used, it should be treated like HTTP caching.
- To improve performance, minimize HTTP message flows.
- Maintain alignment with principles in the Linked Data Platform.
- Maintain alignment with the Core 3.0 guidance on vocabularies.
HTTP resource and RDF resource
A resource accessible with an HTTP URI using the standard HTTP methods (HEAD/GET/PUT/DELETE) is considered an HTTP resource. It may contain a graph of RDF data, whose subject URI may or may not include the HTTP resource URI. In the absence of named graphs in the resource, consider all the statements part of the graph identified by the HTTP Request-URI. An HTTP resource has an ETag and a Last-Modified date. These values apply to the graph, not necessarily to the HTTP resource. Most of the time the triples in the graph have the same subject URI as the HTTP resource URI.
An RDF resource is simply a URI, about which statements can be made. When we refer to an RDF resource, we are talking about the subject URI shared by a set of triples. Hence, a graph may refer to more than one RDF resource.
It is possible to assert statements about a subject URI yet store the statements at a different URI than the subject. This might be the case when the user does not have rights to edit the HTTP resource
Requirements
A set of requirements that the link guidance will satisfy. The link guidance statements to be part of the OSLC Core 3.0 set of specifications shall ensure that the following requirements could be satisfied by OSLC service provider and client implementations.
- Users can create links from one HTTP resource to another from any resource’s context (i.e. user interface native to the resource’s origin).
- When viewing an HTTP resource in a user interface, the user wishes to see all the links that the resource is either a subject or an object of.
- User interfaces show links with human readable link types, and human readable labels (oslc:shortTitle, or dcterms:title) for both the subject and object of the link.
- User interfaces should use human-readable link role names. Role names may vary for subject, object and link type, but not by link instance (i.e., there will not be support for two links of the same type to have different role names).
- Additional properties can be stored on the link itself, however they should describe the link and NOT the object or subject.
- If a user does not have view rights to a graph that a link triple is stored in the link is not visible to the user.
- Conceptually a link appears as a property of an RDF resource. This property is normally stored in the graph identified by the subject URI, but it may be stored anywhere. When a client GETs a URI used as the subject URI in an RDF triple, the server returns the entire graph stored at that URI. This graph is the HTTP resource. ETag and Last-Modified headers represent HTTP resources. A HTTP resource is a single graph that may or may not contain all the properties of the RDF resource with the same subject URI.
- When a link is created or edited the subject RDF resource’s dcterms:modified and dcterms:contributor properties should change. However if the link is managed in some other graph, the service provider might not know to change those property values. Clients should be aware that those properties might not be conceptually accurate across all graphs in all service providers.
- Any system of OSLC service providers should provide a way/mechanism for a client to get all links to and from a given RDF resource.
Use Cases
- Find all links - A client finds all the links that a resource is either the subject or object of.
- Create link - A client creates a link from a resource to another
- Create link in other resource - A client creates a link from a resource that it does not have edit rights on, to another resource that it does.
Additional use cases to be discussed
- As a administrator, I should be able to restrict what links can be made between what resources, so that I can enforce a consistent information architecture.
OSLC Core 3.0 Linking Guidance
- The link (statement) should be stored with a single document/graph that asserts the relationship.
- In cases where there are multiple and possibly conflicting assertions have been made, an application may have domain knowledge of which source (assertion) it has more confidence in.
- Link predicates should be directly resolvable.
- Clients should not make assumptions about the type or properties of a resource that is the target of a link based solely on the link predicate and the rdf:type of the subject resource.
- When absolutely necessary properties can be associated with a link statement itself using reified statements. However these properties should not be used to cache or store properties about the subject or object resources. They should only store properties about the link itself, not the resources.
- When defining a new vocabulary with link types, there should be exactly one predicate for each link type concept. That is, avoid defining two different link predicates representing the same meaning, with just the subject and object reversed (e.g. validates / validatedBy ).
- Clients should be aware that links may be stored in other graphs. Clients wishing to get all links to/from a given resource (or set of resources) may use query solution. For example, solutions such as a central index, SPARQL endpoints, OSLC Simple Query Syntax could be used.