[oslc-core] Discussion on vocabulary design
Ian Green1
ian.green at uk.ibm.com
Thu Aug 26 06:19:01 EDT 2010
Hello
We briefly discussed predicate names and Range specifications on
yesterday's call.
Concern has been expressed in the past that OSLC is designing vocabularies
/ specifications which require assumptions to be made about linked data,
and which are specialized rather than generalized.
The dublin core vocabulary, which we use, has dcterms:creator. It does
not have "dcterms:creatorFOAFPerson" "dcterms:creatorFOAFAgent" and so on.
This would be a unwieldy vocabulary. It would be difficult to maintain
as new types of "person" were defined, would not be forwards compatible (a
client that knew about creatorFOAFPerson would not deal with
creatorFOAFRobot). If that client also knew about contributorFOAFPerson
etc. each new person type would induce two new predicates that the client
would need to deal with - queries, UI, etc.
One reason these vocabularies scale is that they are loosely coupled and
highly cohesive. Do we think the same is true of OSLC vocabularies?
For example, a ChangeRequest implements a Requirement:
This is reflected in CM specification as follows (i'm eliding the
namespaces):
- the name of the predicate - implementedByChangeRequest
- the Range specifier in the written specification - Requirement
And in the RM specification as follows:
- the name of the predicate - implementedBy
- the Range specifier - unspecified.
In the RM specification there is no suggestion/requirement that a
Requirement be implemented by a ChangeRequest - the name of the predicate
is enough to capture the notion of "implementation", but makes no other
constraint or implication (to the human reader of the specification, and
to consumers). The Range is also unspecified. Whilst OSLC Core is silent
on the meaning of the Range (at least I can't see it explained), there is
a risk that clients will misbehave in the case that the object of a
implementsRequirement link were something other than a Requirement.
But this is not just about writing robust clients - it is about designing
an open resource model that is flexible, extensible, composable etc.
Characteristics such as forwards compatibility are desirable. For
example, if we followed the "type-in-the-name" style a new predicate
"implementsModel" would be needed to support a scenario in which a
ChangeRequest could implement an AM resource. Clients interested in
"implementation" relationships would have to be upgraded to know about
implementsModel in addition to implementsRequirement. There is a
combinatorial problem here, since over time the number of relationships
will grow, as will the number of resource types.
My inclination is to factor "implementsRequirement" these into
"implements" and "type of thing - Requirement". We already have each of
these notions separately in our OSLC resource models - name of predicate
and rdf:type.
Another extreme is to consider all such relationships to be equal, and
call them all say "relatedTo". This would be be problematic for another
reason - it does not say enough about the nature of the relationship. In
RDF we can't specialize a predicate - each edge on the graph has a fixed
URI, so to give additional meaning we need to pick a different predicate -
there is no way to factor "implements" into "related" and something else
[1,2].
best wishes,
-ian
[1] Link properties could be used to express this "specialization" of a
predicate - but that is a specialization of an instance, not a
specialization of the predicate.
[2] RDFS would be one way to express such relations between predicates,
but I'm not suggesting that here.
Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number
741598.
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
More information about the Oslc-Core
mailing list