[oslc-core] Discussion on vocabulary design

Tue Aug 31 17:04:25 EDT 2010

Ian,

All vocabularies are not created equal. In the case of RDF, there is the 
notion of Upper Ontologies which are very general and intended to be 
extended by others. Creating a good Upper Ontology is a lot of work and 
probably not what we are trying to achieve at OSLC. I think we are trying 
to create data models for well-defined domains of software and systems 
engineering. I don't expect others to cherry pick useful predicates from 
our vocabularies and use them in other contexts, the way we are trying to 
use Dublin Core.

It is very important that OSLC specs do use other vocabularies when 
appropriate ones exists since that helps create a connected Web of data. 
Dublin Core and FOAF are good examples, however we do sometimes struggle 
to adapt  Dublin Core terms to our case, In some cases it may be clearer 
to coin a new predicate that has a well-defined meaning. Using Upper 
Ontologies is problematic for OSLC since doing so requires that we express 
the extensions using concepts from RDFS or OWL, which leads us into 
inferencing. That means in practice we have to invent stand-alone 
predicates.

I do agree that in general we shouldn't have to burn the exact range into 
the URI, however it may be appropriate in some cases to have more 
specialized predicates. For example, in genealogy having predicates for 
mother and father is handy even though a minimalist might be happy with 
just parent. However, not many would advocate parentMale and parentFemale.

Regards, 
___________________________________________________________________________ 

Arthur Ryman, PhD, DE

Chief Architect, Project and Portfolio Management

IBM Software, Rational

Markham, ON, Canada | Office: 905-413-3077, Cell: 416-939-5063
Twitter | Facebook | YouTube

From:
Ian Green1 <ian.green at uk.ibm.com>
To:
oslc-core at open-services.net
Date:
08/26/2010 07:06 AM
Subject:
[oslc-core] Discussion on vocabulary design
Sent by:
oslc-core-bounces at open-services.net

Hello

We briefly discussed predicate names and Range specifications on 
yesterday's call. 

Concern has been expressed in the past that OSLC is designing vocabularies 

/ specifications which require assumptions to be made about linked data, 
and which are specialized rather than generalized.

The dublin core vocabulary, which we use, has dcterms:creator.  It does 
not have "dcterms:creatorFOAFPerson" "dcterms:creatorFOAFAgent" and so on. 

 This would be a unwieldy vocabulary.  It would be difficult to maintain 
as new types of "person" were defined, would not be forwards compatible (a 

client that knew about creatorFOAFPerson would not deal with 
creatorFOAFRobot).  If that client also knew about contributorFOAFPerson 
etc. each new person type would induce two new predicates that the client 
would need to deal with - queries, UI, etc.

One reason these vocabularies scale is that they are loosely coupled and 
highly cohesive.  Do we think the same is true of OSLC vocabularies?

For example, a ChangeRequest implements a Requirement:

This is reflected in CM specification as follows (i'm eliding the 
namespaces):
        - the name of the predicate - implementedByChangeRequest
        - the Range specifier in the written specification - Requirement

And in the RM specification as follows:
        - the name of the predicate - implementedBy
        - the Range specifier - unspecified.

In the RM specification there is no suggestion/requirement that a 
Requirement be implemented by a ChangeRequest - the name of the predicate 
is enough to capture the notion of "implementation", but makes no other 
constraint or implication (to the human reader of the specification, and 
to consumers).  The Range is also unspecified. Whilst OSLC Core is silent 
on the meaning of the Range (at least I can't see it explained), there is 
a risk that clients will misbehave in the case that the object of a 
implementsRequirement link were something other than a Requirement.

But this is not just about writing robust clients - it is about designing 
an open resource model that is flexible, extensible, composable etc. 
Characteristics such as forwards compatibility are desirable.  For 
example, if we followed the "type-in-the-name" style a new predicate 
"implementsModel" would be needed to support a scenario in which a 
ChangeRequest could implement an AM resource. Clients interested in 
"implementation" relationships would have to be upgraded to know about 
implementsModel in addition to implementsRequirement.  There is a 
combinatorial problem here, since over time the number of relationships 
will grow, as will the number of resource types.

My inclination is to factor "implementsRequirement" these into 
"implements" and "type of thing - Requirement".  We already have each of 
these notions separately in our OSLC resource models - name of predicate 
and rdf:type. 

Another extreme is to consider all such relationships to be equal, and 
call them all say "relatedTo".  This would be be problematic for another 
reason - it does not say enough about the nature of the relationship.  In 
RDF we can't specialize a predicate - each edge on the graph has a fixed 
URI, so to give additional meaning we need to pick a different predicate - 

there is no way to factor "implements" into "related" and something else 
[1,2].

best wishes,
    -ian

[1] Link properties could be used to express this "specialization" of a 
predicate - but that is a specialization of an instance, not a 
specialization of the predicate.

[2] RDFS would be one way to express such relations between predicates, 
but I'm not suggesting that here.

Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU

_______________________________________________
Oslc-Core mailing list
Oslc-Core at open-services.net
http://open-services.net/mailman/listinfo/oslc-core_open-services.net