[oslc-plm] Versioning, branching, configurations, baselines, streams, work flow process

David Honey david.honey at uk.ibm.com
Wed Sep 14 07:29:54 EDT 2011


Dear OSLC PLM workgroup member,

As promised in the OSLC PLM workgroup meeting (2011-09-13), I have 
described below some concepts relating to versioning, branching, 
configurations and baselines and streams as seen across a range of SCM 
tools. The intent of this is to:
Act as a catalyst for the workgroup meeting next week (2011-09-20).
Help establish which of these concepts exist in the PLM world.
Determine if different terminology is used.
Determine the versioning requirement concepts needed to support the use 
cases for the PLM reference model.

The challenge in getting such concepts into the OSLC core spec is to find 
the right balance between conflicting issues:
The abstraction needs to be flexible enough that it can be supported by 
most tools.
It needs to be specific enough to allow OSLC consumers and providers to 
understand the semantics and derive value from it.

Regards,
David.

-------------------------------------------------------------------------

Version, Revision
In the SCM world, the terms version and revision commonly mean the same 
thing. There are two versioning models commonly in use:
The versioned object model. Each CI (configuration item) such as a file 
can undergo changes that result in new versions. Each such object version 
has a specific identity, often using the version or revision id as part of 
that UUID. Most SCM tools use this model: SCCS, RCS, CVS, CMS, PVCS, 
Clearcase, Synergy to name a few. This is the most common versioning 
model.
The versioned repository model. The tool provides a versioned repository 
and each change committed to that repository results in a new version or 
revision of the repository. Files and directories are not individually 
versioned. A question such as "Who made the change in version 5 of file 
F?" are meaningless. Instead, one asks "What owner does file F have in 
version 5 of the repository?". Subversion is the best example of this 
model. It also means that you can have the same file version in many 
different versions of the repository.

Reconciling these two models is awkward. One can partly map #2 into #1 by 
representing the history as the repository versions for each changed CI. 
For example, a file might have a history such as:  234 --> 467 --> 1234. 
The first version was created in repository revision 234. The next change 
was in repository revision 467. The last change was in repository revision 
1234. However, there can be other significant complexities.

Locking models
There are two locking models commonly used in SCM tools:
Locking at check out - In this model, when a user wants to make a change, 
they typically create a lock on the leaf version to prevent another user 
from attempting to make a parallel change. Some tools support both 
exclusive locks and shared locks. A new version of the object is not 
created until the user commits a change to the system. That commit 
releases any lock on the object that was checked out. This is the most 
common model.
New version created at check out. There is no locking as such. When a user 
wants to make a change, they check out the file, and that creates a new 
version that is modifiable by the user. If parallel versions are to be 
restricted, the restriction is applied at check out and/or check in. This 
model is used by Rational Synergy.

Some tools such as Subversion effectively allow one or many users to 
create a local workspace that has a copy of the original files. Those 
copies are not recorded in the repository. Users can delete these 
workspaces without needing any access to the tool repository.

While the locking model itself is probably not of interest to the OSLC 
core group, it may be relevant in terms of when OSLC resources are known 
and how OSLC consumers can find out who has checked out a particular file.

Branching and parallel development
SCM tools have various concepts around branching and parallel development. 
Commonly used models here are:
Branch as an explicit entity. Some tools require that in order to create a 
parallel version, this must be done on a branch. The branch is created as 
a separate resource, either explicitly, or implicitly as part of the 
parallel check out. Rational Clearcase is an example of this type.
Implied branching. In this model, parallel versions may be checked out and 
are associated with some context. That context is often related to a 
release or a stream. In this model, if a change is required for a variant, 
this does not require a specific branch. The object version's associated 
context (such as a release, task, stream etc) determines that the change 
only gets included in a specific variant or release. Rational Synergy and 
Rational RTC are examples of tools that follow this model.
No branching. In this model, there is no concept of a branch at all. The 
effect of branching may be simulated by copying data to a different area 
of the same version repository, or a different repository. Subversion is a 
modern tool using this model.

Configurations
If an SCM tool supports a configuration it usually means a versioned 
container. That container records the structure of its members and which 
versions of those members are used in the configuration. Support for this 
varies widely:
No configurations. Some tools only version files and do not version 
structures. Tools such as SCCS, RCS and CMS provided no means to version 
control structure.
A tool using the versioned repository model simply treats a configuration 
as a directory somewhere in the respository at some respository revision 
level. Subversion is a good example of this.
Explicit configurations. These are explicit versioned objects in the 
repository, and can be checked out, updated, compared and so on. Rational 
Synergy is an example of this type.

Baselines
In the SCM world, the term baseline usually means an immutable record of a 
collection of files. The details vary between SCM tools, and some tools 
may not have an explicit representation. Examples include:
Tools that only support tagging. Tools of this type allow the user to use 
some user-defined string tag to one or a collection of objects. An object 
may have multiple tags. SCCS, RCS and CVS are examples.
Tools that implicitly support baselining. Subversion is an example. It 
does not directly support baselining. Common practice is to copy a set of 
directories and files to a separate location within the repository, and in 
doing so, a new version of the whole repository is created. However, the 
user can make additional changes. The state of the repository immediately 
after the copy represents an immutable snapshot. However, those files can 
still be changed, and if so, that creates later revisions of the 
repository.
Tools that explicitly support baselines. When a baseline is created, it is 
represented as an explicit entity with its own meta-data. Rational Synergy 
is a good example.

Streams
Tools that support parallel development for different releases or 
variants, usually provide some means of defining a stream or a release. 
This allows the changes made by developers to target one or many 
deliverable releases or variants. The details tend to vary a lot between 
tools. Some examples include:
Streams by tagging. Early version control tools relied on the user 
appropriately tagging the new versions they created.
Explicit streams. Rational RTC is an example of this. Users work in a 
project area associated with a stream. Not all tools of this type make it 
easy to share changes across streams, or selectively merge parallel 
streams.
Explicit releases. This is very similar to explicit streams. In Rational 
Synergy, make changes against a task that is associated with a release 
value. That task may also be included in other releases according to 
process rules set up by a build manager.

Work flow process
Most modern SCM tools support a flexible and configurable work flow 
process. This process controls:
When users see changes from other developers.
What changes are collected for integration test builds and testing.
What changes are collected for system test builds and testing.

Inherent in this are the concepts of insulation and sharing. Insulation 
means that a work area (sometimes called a staging area) is not changed 
until the owner or manager of that work area decides they want changes. 
For example, developers may not want to see later versions from other 
developers until they have been checked in, and perhaps undergone some 
level of testing. However, at some time, it is usually required that those 
changes are shared with the team so that testing can use those changes, 
and shared with build managers so those changes get included in official 
builds and final deliverables.

The details of how tools support this varies widely:
In Rational Clearcase, config specs define the criteria for which version 
should appear in a view (work area). This supports both a push model that 
allows changes to be pushed to other work areas (sometimes suitable for 
RAD), and a pull model in which the owner has control over when that work 
area is updated.
In Rational Synergy, users decide when to perform an update members 
operation and this selects the appropriate versions according to process 
rules, and any corresponding work areas are updated.
In Rational RTC, users can choose which change sets to accept in the work 
area associated with a project (and stream). Users see the list of pending 
changes waiting to be either accepted into that work area, or delivered 
from that work area.

Change sets or task-based CM
Early SCM versioning tools simply versioned individual files. The only 
means of indicating that, say, 3 files were changed for a common purpose 
might be to either use some form of tagging, or they used a separate tool 
to describe a task or change request and entered that data at the time of 
check in. More modern tools support the concept of a change set or task 
that is a grouping of all of the changed files and directories for a 
single logical purpose. Again, the details vary between tools:
Rational RTC uses change sets. All the changes delivered in a single 
deliver are associated with a change set. A change set may optionally be 
associated with a work item.
Rational Synergy uses tasks. Developers must explicitly create a task, and 
all changes must be associated with a task. The completion of the task 
automatically checks in its associated objects. Work flow process is based 
on tasks.
Subversion treats a single commit of changes from work area to the 
repository as a single change that results in a new revision of the 
repository. In this way, all the changes made are implicitly linked by 
sharing the same repository revision.

UUIDs - Universally unique identifiers
Most but not all SCM tools have some property or sets of properties that 
form a UUID. SCM tools users tend not to be too concerned with UUIDs 
unless they want to create links from other tools to a specific version of 
a file or directory. However, from an OSLC provider viewpoint, the 
convention would be that dcterms:identifier would be such a UUID. Such a 
UUID might be constructed:
>From an archive file plus archive revision. For example, if you have a 
path for an SCCS or RCS archive file, and know the revision, you have 
uniquely identified the object.
>From a property that the repository created automatically. Examples here 
include Rational RTC and Rational Synergy.

Unfortunately not all tools make it easy to support UUIDs, In Subversion, 
the only UUID is one for the entire repository. If you access the 
repository with a specific path and a repository revision, that does 
unique identify a specific object version. However, there is a problem 
with aliasing. Users can copy the object to many different locations, and 
in effect this copies a link to the same object. Thus the same object 
version may be accessible over many different paths and repository 
revisions. Hence you might have different UUIDs that are aliases of an 
original UUID. 

Why is aliasing  a problem?

One of the newest OSLC areas is the TRS (Tracked Resource Set). This 
requires that an OSLC provider must be able to keep a change log that a 
TRS consumer can use to determine which cached resources need to be 
updated to reflect in near-real-time changes made since the data was 
cached. The problem is that a change made via an alias needs to be 
reflected in all the URIs for the same object version. Some tools may have 
problems conforming to the OSLC TRS specification because they do not have 
explicit UUIDs.






Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU





-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://open-services.net/pipermail/oslc-plm_open-services.net/attachments/20110914/ad4e7448/attachment-0003.html>


More information about the Oslc-Plm mailing list