OASIS Topic Maps Published Subjects Technical Committee
Pubsubj > Documents > Deliverables > 1. Definitions, Requirements and Examples

 

OASIS Topic Maps Published Subjects TC Deliverables
1. Published Subjects - Definitions, Requirements and Examples

Version 0.1 - last updated 2002, June 27
Latest version : http://www.oasis-open.org/committees/tm-pubsubj/docs/recommendations/general.htm
Editor: Bernard Vatant

Status of this document :
Draft proposal, following May 24 Barcelona meeting and June 25 conference call


1 - Scope and Statement of Purpose

The OASIS Topic Maps Published Subjects Technical Committee has been set forth to help wide-spread adoption of topic maps specification ISO 13250, singularly by publishers and users of classifications, taxonomies, thesaurus, registries, catalogues, directories ... needing standard ways to make their legacy available for topic map applications.

The first and main target of this recommendation is therefore topic maps interoperability, through efficient definition and identification of subjects represented by topics in topic maps.

This initial target is likely to be extended in the future to a wide range of applications or technologies making explicit use of abstract representations of subjects. Those include for example applications leveraging ontologies or vocabularies, search engines, intelligent agents, and other foreseeable or yet unknown Semantic Web tools. Throughout the present document, the generic term applications is used to refer to either topic maps applications or any other such technologies.

Both identification of subjects by applications and definition of subjects for their human users, can be provided by stable resources, made available through trustable publication process, as defined by the present and following recommendations. Subjects defined and identified in such a way are called published subjects.

The purpose of this document is to set recommendations for published subject documentation, that will provide structured sets of published subject indicators (human-readable definition of subjects), addressable and identified by published subject identifiers (stable URIs used by applications to identify the subjects).

2 - A gentle introduction to Published Subjects terminology

2.1 - Subjects and Topics

A subject can be an individual, like "Isaac Newton", "IBM, Inc.", or "Paris (France)" ... or a class of such individuals, like "famous scientists" "software companies" or "towns" ... or a more abstract concept like "gravitation" "economic growth" or "baroque style"... In short, a subject can be anything deserving to be identified, named, represented and generally talked about - otherwise said a subject of conversation. Topic Maps specification XTM 1.0 proposes an extremely general definition of a subject.

Definition 1

A subject is anything whatsoever, regardless of whether it exists or has any other specific characteristics, about which anything whatsoever may be asserted by any means whatsoever.

Applications deal with a subject by handling a formal representation or proxy, that will be called throughout this document a topic, to conform to topic maps terminology.

Definition 2

A topic is a representation, inside an application, of an unique, well-defined, and non-ambiguous subject.

2.2 - Subject Indicators and Subject Identifiers

Applications inter-operability needs that authors, users, and applications themselves, must have ways to agree if two or more topics in the same or different applications represent the same subject.

How can that be achieved? An application can indicate to human users what is the subject of a topic by referring to a document, or any other kind of resource, where the subject appears to be defined or described in a human-readable and non-ambiguous way. Such a resource is called a subject indicator. Provided with this resource, an human user will be able, hopefully, to know what subject this topic represents.

Definition 3

A subject indicator is a resource that is referred to by an application, to provide an unambiguous indication of the identity of a subject to a human being.

In the Web universe, a subject indicator has to be an addressable (network-retrievable) resource. The reference to a subject indicator will therefore use some URI, which will both address the subject indicator and identify the subject. Applications will use this URI as a subject identifier.

Definition 4

A subject identifier is an URI that refers to a subject indicator, and provides an unambiguous identification of a subject to an application.

2.3 - Published Subjects

Unfortunately, the whole above scenario is too simple to be sustainable. Any resource can become a subject indicator by being referred to as such by an application, whether or not this resource was intended by its publisher to be a subject indicator. The subject indicators and subject identifiers defined in such a process are likely to be untrustable and unstable. URIs and the resources they address are moving targets in the Web universe. The publishers of resources used as subject indicators might not even be aware of it, and are likely to leave applications and users with meaningless identifiers and indicators, if any indicator at all, without previous notice.

Here the publishers enter in the loop. If some publishers are aware of the whole problem, and want to provide applications with stable, trustable, authoritative subject indicators and identifiers, the situation is far better. The publishers can provide sets of subject indicators and subject identifiers published in a standard way, and declare their intention to maintain them stable and trustable for topic maps and other applications. At that point, the applications and users are provided with published subjects, defined in published subject documentation, containing published subject indicators addressable through published subject identifiers. They will use them as above, but the whole scenario will become really sustainable.

Definition 5

A published subject is a subject for which there exists at least one published subject indicator.

Definition 6

A published subject indicator is a subject indicator that is published and maintained at an advertised address in order to facilitate interoperability of applications.

Definition 7

A published subject identifier is the URI of a published subject indicator, chosen and declared by its publisher as the URI to be used by applications to identify the published subject.

The topic maps literature has coined the acronym "PSI", also used in XTM 1.0 specification. Note that it can expand both as "published subject indicator" and "published subject identifier". Those are two faces of the concept, one looking at humans (the indicator), and one looking at computers (the identifier). Like Janus Bifrons over Roman doors, PSIs are warrants of a good communication between two universes.

3 - Requirements and Recommendations

The following are the basic requirements and recommendations for PSIs

Requirement 1 :

  • A Published Subject Identifier must be an URI.

Requirement 2 :

  • When a Published Subject Identifier is an URL,
    it must resolve to an human interpretable Published Subject Indicator.

Recommendation 1 :

  • A Published Subject Indicator should provide some way to identify its publisher.

Recommendation 2 :

  • Any relevant metadata may be specified by the Published Subject Indicator.

Recommendation 3 :

  • The Published Subject Indicator should either have a machine-processable structure, or link to a resource providing machine-processable expression of assertions about the subject (or both)

4 - Examples

Draft proposal under discussion : Apples and Oranges