Proposal for Document Process metadata, a new standard
|
This document is a draft proposal for a proposed new standard for so called Document Process metadata. It contains the following parts: |
|
This document is a draft proposal for a proposed new standard for so called Document Process metadata. The purpose of this draft proposal is to provisionally define the new standard well enough so that interested parties can understand the scope of the standard. The intention is that some of these interested parties will then express their desire to participate in an expert group whose first function will be to fully define the proposed standard. This proposal is not intended not to create
a protocol for delivering detailed information, but to enable sharing and
filtering of common documents. Other projects focus on XML solutions that
cover the aspects of delivering exact and in detail described structured
information. This proposal focus on general properties, that are more limited
and more difficult to select in many aspects. |
|
|
Information about information, or data about data, that is, metadata has received a new focus thanks to the enormous spread of information facilitated by the world-wide web. The current de facto standard for bibliographic metadata, Dublin Core, established 1994 is an example of document-based, rather than database-based, metadata. Information used at work and that travels through work flows is dependent on systems, applcations and organizations with differing designs. Documents are derived from different sources, e.g. databases and word processors, and used in different ways, e.g. just for printing or published on a net. Organizing or sharing digital information is extremely difficult, if not impossible, when there is no infrastructure in place that covers all these sources. As a result the costs of integrating diffrent systems increases exponentially. Accordingly there is a need for a structured set of metadata that deals with the processes a document is involved in, what we call Document Process (DP) metadata |
|
There are different ways to store metadata, for example: · Document databases (closed solutions for authoring, access and flows) · HTTP-DAV (The new web-protocol for authoring and versioning standard support properties) ·
Local properties in file systems (NT2000) or documents (Word, Star Office,
etc..) that act as metadata contaners We propose that DP metadata be stored in a database and that certain portions of it be replicated be within the document. There are several ways to handle metadata, for example: · Document databases (closed solutions for authoring, access and flows) · HTTP-DAV/SAL (The new web-protocol for authoring and versioning standard support properties) · Metadata search engines (that search distributed documents over network) |
|
The organization and structure of the proposed DP metadata must combine the following: · Usability and efficiency in common tools, e.g. MS Word, HTML, SGML · Simplicity that scales from small to global organizations · Basic flows that can be extended to sub-group classifications ·
Coverage of basic information-sharing needs between organizations This proposed structure for DP metadata is presented below. |
|
This is the proposed classification of DP metadata elements: Groups of elements Element, or classes of elements Elements Source Identifier unique doucument identifier Creator Same as Dublin Core Publisher Same as Dublin Core Date Same as Dublin Core Process Class see Elements – Class below. Status see Elements - Process States below. Relations see Elements – Relations below. Role see Elements – Role below. Publishing One of a number of pre-defined states (as yet undefined) in a publishing process Versioning Version information
(defined in other standards) Contents Title Same as Dublin Core Description Same as Dublin Core Resource Type Same as Dublin Core Language Same as Dublin Core Coverage Same as Dublin Core Rights Same as Dublin Core |
Element constructs of DP metadata
|
This section contains the preliminary definitions of the proposed new DP metadata elements. A list of the Dublin Core (DC) elements covering biblographic information is in the final section. |
|
Element - Class |
This element should be called DP.CLASS and
can be seen as a DP specific variant of DC Type. It should have the following
sub-elements:
|
|
This element should be called DP.STATE (no
correlation with DC) and have the following sub-elements (these elements
do not relate to stages in a publishing process):
|
|
This element should
be called DP.PROCESS and have the following sub-elements:
|
|
This element should
be called DP.ROLE and have the following sub-elements:
|
This is an overview of existing Dublin Core elements that cover bibliographic information. (se http://www.dublincore.org)
|
Dublin Core Elements |
Description |
|
Title |
A name given to the resource. |
|
Creator |
An entity primarily responsible for making the content of the resource. |
|
Subject and Keywords |
The topic of the content of the resource. |
|
Description |
An account of the content of the resource. |
|
Publisher |
An entity responsible for making the resource available |
|
Contributor |
An entity responsible for making contributions to the content of the resource. |
|
Date |
A date associated with an event in the life cycle of the resource. |
|
Resource Type |
The nature or genre of the content of the resource. |
|
Format |
The physical or digital manifestation of the resource. |
|
Identifier |
An unambiguous reference to the resource within a given context. |
|
Source |
A Reference to a resource from which the present resource |
|
Language |
A language of the intellectual content of the resource. |
|
Relation |
A reference to a related resource. |
|
Coverage |
The extent or scope of the content of the resource. |
|
Rights |
Information about rights held in and over the resource. |