Download Government of Canada Metadata Implementation Guide for Web

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

URL redirection wikipedia , lookup

Transcript
Government of Canada
Metadata Implementation Guide
for Web Resources
2nd edition – May 2003
Prepared by the
Ad Hoc Committee of Federal Metadata Experts
Metadata Action Team
Council of Federal Libraries
st
1 edition entitled "Common Look and Feel Metadata Implementation Guides", September 2002
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Table Of Contents
Acknowledgements
Chapter 1:
1.1
1.2
1.3
1.4
1.5
1.6
Introduction
What is Metadata?
Why Does It Matter?
Why a Common Standard?
Common Look and Feel (CLF) Mandatory Metadata Elements
Optional Metadata Elements
HTML Metatags
Chapter 2: Guidance for Metadata Developers
2.1
General Guidance
2.2
Guidance Applicable to All Metadata Elements
CLF Required Elements
2.3
Creator
2.4
Date
2.5
Language
2.6
Subject
2.7
Title
Optional Elements
2.8
Geographic Coverage
2.9
Description
2.10
Type
Chapter 3:
3.1
3.2
3.3
Guidance for Managers
Introduction
Departmental Information Management Activities and Metadata Implementation
Meeting the Metadata Requirement
Chapter 4:
4.1
4.2
4.3
4.4
4.5
4.6
4.7
4.8
Specific Metadata Elements - Considerations
Creator
Date
Subject
Title
Geographic Coverage
Description
Type
References
Chapter 5: Updates to the Guide
Chapter 6: Staying on Top of Government of Canada Metadata Policies and Developments
Chapter 7: For More Information
Appendix A: Options for Linking Metadata to Web Pages
Appendix B: Sample CLF Metadata Source Code
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Acknowledgements
This Guide is a project of the Council of Federal Libraries' Metadata Action Team. The Guide draws on
the expertise of metadata practitioners in several disciplines and government departments who pooled
their thoughts on the issues and best practices involved in the implementation of Treasury Board
1
Common Look & Feel (CLF) Standard 6.3 and Treasury Board Information Management Standard
2
(TBITS) 39 . The practitioners' expertise, time and commitment were invaluable in the creation of this
document.
For their support in the creation of this edition, the Council of Federal Libraries thanks:
Name
Department
Maria Bencsath
Agriculture and Agri-Food Canada
Diane Brooks
Indian and Northern Affairs Canada
Marie-Claude Côté
Treasury Board Secretariat
Nikki Cox
Human Resources Development Canada
Alain Lanoix
Library of Parliament
Gay Lepkey
Depository Services Program, Communication Canada
David L. McCallum
Consultant (Group Facilitator and Writer)
Cindy Mitchell
Natural Resources Canada
Tony Moren
Statistics Canada
Karen Morgenroth
CISTI, National Research Council Canada
Sylvie Régimbal
Veterans Affairs Canada
Gregory Renaud
Treasury Board Secretariat
Cecil Somerton
Health Canada
Deane Zeeman
National Library of Canada (Group Coordinator and Editor)
Team members thank Carole Julien, CFL Library Officer, for the excellent administrative services that
kept the background running so smoothly.
The Council of Federal Libraries (CFL) also wishes to acknowledge the contribution of the many
volunteers from federal government departments who reviewed the Guide with an eye to ensuring that it
would meet the needs of metadata developers and managers.
The Guide was produced with support from the Information Management Division of the Chief Information
Officer Branch of Treasury Board Secretariat.
1
Government of Canada Web sites must include five metadata elements to describe their Web resources: Title,
Creator, Language, Date and Subject.
2
The Dublin Core metadata standard (http://dublincore.org/) adopted by Treasury Board ministers as the metadata
standard for the Government of Canada.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Chapter 1
Introduction
In the year 2000, the Treasury Board of Canada issued a government-wide directive making it mandatory
for federal government Web sites to include descriptive metadata, structured information about the
content of Web-based resources. A Web resource is defined as a single Web page, a document
(consisting of multiple Web pages), a digitized image, a sound file, or an animation such as a movie.
3
The metadata requirements are stated in Common Look and Feel (CLF) Standard 6.3 .The official
deadline for implementing this standard for external Web sites was December 31, 2002. Requirements
are still in effect.
4
The first edition of the Metadata Implementation Guide was written by an ad hoc group of federal
metadata experts facilitated by the Metadata Action Team of the Council of Federal Libraries in June
2002. Its purpose was to assist those responsible for implementing this standard, and for the ongoing
maintenance of the metadata mandated by it. This second edition was undertaken by a similar ad hoc
group of experts in February 2003.
The introduction to this edition explains what metadata is and why it is important, and sets out the
metadata elements that Government of Canada Web sites are expected to carry in order to comply with
CLF Standard 6.3.
The section for metadata developers offers practical assistance to those responsible for creating
metadata content for federal government Web resources in accordance with Government of Canada
metadata standards. It explains the tasks to be performed, demonstrates how the required information
should be created, and directs users to other resources. It also provides instructions on how and where to
insert the required source codes into a Web document.
The section for metadata managers offers assistance to managers responsible for developing metadata
management policy and for meeting Government of Canada metadata standards in their departments or
agencies.
Your comments on these documents and suggestions for improvement are welcomed. Please send your
input to the Metadata Coordinator at the National Library of Canada (mailto:[email protected]).
1.1
What is Metadata?
Metadata is structured information about the characteristics of a physical or digital object.
Imagine an unlabelled, sealed tin can. It could contain motor oil, red kidney beans or cat food. The only
way to ascertain its contents is to open it. But if it had a label, you'd know what was inside without
opening it. And if it had a really useful label, you'd know at a glance everything you needed to about the
contents of the tin and be able to make an informed decision about whether to buy and/or open it.
Metadata serves the same function as a label. Just like other labels, metadata provides information about
the contents of an object – in this case, an electronic resource on the World Wide Web. The information
provided facilitates resource discovery in the same way that library catalogues provide paths to required
information. Metadata is used by search engines to improve the matching between user queries and
descriptions of resources indexed by the engine.
A typical label for a tin can of red kidney beans might consist of the following information:
title:
creator:
ingredients:
date of manufacture:
3
Yummy Delicious Red Kidney Beans
Hubert's, part of the Vegetables Division of the World Wide
Foods Branch of the BWG Corporation
red kidney beans, distilled water, salt, ascorbic acid
2002-06-14
See CLF - Navigation and Format Standards and Guidelines - Standard 6.3 at http://www.cio-dpi.gc.ca/clfupe/6/6a_e.asp#6.3).
4
Common Look and Feel Metadata Implementation Guides (http://www.nlc-bnc.ca/cfl-cbgf/s37-4016-e.html).
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
The label could also contain additional information, from allergy alerts to a toll-free number for
consumers. Some jurisdictions, including Canada, require food processors to include a certain amount of
mandatory information on their product labels or packaging. In addition, Canadian regulations require that
the information be expressed in both of Canada's official languages.
The Government of Canada's metadata standards are analogous to Canadian product labeling
regulations in that they require that federal government Web pages contain certain mandatory information
(metadata), expressed in a certain way (standard).
The basic unit of metadata is a two-part set called a metadata element. The first part, the name, specifies
a property being described such as title, creator, language, subject matter or date of creation. The second
part, the content, provides specific information about the named property of a particular resource and
distinguishes it from other resources. Metadata elements sometimes include a third part, called a
scheme reference, which identifies that the content information has been selected from a specified list of
possible entries for a given label name. For example:
<meta name="XXXXX" scheme="XXXXX" content="XXXXX">
Metadata elements are sometimes called metatags or tags, and creating the content for these elements is
also referred to as meta tagging.
1.2
Why Does It Matter?
Good metadata helps people find the information they're looking for. A search through unstructured text
(i.e. a full-text search) or uncontrolled terminology (i.e. keywords) may yield tens of thousands of results,
the vast majority of which are often completely irrelevant to the searcher. The structure of metadata
allows searching for terms in discrete elements (e.g. title, subject) of metadata records. The resulting
search results are smaller and more focused.
Quality control and consistency are important. If a Web resource lacks essential metadata, or if the
metadata is inaccurate or incorrect, search engine results will be negatively affected.
In the Government of Canada context, metadata serves three separate but interrelated functions
simultaneously:
•
Direct resource discovery by Canadians, a primary goal of Government Online.
Web crawlers and indexes in departments, at the Canada Site and at other government information
aggregation services are configured to use the metadata to create searchable central indexes of
departments and agency Web resources.
•
Information sharing and exchange between federal organizations and their partners.
Metadata will also be used by clusters and gateways to identify appropriate content.
•
The information management needs of federal organizations.
1.3
Why a Common Standard?
Adopting a single metadata standard ensures that metadata on federal government Web sites will be
coordinated, cost-effective and client-focused. In the context of the Government of Canada, a single
standard makes it possible for Canadian citizens and federal employees to search the Government of
Canada domain for Web information without needing to know which government department or agency
produced it.
A common standard also makes it possible for systems to communicate the existence and characteristics
of electronic information resources to other electronic applications or search tools, and permits efficient
migration of information from one application or search system to another.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
1.4.
Common Look and Feel Mandatory Metadata Elements
5
In the CLF standard 6.3 , Treasury Board mandates five elements for description of Government of
Canada Web resources. These five mandatory elements are title, creator, date, language and subject.
6
In TBITS 39.1 , Treasury Board adopted the international Dublin Core metadata standard for use in the
Government of Canada. Of the sixteen (16) Dublin Core metadata elements, the 5 required by CLF are
mandatory, the rest are optional.
The five mandatory elements are intended only as a starting point for federal organizations using
metadata as part of their information management strategy. Expanding the metadata element set beyond
the mandatory five to include such elements as "coverage", "description", and "type" is encouraged, as is
the use of local elements, provided that evolving guidance from the Government of Canada and the
Treasury Board Secretariat is followed. Canadians will benefit from improved resource discovery, and
Web site managers will benefit from the improved information management capabilities provided by
including additional resource characteristics in resource descriptions.
The five mandatory elements are described below. Please note that detailed guidance for each element
may be found in the Developers' section, and in IMRC “Common Look and Feel Metadata Standard
Definitions and HTML Examples" at http://www.cio-dpi.gc.ca/im-gi/meta/clf-nsi-meta/clf-nsi-meta_e.asp.
Title
"Title" specifies the title given to a resource. When a title is ambiguous or unclear, an enhanced title may
be developed. The Dublin Core name for this element is <dc.title>.
Creator
"Creator" specifies the name of the department or agency that created and maintains the resource.
Depending on local convention, this name may include an organizational hierarchy including the
responsible unit as well. The Dublin Core name for this element is <dc.creator>.
Date
"Date" specifies one of two dates:
The Dublin Core element <dc.date.created> is used to identify the date the resource was first posted on
the Web. This element is mandatory.
The Dublin Core element <dc.date.modified> is used to identify the date a substantially revised version of
the resource was last posted. This element need be included only when the intellectual content of the
resource has been substantially modified.
Language
"Language" indicates the language of the resource. The Dublin Core name for this element is
<dc.language>.
Subject
"Subject" contains one or more words or phrases (descriptors) selected from an authorized controlled
vocabulary to describe the subject of the intellectual content of the resource. The Dublin Core name for
this element is <dc.subject>.
Choosing descriptors from a controlled vocabulary ensures that documents are indexed and can be
searched using consistent terminology (e.g., everything about aircraft, airplanes or planes will be grouped
together using a preferred term, "aircraft").
5
See CLF - Navigation and Format Standards and Guidelines - Standard 6.3 at http://www.cio-dpi.gc.ca/clfupe/6/6a_e.asp#6.3).
6
See TBITS 39: Treasury Board Information Management Standard, Part 1: Government On-Line Metadata
Standard at http://www.cio-dpi.gc.ca/its-nit/standards/tbits39/crit391_e.asp.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
7
Under TBITS 39.2 , Treasury Board has mandated the Government of Canada Core Subject Thesaurus
(CST) as the authorized controlled vocabulary to describe subject(s) of Government of Canada Web
resources. This thesaurus was developed and is maintained by the Depository Services Program,
Communication Canada (http://dsp-psd.pwgsc.gc.ca/index-e.html) specifically to provide subject access
to Canadian government information.
8
Departments and agencies are urged to use CST descriptors in the subject metadata element for
metatagged information resources. Using CST terms will enhance the utility of departmental information
resources on Government of Canada portals.
1.5.
Optional Metadata Elements
Guidance is provided on the use of optional Dublin Core metadata elements "coverage", "description",
and "type". Future editions will cover the elements "audience", "format", and possibly more.
1.6
HTML Metatags and Title Element
Because some commercial search engines may not be configured to read and use Dublin Core meta
elements as adopted by the Government of Canada, it is highly recommended that in addition to the
mandatory Dublin Core metadata elements, the non-Dublin Core metatags <description> and
<keywords> as well as the HTML <title> tags be included in Web resource descriptions:
<title> Insert title text here </title>
<meta name="description" content="Insert description text here">
<meta name="keywords" content="Insert keywords here">
For more information, consult the Best Practices section of "CLF - Navigation and Format Standards and
Guidelines (http://www.cio-dpi.gc.ca/clf-upe/6/6a_e.asp).
The HTML <title> tag and the Dublin Core metatag <dc.title> should have the same content.
The HTML <description> metatag is a short, plain language résumé of the intellectual content of the
resource. Some search engines display this description in search results. Users use this information to
determine which items in the search results most closely match their needs. If no description metatag is
included in the resource, a search engine may display the first words that appear on the Web page (often
the text of navigation links).
The HTML <keywords> metatag content should be additional words or phrases to serve as access points
for search engines, and may be useful in the context of your local information management system.
Keyword values are often natural language terms that are not part of a controlled vocabulary. Whether
and how keywords are used depends on your local metadata access policies and search engine.
Guidance on using keywords will be available soon.
7
8
See http://www.cio-dpi.gc.ca/its-nit/standards/tbits39/crit392_e.asp
See http://en.thesaurus.gc.ca
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Chapter 2
Guidance For Metadata Developers
2.1 General Guidance
As a metadata developer you will be expected to identify the information components necessary to
describe your Web resources. The Common Look and Feel (CLF) Standard for Internet requirements
are for five metadata elements: title, creator, date, language and subject.
How you will populate these elements depends on what policies have been established by the
Information Management / Information Technology (IM/IT) sections of your organization, and the
information system(s) used. Although it is likely that you will input the information using some form of
graphical interface, on an HTML page the metadata will look like this (the metadata values are added in
the spaces between the double quotation marks):
<link rel="schema.dc" href="http://purl.org/dc/elements/1.1/">
<meta name="dc.title" content=" ">
<meta name="dc.creator" content=" ">
<meta name="dc.date.created" content=" ">
<meta name="dc.date.modified" content=" ">
<meta name="dc.language" scheme="ISO639-2" content=" ">
<meta name="dc.subject" scheme="gccore" content=" ">
Depending on the policies of your department or agency, additional metadata elements may be required.
The preceding template is the same for all Web pages whether they are in English only, in French only, in
both official languages, or in any combination of English, French and any other language.
When a Web page contains text in more than one language, the title, creator, language and subject
elements are repeated so that there are separate elements for content in each language. It is
unnecessary to repeat the date elements as the content is a numeric value.
For some elements (e.g. "subject", "coverage" and "type"), content MUST be selected from an authorized
list of values. The lists from which values can be selected are available on the National Library of
Canada Canadian Government-maintained Controlled Vocabularies and Thesauri Web page at
http://www.nlc-bnc.ca/8/4/r4-281-e.html.
Note that you MUST include the name of the scheme in the metadata tag. Guidance on which schemes
to use for which metadata elements is presented below.
2.2 Guidance Applicable to All Metadata Elements
2.2.1. Metadata Element Structure
Metadata elements must always appear as they are shown in the examples.
Follow the examples as to the use of upper and lower case, quotation marks and angle brackets.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.2.2. Syntax
In the context of metadata, syntax refers to the rules for the construction of metadata elements.
The basic syntax takes the form:
<meta name="abc" content="xyz">
where abc is the element name (e.g. dc.title), and xyz is the information pertaining to the element (e.g.
the title of a resource). The content information must be in the language of the resource being described.
If more than one language is used, the element is repeated for each language.
When a scheme is required as part of the element, the syntax takes the form:
<meta name="abc" scheme="def" content="xyz">
where abc is the element name (e.g. dc.subject), def is the label of a recognized scheme (e.g. gccore in
the case of the Government of Canada Core Subject Thesaurus), and xyz is the information pertaining to
the element (e.g. the subject term or terms selected to describe what the resource is about). If content
from more than one scheme is used, the element must be repeated for each scheme.
This syntax is NEVER translated. Although the words such as name and content appear to be English,
they are in fact internationally recognized machine-readable codes, which, if modified, would not be
recognized by search engines.
Additionally, the content of meta "name" is a code and never translated. For example, do not use
<name="dc.sujet> on French pages. <dc.subject> is the correct syntax regardless of the language of the
content of the resource.
2.2.3. Use of the Validation Statement
The following statement is necessary for machine validation of the Dublin Core schema and must appear
once in the <head> of every Web resource:
<link rel="schema.dc" href="http://purl.org/dc/elements/1.1/">
For further information on the technical aspects of this coding, consult: "Encoding Dublin Core in HTML"
by John Kunze (http://www.ietf.org/rfc/rfc2731.txt).
2.2.4. Use of Upper and Lower Case
As a general rule, upper and lower case characters may be used ONLY in the content of
<dc.title>
<dc.creator>
<dc.subject>
in the mandatory Dublin Core elements.
Terms selected from authorized controlled vocabularies must be exactly as they appear in the controlled
vocabulary being used, including case and punctuation.
2.2.5. Proper Use of Schemes
Schemes are authorized lists of values from which content for metadata tags may be selected.
DO NOT use values from a scheme identified for use with one element as the content of another element.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
For example, it would be wrong to use a term from the Core Subject Thesaurus (the default scheme for
use in <dc.subject>) in the <dc.coverage.spatial> element. Only values from schemes registered for use
in <dc.coverage.spatial> should be used as content for this element.
Because schemes are uniquely linked to their elements, misuse will have a negative impact on the
reliability of search results and defeat the purpose of using metadata at all.
2.2.6. French Accents
Be consistent.
As is the case with Web document content in general, metadata content can include an accented
character (e.g., "é") or a character entity equivalent (e.g., "&eacute;"). Although using the character entity
equivalent provides an extra measure of assurance that the accented character will be widely
decipherable, using the accented character itself is usually sufficient since most modern browsers and
search engines interpret and display the ISO Latin-1 Character Set.
2.2.7
Abbreviations, Acronyms and Initialisms
Generally, do not use.
Refer to the guidance for specific metadata elements. Depending on the element, these shortened
versions are either not permitted, or are permitted provided that certain rules are observed.
2.2.8
Description vs. Evaluation
Do not editorialize. Metadata describes Web resource content; it does not evaluate or comment on that
content.
2.2.9
Assigning Roles and Responsibilities
Each department will decide whether metadata developers will provide all or just some of the required
metadata content, how the progress of metadata development will be tracked and verified.
Departments have considerable latitude in selecting the content for HTML metatags such as
<description> and <keywords>.
2.3 Creator – CLF Required Element
2.3.1
Definition
The government department or agency primarily responsible for making the content of the resource.
2.3.2
Usage
The <dc.creator> element is expressed in the form:
<meta name="dc.creator" content="abc">
where abc is the name of the responsible department or agency.
2.3.3
Guidance
All departmental or agency names must begin with "Government of Canada" followed by the
departmental or agency name (e.g. Government of Canada, Natural Resources Canada).
For English-only, or French-only Web resources, use:
<meta name="dc.creator" content="ddd">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
where ddd represents EITHER the name of the department or agency (preceded by "Government of
Canada") OR includes the full bureaucratic hierarchy of the responsible organizational unit.
If more than one department or agency is responsible for the information resource, repeat the
<dc.creator> element as many times as necessary to specify every responsible organization.
On bilingual pages, repeat the element for the creator name in English and the creator name in French.
2.3.4
Values
Depending on local metadata policies and operational requirements, the department or agency name
may be followed by the name of a major organizational division, which in turn may be followed by the
name(s) of additional organizational sub-divisions in a hierarchical chain, ending with the name of the
entity responsible for creating the Web resource.
2.3.4.1 Authorized sources of terminology
In order to ensure successful retrieval of Web resources, it is very important that the form of the name of
the originating department or agency be identical in all Government of Canada metadata records.
Authorized sources for organizational names must be used.
For department/agency names, use the current form of name as provided in the Treasury Board of
9
Canada Secretariat Titles of Federal Organizations . This information is also available in French as
10
Titres des organismes fédéraux .
For names of organizational divisions and sub-divisions, use the form of name that appears in the
Government Electronic Directory Services (GEDS) (http://direct.srv.gc.ca/cgi-bin/direct500/BE).
2.3.4.2 Abbreviations, Acronyms and Intitialisms
Do not use abbreviations for organizational components (e.g. Use Atomic Energy of Canada Limited, not
AECL).
2.3.4.3 Changes of name
Always use the current form of creator name for your department or agency. Do not retrospectively
change the creator name assigned to static and unchanging resources such as annual reports and other
legacy documents. Such resources were created by the organization under its name at the time of
publication, and metadata should not be updated to reflect subsequent name changes.
2.3.4.4 Punctuation
Separate levels in the organizational hierarchy with a comma (",") or a period ("."). Use one or the other
consistently, following the guidelines established by your department or agency.
9
See http://www.tbssct.gc.ca/Pubs_pol/sipubs/TB_FIP/titlesoffedorg_e.html.
See http://www.tbs-sct. gc.ca/Pubs_pol/sipubs/TB_FIP/titlesoffedorg_f.html
10
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.3.5
Examples
2.3.5.1 For a resource in English
The content of <dc.creator> is" Government of Canada, Natural Resources Canada".
Expressed in HTML as :
<meta name="dc.creator" content="Government of Canada, Natural
Resources Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
The responsible entity is the Geological Survey of Canada, a subdivision of Natural Resources Canada.
The content of <dc.creator> is "Government of Canada, Natural Resources Canada, Earth Sciences
Sector, Geological Survey of Canada". This text represents the full bureaucratic hierarchy of the unit
responsible for the page. The full hierarchy is applied because some Web users may believe the
Geological Survey of Canada to be an independent entity.
Expressed in HTML as:
<meta name="dc.creator" content="Government of Canada, Natural
Resources Canada, Earth Sciences Sector, Geological Survey of
Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.3.5.2 For a resource in French
The responsible entity is Ressources naturelles Canada.
The content of <dc.creator> is "Gouvernement du Canada, Ressources naturelles Canada".
Expressed in HTML as:
<meta name="dc.creator" content="Gouvernement du Canada,
Ressources naturelles Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
The responsible entity is the Commission géologique du Canada, a subdivision of Ressources naturelles
Canada.
The content of <dc.creator> is "Gouvernement du Canada, Ressources naturelles Canada; Secteur des
sciences de la Terre; Commission géologique du Canada". This text represents the full bureaucratic
hierarchy of the unit responsible for the page. The full hierarchy is applied because some Web users may
believe the Geological Survey of Canada to be an independent entity.
Expressed in HTML as:
<meta name="dc.creator" content="Gouvernement du Canada,
Ressources naturelles Canada, Secteur des sciences de la Terre,
Commission géologique du Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.3.5.3 For a bilingual Web resource
The content of <dc.creator> is BOTH "Library and Archives of Canada" AND "Bibliothèque et archives du
Canada". On bilingual pages, repeat the element for the creator name in English and the creator name in
French.
Expressed in HTML as:
<meta name="dc.creator" content="Government of Canada, Library
and Archives of Canada">
<meta name="dc.creator" content="Gouvernement du Canada,
Bibliothèque et archives du Canada">
2.4 Date – CLF Required Element
2.4.1
Definition
A date associated with an event in the life cycle of the resource.
2.4.2
Usage
Two date-related metadata elements are required for Web resources:
<dc.date.created> contains the date the resource was first posted on the Web.
<dc.date.modified> is used for the most recent date a substantially revised version was re-posted on the
Web.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.4.3
Guidance
DO NOT use <dc.date> metadata elements for Web resources whose content is always changing (e.g.
Web pages generated dynamically from a database).
For Web resources whose content is not subject to continuous change (i.e. "static" HTML Web pages),
a)
Use <dc.date.created> when the resource is first posted on the Web; and
b)
When the intellectual content of the resource has been substantially changed, add
<dc.date.modified> with the date of re-posting.
These rules apply whether the Web resource is English, French or bilingual.
If necessary because of technical requirements of local systems or processes, it is permissible to add the
<dc.date.modified> tag at the same time as the <dc.date.created>. In this case, at the time of initial
posting, add both date elements, but DO NOT include a date in the content of <dc.date.modified>.
Instead, insert a single blank character between the quotation marks following content= as follows:
<meta name="dc.date.modified" content=" ">
See Section 4.2 of the Managers Section for an illustration of this option.
2.4.4
Values
The <dc.date> elements are expressed as YYYY-MM-DD, where YYYY is the year, MM is the month of
the year between 01 (January) and 12 (December), and DD is the day of the month between 01 and 31.
If only the year and month are known, enter "01" for the day, e.g., 2003-04-01.
If only the year is known, enter "01" for both the month and the day, e.g., 2003-01-01.
Since automated information retrieval systems retrieve on all eight digits of the international date,
constructing dates in this manner will ensure the systems' ability to search and retrieve on either month
and year, or year only.
The <dc.date.created> value NEVER CHANGES.
2.4.5
Examples
For a Web resource first posted on November 20, 2002 and last substantially modified on May 6, 2003
•
On the original posting date, the contents of <dc.date.created> is the date of posting.
<meta name="dc.date.created" content="2002-11-20">
•
Any time after that date when punctuation or typographical errors are corrected and the resource is
re-posted, the <dc.date.created> value is not changed.
<meta name="dc.date.created" content="2002-11-20">
•
When the resource is re-posted with intellectual content substantially changed, <dc.date.modified> is
added and contains the date of posting of this substantially changed version; the <dc.date.created>
value is not changed.
<meta name="dc.date.created" content="2002-11-20">
<meta name="dc.date.modified" content="2003-05-06">
NOTE: As noted above, depending on the requirements of local systems and/or processes it may be
necessary to include <dc.date.modified> when the metadata for the resource is first created. The
organization's metadata strategy should define the approach followed. Apply it consistently.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.5 Language – CLF Required Element
2.5.1
Definition
The language of the intellectual content of the resource.
2.5.2
Usage
The element is expressed in the form:
<meta name="dc.language" scheme="ISO639-2" content="abc">
where abc is a language code, and ISO 639-2 refers to the language coding scheme mandated for use in
the Government of Canada domain.
2.5.3
Guidance
If the Web resource is in one language, enter the appropriate language code.
If the resource is available in more than one language, repeat the <dc.language> element using the
corresponding code for each language.
2.5.4
Values
The language codes are three-character lower-case alphabetic strings, usually based on the first three
letters of the English form
English
eng
French
fre
Codes for other languages may be found in "Codes for the Representation of Names of Languages"
(http://lcweb.loc.gov/standards/iso639-2/englangn.html).
2.5.5
Examples
2.5.5.1 English-only Web resource
Use:
<meta name="dc.language" scheme="ISO639-2" content="eng">
2.5.5.2 French-only Web resource
Use:
<meta name="dc.language" scheme="ISO639-2" content="fre">
2.5.5.3 Bilingual Web resource (i.e both English and French content)
Use:
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
<meta name="dc.language" scheme="ISO639-2" content="eng">
<meta name="dc.language" scheme="ISO639-2" content="fre">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.5.4.4 Multilingual Web page (e.g. English, French and German)
Use:
<meta name="dc.language" scheme ="ISO639-2" content="eng">
<meta name="dc.language" scheme ="ISO639-2" content="fre">
<meta name="dc.language" scheme ="ISO639-2" content="ger">
2.6 Subject – CLF Required Element
2.6.1
Definition
The topic of the content of the resource.
2.6.2
Usage
The subject element is expressed in the form:
<meta name="dc.subject" scheme="abc" content="def">
where abc refers to an authorized list of controlled vocabulary terms, and def refers to one or more values
chosen from that vocabulary list.
Multiple subjects selected from the same scheme can be used in the element; values should be
separated by a semi-colon (";"). A semi-colon is used as a separator rather than a comma because a
controlled subject term may include a comma within the term.
2.6.3
Guidance
Follow the guidance established by your organization's metadata strategies and policies to assign values
to <dc.subject>.
2.6.4
Values
Always select terms that describe what the resource is about. Be aware that the title does not always
reflect the content of a resource. For example, the book "Men are from Mars, Women are from Venus" is
about male-female communication, not astronomy.
Always keep the user in mind. Ask yourself, "f I were searching for information using this term, would I
want to retrieve this particular resource?"
Be specific; do not use terms that are too broad, or too narrow. Using a broad term (e.g. "Oceans") to
describe a resource on a more specific topic (e.g. "Pacific Ocean") would mean that users searching for
information on "Oceans" would retrieve all information about all oceans – a possibly overwhelming set,
not directly relevant to their search. Similarly, terms for individual oceans should not be used for a
resource focusing on "Oceans"' as a whole, even if particular oceans are mentioned.
Use as many terms as required to accurately describe the resource. Trying to be too specific will result in
the terms selected only covering a small portion of the content of the resource. Remember to ask
yourself, "If I were searching for information using this term, would I want to retrieve this particular
resource?"
If your department uses more than one authorized controlled vocabulary list for dc.subject, a separate
dc.subject tag must be used for each vocabulary.
For example, do not include terms from the e-Health Thesaurus in a metatag containing terms from the
Core Subject Thesaurus, and vice versa. Precede each content= value with the registered scheme name
of the thesaurus.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.6.5
Examples
2.6.5.1 English-only Web resource using terms from the Core Subject Thesarus
http://www.nrcan-rncan.gc.ca/inter/index_e.html
<meta name="dc.subject" scheme="gccore" content="Natural
resources; Climate; Forests; Minerals; Metals; Maps; Energy;
Energy conservation; Geology; Environmental management">
2.6.5.2 French-only Web resource described using terms from the Core Subject Thesaurus
http://www.nrcan-rncan.gc.ca/inter/index_f.html
<meta name="dc.subject" scheme="gccore" content="Ressources
naturelles; Climat; Forêt; Minéral; Métal; Carte; Énergie;
Conservation de l'énergie; Géologie; Gestion de
l'environnement">
2.6.5.3 Bilingual Web resource described using terms from the Core Subject Thesaurus
http://www.nrcan-rncan.gc.ca/inter/index.html
<meta name="dc.subject" scheme="gccore" content="Natural
resources; Climate; Forests; Minerals; Metals; Maps; Energy;
Energy conservation; Geology">
<meta name="dc.subject" scheme="gccore" content="Ressources
naturelles; Climat; Forêt; Minéral; Métal; Carte; Énergie;
Conservation de l'énergie; Géologie">
2.6.5.4 Multiple vocabularies to describe a single Web resource
2.6.5.4.1 English-only Web resource described using terms from both the Core Subject Thesaurus and
the eHealth Thesaurus
<meta name="dc.subject" scheme="gccore" content="[one or more
English terms from the Core Subject Thesaurus]">
<meta name="dc.subject" scheme="gcicth" content="[one or more
English terms from the e-Health Thesaurus]>
2.6.5.4.2: French-only Web page described using terms from both the Core Subject Thesaurus and the
eHealth Thesaurus
<meta name="dc.subject" scheme="gccore" content="[one or more
French terms from the Core Subject Thesaurus]">
<meta name="dc.subject" scheme="gcicth" content="[one or more
French terms from the eHealth thesaurus]>
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7 Title – CLF Required Element
2.7.1
Definition
The name given to the resource.
2.7.2
Usage
The <dc.title> element is expressed in the form:
<meta name="dc.title" content="abc">
where abc is the title of the resource.
2.7.3
Guidance
The title is transcribed from the authoritative source. For Web resources, this is the resource itself.
On individual Government of Canada Web sites, no two pages should have identical titles.
When evaluating or formulating a title, keep it as brief as possible, spell out any acronyms, and avoid
abbreviations.
In the case of bilingual Web pages (i.e. "Welcome" or "splash" pages), DO NOT place the English and
French titles in the content of a single <dc.title> element; make TWO separate <dc.title> element entries,
one containing the English title and one containing the French title.
2.7.4
Values
To determine the content for <dc.title>, take the following steps in the order presented:
1. If the Web page of the resource has an HTML title (contents of the coloured bar at the top of the
browser window, or select View-Source (Internet Explorer) or View-Page Source (Netscape) too see
the content of the HTML <title> tag in the coded version of the page),
Web page HTML <title:
About us – XYZ
Content for <dc.title>:
About us – XYZ
Where XYZ is the name of a department or agency, or of an organizational unit within it.
2. If there is no HTML <Title> tag, use a prominent textual (i.e. non-graphic) heading displaying at the
beginning of the page contents, use this textual heading as the title.
HTML <title> :
<blank>
Web page textual heading:
About us – XYZ
Content for <dc.title>:
About us – XYZ
Where XYZ is the name of a department or agency, or of an organizational unit within it.
3. If there is no prominent text heading at the beginning of the Web page contents, but the page
contains a title graphic, use the text provided by the graphic as the resource title.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
4. If the HTML <title> tag, textual heading or title graphic do not provide enough information to uniquely
identify the resource, the <dc.title> content may be enhanced to provide context.
HTML <title>, Web page textual
heading or graphic:
About us
Enhanced content for <dc.title>
About us – XYZ
Where XYZ is the name of a department or agency, or of an organizational unit within it.
5. If by following these rules two or more pages are assigned the same title, create a brief qualifier to
differentiate between the pages.
HTML <title> tag, Web page
textual heading or graphic:
National Research Council Canada
Qualified content for <dc.title>:
National Research Council Canada Home page
National Research Council Canada Main search page
6. If none of these sources provides title information, contact the person responsible for the Web
resource for guidance; if no suitable guidance is forthcoming, create a suitably informative title for use
in <dc.title>.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7.5
Examples
2.7.5.1 HTML <title> as source for the title of the Web page
Use the content of the HTML <title> (in the coloured bar at top of browser screen).
The content of <dc.title> is "CSA - Home Page of Canadian Space Agency"
Expressed in HTML as:
<meta name="dc.title" content="CSA – Home Page of the Canadian Space
Agency">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
The HTML <title> tag contains good, specific identifying information.
The content of <dc.title> is "Environment Canada's Green Lane - About Environment Canada"
Expressed in HTML code as:
<meta name="dc.title" content=" Environment Canada's Green Lane About Environment Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7.5.2 HTML <title> tag contains insufficient information
The HTML <title> tag "Government" provides no context. Enhancement from the graphical title will be
necessary.
The content of <dc.title> is enhanced with the organization name to be "National Archives of Canada –
Government".
Expressed in HTML code as:
<meta name="dc.title" content=" National Archives of Canada –
Government">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7.5.3 HTML <title> conflicts with other sources of title information
The HTML <title> tag "Electronic Commerce Task Force" differs from the textual heading "Electronic
Commerce in Canada" which is the same as the graphical title "Electronic Commerce in Canada".
The content of <dc.title> uses information from both sources to create an enhanced title, "Electronic
Commerce Task Force - Electronic Commerce in Canada".
Expressed in HTML code as:
<meta name="dc.title" content="Electronic Commerce Task Force Electronic Commerce in Canada">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7.5.4 HTML <title> and graphic title differ
Neither the HTML <title> nor the graphical title contains sufficient information to identify the resource.
The content of <dc.title> is "Finance Canada – About the Department".
Expressed in HTML code as:
<meta name="dc.title" content="Finance Canada – About the
Department">
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.7.5.5 Bilingual "Welcome" Page
HTML title and graphical title indicate that the title is "Library and Archives of Canada / Bibliothèque et
archives du Canada".
The content of <dc.title> is
English: Library and Archives of Canada
French: Bibliothèque et archives du Canada
Expressed in HTML code as:
<meta name="dc.title" content="Library and Archives of Canada">
<meta name="dc.title" content="Bibliothèque et archives du
Canada">
2.8 Geographic Coverage – Optional Element
2.8.1
Definition
Used to document the geographic extent of the content of a resource.
2.8.2
Usage
The geographic coverage element is expressed as
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
<meta name="dc.coverage.spatial" scheme="abc" content="def">
where abc is the label for an authorized list of geographic names and def is a specific location chosen
from the authorized source, or a set of geographic coordinates.
2.8.3
Guidance
Follow the guidance established by your organization's metadata strategies and policies to assign values
to <dc.coverage.spatial>.
Guidelines for use of specific encoding schemes are available at http://publiservice.cio-dpi.gc.ca/mwggtm/gcs-scg/docs/docs_e.asp.
2.8.4
Values
Repeat <dc.coverage.spatial> as needed, using the appropriate scheme name for each location
described in the metadata.
Use specific location information.
Do not use "Canada" as a default. There is no advantage in populating coverage with a value of
"Canada" when a resource is implicitly "Canadian". Canada is a valid descriptor when the resource
explicitly pertains to all of Canada (e.g. a map of Canada). Otherwise, it should not be used as a default
value.
In some cases, it could be appropriate to apply the same term to both the coverage and subject metadata
elements, but only if the resource is actually about the geographic area in question, and if the subject
element were selected from an authorized list of terms.
2.8.5
Examples
Using GeoNames, <dc.coverage.spatial> for a resource describing plans for a national historical site in
Halifax would be expressed as:
<meta name="dc.coverage.spatial" scheme="gcgeonames"
content="Canada; Nova Scotia; Halifax">
The <dc.coverage.spatial> for a resource that contains employment opportunity information for
Peterborough and North Bay would be expressed as:
<meta name="dc.coverage.spatial" scheme="gcgeonames"
content="Canada; Ontario; Peterborough">
<meta name="dc.coverage.spatial" scheme="gcgeonames"
content="Canada; Ontario; North Bay">
Note that each geographic location is in a separate dc.coverage.spatial element.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.9 Description – Optional Element
2.9.1
Definition
A narrative account of the content of the resource.
2.9.2
Usage
The <dc.description> element takes the form:
<meta name="dc.description" content="abc">
where abc is the description of the resource.
2.9.3
Guidance
An accurate, clear, concise and objective description assists users in determining whether a resource will
be useful to them.
The words in a description can be indexed by search engines and displayed in search results. As a
general rule, descriptions should be no more than 50-80 words in length. Because only the first few lines
of a description may be displayed in a listing of Web resources, it is important that the most meaningful
information be contained in the first few sentences.
2.9.4
Values
2.9.4.1 Preparing a description.
•
Determine the key concepts, characteristics and significant content of the resource by reviewing it in
total (if it is of practicable length), or by reading the table of contents, prefatory material, introductions
to sections and conclusions. Use any existing summary of the resource as a starting point.
•
Draft a descriptive text, keeping the following tips (below) in mind:
•
•
•
•
•
Review and edit the draft text with a view to:
•
•
•
•
Begin with a verb (e.g. "Gives..." "Presents..."), and use the objective third person point of view.
Avoid using "We provide..." or "You can find...".
Do not repeat the title. Instead, provide more detail about the contents of the resource.
Make the description understandable to a wide audience; avoid jargon and overly technical
descriptions.
Use simple and direct language, omitting unnecessary words.
Condensation – Are the essential contents of the resource reflected?
Clarity – Is the text simple, understandable at first glance?
Correctness – Are the grammar and spelling correct?
The contents of the HTML <description> metatag and of <dc.description> element content should be
identical.
2.9.4.2 Abbreviations, Acronyms and Intitialisms
These may be used, provided they are also spelled out in full.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.9.5
Examples
2.9.5.1 English-only Web resource
http://www.nlc-bnc.ca/trains/index-e.html
<meta name="dc.description" content="Contains three main
sections. Ties That Bind provides a short history of railways in
Canada and how they marketed their passenger services, providing
a context for the other two sections, Transcontinental Tour and
Tracking Time.">
2.9.5.2 French-only Web resource
http://www.nlc-bnc.ca/trains/index-f.html
<meta name="dc.description" content="Compte trois sections
principales. La section Lignes qui unissent présente un bref
historique des chemins de fer au Canada et de la manière dont
les compagnies ont commercialisé leurs services voyageurs. Cet
historique sert de cadre aux sections Circuit transcontinental
et Ligne chronologique.">
2.10 Type – Optional Element
2.10.1 Definition
The nature or genre of the content of the resource.
2.10.2 Usage
The type element is expressed in the form:
<meta name="dc.type" scheme="abc" content="def">
where abc is the label for a specific scheme, and def is a term selected from the scheme to describe the
type of resource.
2.10.3 Guidance
Type includes terms to describe general categories, functions, or genres of Web resource contents. It is
NOT the same as format (the physical manifestation of the resource).
Including a value for <dc.type> helps users find the information by giving them an additional way to
narrow their search queries. When users select a resource type along with other terms (such as title,
subject, date, etc.) in a search query, they will be able to narrow their search for an event, presentation,
report, policy, etc. For example, a user can drastically reduce the results of a subject search for
"information management" by also selecting the type "event" or "policy".
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
2.10.4 Values
It is strongly recommended that metadata developers use values from the Government of Canada Type
11
Scheme , which includes terms that describe a wide variety of resource types found on Government of
Canada Web sites.
It is registered as a controlled vocabulary by the National Library of Canada. The scheme label for the
Government of Canada Type Scheme is gctype.
When a web page is described using more than one type, repeat the type element, including the type
scheme, for each different type.
NOTE: While the Government of Canada Type Scheme includes a broad range of terms, it is not
intended to be an exhaustive list of types. Follow your department or agency metadata guidelines for use
of additional terminology.
2.10.5 Examples
2.10.5.1 English resource
The National Flag of Canada
http://www.canadianheritage.gc.ca/progs/cpsc-ccsp/sc-cs/df1_e.cfm
<meta name="dc.type" scheme="gctype" content="still image">
2.10.5.2 French resource
Le drapeau canadien
http://www.canadianheritage.gc.ca/progs/cpsc-ccsp/sc-cs/df1_f.cfm
<meta name="dc.type" scheme="gctype" content="image fixe">
2.10.5.3 Bilingual resource
Welcome page of Canadian Heritage
http://www.canadianheritage.gc.ca
<meta name="dc.type" scheme="gctype" content="welcome page">
<meta name="dc.type" scheme="gctype" content="page de
bienvenue">
11
See http://publiservice.cio-dpi.gc.ca/mwg-gtm/typ-typ/docs/schem_e.asp.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Chapter 3
Guidance for Managers
3.1 Introduction
Common Look and Feel (CLF) Standard 6.3
description of federal Web resources.
12
mandates the use of five metadata elements in the
13
Under Treasury Board Information Management Standard (TBITS) 39.1 , Treasury Board adopted the
Dublin Core metadata standard as the method of expressing the required metadata. Software and
systems acquired by the Government of Canada for the purpose of creating, editing, generating, parsing,
harvesting, extracting, indexing, browsing or displaying metadata for Web resources must support this
international standard.
This chapter is directed to those individuals or groups responsible for implementing CLF Standard 6.3 in
their federal departments or agencies, and to those who provide advice and guidance on metadata within
those organizations.
Before reading this section, please review the preceding chapters in this Guide.
14
The first version of this Guide was posted on the Web in September 2002 to provide assistance to
departments and agencies in meeting the CLF metadata implementation requirement by December 31,
2002. That deadline represented not the end, but the beginning of the process for incorporating metadata
into departmental Web sites.
The December 2002 date can be seen as a beginning because:
•
New Web resources must have metadata developed for them as they are created;
•
Since Web resources are continuously updated, metadata must be adjusted to reflect current
content;
•
As search engines (including the search engine on the Canada Site) continue to change and
improve, local systems and practices will require adjustment;
•
Organizations may decide to incorporate more than the five mandatory elements into the
metadata descriptions of their resources. The Treasury Board Secretariat will develop further
specifications for optional elements such as audience and format, and departments may require
other non-Dublin Core metadata elements;
•
Best practices will evolve as departments gain experience with metadata;
•
The Dublin Core Metadata Initiative itself will evolve;
•
Your organization's Internet / metadata strategies will evolve, leading to possible extensions and
modifications of current practices;
•
The Web is dynamic by nature. It is important to expect changes and to remain flexible and open
to new developments in Web-related technologies.
This section of the Guide aims to provide useful explanations, directives, and best practices to metadata
managers in the Canadian public service. Managers are urged to adapt the Guide to suit the particular
Information Management (IM) needs, policies, practices and technologies of their organizations.
Important sites and contacts for information and questions include:
•
Your organization's CLF Centre of Expertise for Internet (see list at http://www.cio-dpi.gc.ca/clfupe/common/cei-contact/cei-contact_e.asp);
•
Your organization's representative on the GOL Metadata Working Group (see list at
http://publiservice.cio-dpi.gc.ca/cioscripts/committees/members/member-membre_e.asp?who=/mwg-
12
See CLF - Navigation and Format Standards and Guidelines - Standard 6.3 at http://www.cio-dpi.gc.ca/clfupe/6/6a_e.asp#6.3).
13
See TBITS 39.1 http://www.cio-dpi.gc.ca/its-nit/standards/tbits39/crit391_e.asp
14
Common Look and Feel Metadata Implementation Guides (http://www.nlc-bnc.ca/cfl-cbgf/s37-4016-e.html).
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
gtm/&id=83)
If your organization does not have a representative on the Working Group, the Information
Management Division, Treasury Board Secretariat should be contacted for advice on securing
appropriate representation at [email protected];
•
Your departmental library or information resource centre;
•
The Treasury Board Secretariat's Information Management Resource Centre (http://www.ciodpi.gc.ca/im-gi/index_e.asp; questions may be sent by e-mail to [email protected]);
•
The Metadata Coordinator, National Library of Canada, at [email protected]; and
•
The Government of Canada Core Subject Thesaurus Manager at http://dsppsd.communication.gc.ca/Thesaurus/thesaurus_contact_e.html
3.2. Information Management Activities & Metadata Implementation
3.2.1 Purpose of the Metadata Requirements
Each federal organization has a unique combination of data, needs and technologies, and will have its
own requirements for metadata to support its objectives. The required metadata elements may extend
beyond the basic five mandated by CLF.
How metadata is implemented will be determined by an assessment of what information is posted on the
Web, who will access that information, and the technologies available to create and search for
information, including organizational IM / IT systems and search engine(s), the search engine of the
Canada Site and commercial search engines.
To meet these varied needs, it is important to regard the creation and maintenance of metadata as an
integral part of the development, production and review of information on the Web, and to regard
metadata as an essential element of an overall organizational Internet/intranet strategy.
If your metadata needs extend beyond the required and optional elements, consult Information
Management officials at the Treasury Board Secretariat for guidance ([email protected]) before
developing any departmental/agency or domain-specific metadata elements.
3.3
Meeting The CLF Metadata Requirement
A survey conducted by Treasury Board in 2002 identified that 76% of federal departments and agencies
have established a strategy for meeting the requirements of CLF Standard 6.3. The following
suggestions should be reviewed in the context of those strategies. Organizations that are still creating
their strategies may wish to use them as a starting point.
There are two crucial considerations:
•
Ensure roles and responsibilities are clear (i.e., who is responsible for performing which action by
which date).
•
Set realistic expectations for achievable results that can be delivered within reasonable time frames.
Don't oversell metadata as a panacea for resolving all problems related to organizational information
management.
3.3.1
Identifying and briefing key individuals
Identify the following key contacts:
• Who is accountable for meeting the overall CLF requirements?
•
Who is accountable for meeting the CLF metadata requirement?
•
Who represents the organization on the Government On-Line (GOL) Metadata Working Group?
If your organization is not represented in this group, contact the Information Management
Division, Treasury Board Secretariat for advice on securing appropriate representation
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
([email protected]). This person may also serve as the link between your organization and the
Government of Canada Core Subject Thesaurus, or may know who does.
Ensure that all sectors within the organization know who is responsible for meeting the CLF metadata
requirement, that they are informed about the process and that they understand their roles.
Provide a briefing for key personnel on the CLF metadata requirement and their respective duties. Key
personnel include Web masters and Web content creators, as well as those responsible for the following
areas:
• Communications
• Information Management (IM)
• Information Technology (IT)
• Library (or Libraries) / Information Resource Centre(s)
• Records Management
In large organizations, consider establishing a formal metadata implementation committee.
3.3.2
Assessing the technology
Understand how the organization's Web pages are created, and how best to incorporate the metadata
required by CLF into them. If there are plans for system modifications and/or replacement, become a
participant or at least an advisor in that process.
If the organization does not already have software that supports the creation and incorporation of
metadata, consult Appendix A to this Guide for options on how to add metadata to your Web-based
resources.
Get to know the people responsible for the organization's search engine(s). With their help, learn the
characteristics of the system(s), including:
•
Search engine functionality. If possible, take a training course on search engines in general, and
on the search engine used by your organization in particular;
•
How your search engine(s) handle upper and lower case characters (both?, lower case only?).
With some search engines, only search criteria in lower case or exact case will match a mixed
upper- and lower-case term;
•
How French accents are represented;
•
What information is gathered by the search engine and how it is processed;
•
Configuration options;
•
How easy or difficult it is to make modifications. Some systems are proprietary (i.e. controlled by
a commercial entity) and may be impossible or costly to modify. Others are "open source" and
may be less costly to modify.
Following the directives in Chapter 2 (Guidance for Metadata Developers), develop some test metadata
for various parts of your organization's Web site. Ask your representative on the GOL Metadata Working
Group to review these exercises and to validate the metadata using departmental search engines so as
to confirm that they are able to accept, understand and use the metadata.
3.3.3
Raising awareness
Hold information sessions for key IM/IT personnel, for senior management, and for the organization as a
whole, covering:
•
The value and benefits of metadata in general;
•
The CLF metadata requirement itself;
•
The need for metadata to be accurate, complete, consistent and up-to-date;
•
How your department will meet the CLF metadata requirement; and
•
The duties of IM/IT personnel, senior managers and the organization as a whole in meeting the
requirement.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
3.3.4
Planning
Assess the need for additional metadata elements beyond the five mandatory CLF elements. Having
determined the elements required for your organizational needs, develop an appropriate "application
profile", which lists the elements you will use locally and provides guidance in how to populate them. For
more information on departmental application profiles, contact your organization's representative on the
GOL Metadata Working Group (see Section 7), and/or the IM Division of the Treasury Board Secretariat
([email protected]).
Best practice is to complement the mandatory Dublin Core metadata elements with the non-Dublin Core
metatags <description> and <keywords> and the HTML <title> tag (see section 1.6 in the Introduction).
Determine how your organization's metadata elements will be populated (see Appendix A: Options for
Linking Metadata to Web Pages).
Decide who will create metadata and apply it to your Web resources. Options include:
•
In-house originators (authors only, authors assisted in some way, Library / Information Resource
Centre only, or a combination of these);
•
Contracting out; or
•
A combination of in-house originators and contracting out.
Decide whether it is necessary to use more than one authorized controlled vocabulary list. [For more
information on controlled vocabularies and thesauri, refer to Section 4.3].
Establish an order of priority for applying metadata to your Web resources [See "Setting Priorities",
below).
If your organization does not already have one, develop an indexing policy. For guidelines on creating
such a policy, see http://www.cio-dpi.gc.ca/im-gi/meta/index/indexpr_e.asp?format=print.
Incorporate the process of metadata creation as an integral part of the workflow in your organization's
development, production and review of information in Web-based resources. Ideally, the process will
ensure that metadata is properly specified and coded before a new resource is posted and that metadata
is verified for modified Web pages.
Develop a process to track the pages containing metadata.
Develop a mechanism for continuously tracking your organization's progress in meeting the metadata
requirements.
Develop a control process to maintain metadata quality over the long-term. Address questions such as:
Who will review which pages? How often? How will personnel be kept informed of any procedural
changes / updates?
Develop a training plan, identifying:
•
•
•
•
3.3.5
Who should be trained (i.e. trainers or the actual metadata providers);
Content (including actual examples from your department's Web site);
Duration of training; and
Logistics (facilities required, etc.).
Setting Priorities
According to the Common Look and Feel standard, the five mandatory CLF metadata elements must be
created for all Government of Canada Web resources. It is the responsibility of each department or
agency to determine how metadata will be added to its Web resources.
Use a phased approach based on resources available.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
15
According to TBITS 39.1 , as a starting point, the five mandatory Dublin Core elements must be applied
to the following kinds of pages
•
Welcome pages and Home pages (the major entry point to an institution or an organizational unit
of an institution that is likely to be perceived by the public as a distinct entity);
•
Topics/services in high demand by the community served by the institution (high-demand
services can be selected on the basis of statistics or by a subjective determination of which
documents have the greatest potential public interest);
•
Information required by the public to understand their entitlements to government assistance or
obligations;
•
Pages that provide an actual on-line service to the public (such as payment forms and application
forms);
•
Pages required to meet a prescribed legal or service obligation by the institution;
•
Entry points to specific on-line services and indexes (e.g., an entry point to a legal database);
•
Major formal publications (e.g., annual reports, corporate strategic plans, public policy and
accountability documents, etc);
•
Media releases;
•
Major entry points or indexes and menus to a range of closely related topics, programs or
policies;
•
Information about agency powers affecting the public, and manuals and other documents used in
decision making that affects the public; and
•
Substantial descriptive or marketing information about institutions, their services, activities and
collections.
Note that this does NOT mean that metadata must be added to every Web page. Often Web resources
are "cut up" into multiple files for more effective information management and/or speed of download. Each
of these sub-pages would logically contain the same metadata content (since each page is part of the
larger whole). Adding identical metadata to multiple pages will seriously compromise effective searching
as not only the main page (title page or table of contents) but all the parts would be returned in search
results, swamping searchers with too much information.
3.3.6
Implementation
Develop a metadata input template for your organization. For an example of such a template, see Natural
Resources Canada's Web site at http://www.nrcan.gc.ca/communications/clf-upe/metadata_e.html.
Templates may also be developed for various parts of the organization, with some of the mandatory
required elements (e.g. creator) pre-filled. This ensures that the full name of each organizational unit is
provided consistently, and reduces the work required.
Deliver appropriate training sessions, and maintain a list of all personnel who were trained so that they
can be informed of any changes or updates.
Work with your departmental partners on an ongoing basis to determine what other elements should be
incorporated into your Internet / metadata strategies.
Stay attuned to changes and trends affecting your departmental partners (for example, needs, policies,
structure, technology, etc.) and to developments at the government-wide level (see Staying on top of
Government of Canada Metadata Policies and Developments below).
15
See TBITS 39 Part 1: Government On-Line Metadata Standard (http://www.cio-dpi.gc.ca/itsnit/standards/tbits39/crit391_e.asp#app
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Chapter 4
Specific Metadata Elements - Considerations
4.1 Creator
The metadata policy of your organization should include guidance on the level of detail within the
organizational hierarchy to be used in describing departmental Web resources.
While use of the departmental or agency name alone is acceptable practice and is usually sufficient, the
addition of organizational levels could be helpful or necessary. For example, when a major organization is
part of a larger department or agency, and would not appear in <dc.creator> at all if the parent
organization alone were used, lower levels in the organizational hierarchy should be included.
4.1.1 Example:
The Geological Survey of Canada is part of Natural Resources Canada and would not appear in
<dc.creator> if "Government of Canada, Natural Resources Canada" alone were. The following
hierarchical structure includes its name and its hierarchical relationship to the parent organization.
<meta name="dc.creator" content="Government of Canada, Natural
Resources Canada, Earth Sciences Sector, Geological Survey of
Canada">
Because the same content for the creator element will recur in the metadata for many different Web
resources, local implementations may "pre-fill" this field as appropriate.
4.2 Date
16
Although the Dublin Core mandates the World Wide Web Consortium Date and Time Format (W3CDTF)
as the scheme for use with dates, in the Government of Canada context, inclusion of the scheme name is
optional.
Depending on the requirements of local systems or processes, one of the following two approaches may
be adopted for adding <dc.date> elements:
4.2.1 Option 1
•
On the original posting date, include <dc.date.created> only with the date of posting of the resource.
•
Any time after that date when punctuation or typographical errors are corrected, re-post the resource.
DO NOT change the content of <dc.date.created>.
•
Any time that the intellectual CONTENT of the resource is substantially changed, on re-posting, add
<dc.date.modified>, with the date of re-posting.
4.2.2 Option 2
•
On the original posting date, include <dc.date.created> with the date of posting of the resource AND
<dc.date.modified>, inserting a single blank character between the quotation marks following
content= (i.e. content=" ").
•
Any time after that date when punctuation or typographical errors are corrected, re-post the resource.
DO NOT change the content of <dc.date.created>, and DO NOT add content to <dc.date.modified>.
•
Any time that the intellectual CONTENT of the resource is substantially changed, complete the
<dc.date.modified> tag by adding the date of most recent modification.
16
See http://www.w3.org/TR/NOTE-datetime
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
4.3 Subject
Using terms from a controlled vocabulary to describe the subject or "aboutness" of Web resources
provides an additional method of accessing information produced by many different creators. Using
standardized terminology from one or more lists of controlled subject terms or thesauri allows searchers
to find resources on the same subject consistently and efficiently.
A Treasury Board survey on metadata implementation in federal organizations has revealed that
populating the subject element presents significant challenges. Namely, the correct selection of terms
from controlled vocabularies is not normally in the skill set of people responsible for managing Web
resources. Rather, it is principally a skill of professional librarians and indexers. The complexity of the
challenge of selecting terms increases when more than one controlled vocabulary is employed.
Managers are urged either to provide appropriate subject analysis training for their metadata developers
or to consider obtaining assistance from their Library / Information Resource Centres, or external
contractors, in populating this crucial field.
4.3.1 What is a Controlled Vocabulary?
A controlled vocabulary is a set of standardized words or phrases used in indexing and information
retrieval. Controlled vocabularies prescribe a single term to identify a concept, thereby reducing or
eliminating choice and, consequently, the use of synonyms. Examples include subject heading
compendia such as Library of Congress Subject Headings (LCSH) and thesauri such as the Government
of Canada Core Subject Thesaurus (CST).
A thesaurus is a specific type of controlled vocabulary arranged in a known order (not necessarily
alphabetical) in which equivalence (USE; USE FOR), homographic (variant spellings), hierarchical
(Broader Term; Narrower Term) and associative (Related Term) relationships among terms are clearly
displayed and identified by standardized relationship indicators. A thesaurus normally conforms to
international standards.
Note that dictionaries, which list one or more meanings for a given term, and unstructured, alphabetized
word lists such as glossaries, are NOT controlled vocabularies. They do not control synonyms.
Typically, a controlled vocabulary is designed to be used in a specific context. Prefatory material or other
instructions for use should be carefully considered before a given vocabulary is adopted. For example,
terminology from the Library of Congress Subject Headings, a standardized vocabulary used in library
catalogues, is intended to be applied with sub-divisions of various kinds (form, geographic, etc.). Thesauri
may have various forms of term arrangement and display and may be pre-coordinated (entries include
base terms as well as modifiers and sub-divisions) or post-coordinated (terms are not linked together
semantically; one or more individual terms is used by indexers to build an overall description of the
subject).
For more information on using a controlled vocabulary, consult your departmental library or the FAQ
section of "Implementing a Controlled Vocabulary" at the National Library of Canada Web site
http://www.nlc-bnc.ca/8/4/r4-294-e.html.
4.3.2 Sources of Controlled Terminology for <dc.subject>
The authorized controlled vocabulary for federal government departments and agencies is the
Government of Canada Core Subject Thesaurus (CST) at http://thesaurus.gc.ca. Other authorized
vocabularies may be used, provided that a separate subject element is created for each vocabulary set
employed.
The CST does not include specialized terminology used in specific and limited disciplines. It may be
necessary to select additional terms from one or more specialized controlled vocabularies that are
registered with the National Library of Canada. Vocabularies registered with the Dublin Core Metadata
Initiative (http://dublincore.org) may also be used provided that equivalent terms in both official languages
are available (see Section 4.3.4).
If your department or agency uses one or more authorized vocabularies other than the CST, it is highly
recommended that one or more CST terms be selected as content for a separate <dc.subject> tag.
Doing so allows for the consistent application of subject terms that apply to multiple federal organizations.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Detailed guidance on the use of the CST is available at http://thesaurus.gc.ca. For generic information on
the selection of subject terms, see Guidelines for Indexes and Related Information Retrieval Devices by
James D. Anderson (http://www.niso.org/standards/resources/tr02.pdf).
For more information on thesauri, consult the Thesauri and Controlled Vocabularies pages on the
National Library Web site at http://www.nlc-bnc.ca/8/4/r4-280-e.html), or contact the Metadata
Coordinator, National Library of Canada, at: [email protected].
4.3.3 If the Core Subject Thesaurus is not deemed suitable
1. Every effort should be made to use at least one term from the CST to describe Government of
Canada Web resources.
2.
If the CST generally answers you vocabulary needs, but you have identified a set of terms which are
not in the tool, contact the Thesaurus Manager at http://dsppsd.communication.gc.ca/Thesaurus/thesaurus_contact_e.html to discuss how your terminology
needs can be met.
3. If the CST, because of its general nature, is judged as not suitable, consult the List of Canadian
Government Maintained Controlled Vocabularies on the National Library of Canada Web site to see
what other vocabularies are registered for use in the gc.ca domain. The list, available at
http://www.nlc-bnc.ca/8/4/r4-281-e.html, includes information about the vocabulary label names for
use in the coding of <dc.subject> scheme component, with links to the online version of the
vocabularies.
If you use terminology from a controlled vocabulary other than the CST, you MUST use the scheme
label for that vocabulary in the <dc.subject> meta element.
4. If none of the vocabularies registered in the Government of Canada domain suits your terminology
needs, consult the forthcoming list of controlled vocabularies registered by the Dublin Core at
http://dublincore.org/.
Please note, however, that only vocabularies in this registry with equivalent terms in both official
languages will be acceptable for use. "Equivalent terms" is used instead of "translations" as there are
instances when the term for a concept in one language does not have an exact match in the other
(see Section 4.3.4).
5. If none of these vocabularies meets your needs, an in-house solution may be the best option.
Alternatives:
•
•
•
•
Build your own bilingual thesaurus. This will require help from an expert as a thesaurus is
highly structured and built according to established criteria and standards with relationships
among terms clearly displayed and identified by standardized relationship indicators.
Develop a customized controlled vocabulary for your subject domain. A controlled vocabulary
is less rigorous than a thesaurus. HOWEVER, synonyms MUST be controlled (i.e. nonpreferred terms point to the preferred terms), the most basic form of terminology control.
Vocabularies developed in-house for use in <dc.subject> must be registered with the National
Library.
The terms in such locally controlled vocabularies can be mapped to more general terms in
the CST so that both specific terminology and more general CST terminology are part of the
metadata. For example, the in-house vocabulary term "aquacultural biotechnology" could be
mapped to "Biotechnology", found in the CST. The metadata would contain two dc.subject
elements, one for the broad CST term, one for the specific local vocabulary term.
For advice on building or registering a vocabulary, consult the Metadata Co-ordinator at the
National Library (819 994-6889; [email protected]).
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
4.3.4 Language Equivalents
When choosing subject terms for a resource available in both official languages, assign English terms to
the English version, and French to the French version.
For each term in one official language there must be an equivalent term (or terms) in the other. This is
straightforward when selecting terms from vocabulary lists with English and French equivalents, but is
more challenging when selecting from one list in one official language, and from a different list for the
other. Until formal guidance is available, contact the Treasury Board Secretariat Information Management
Division at [email protected] for advice.
4.3.5 When Using Multiple Vocabularies
If more than one authorized vocabulary is used, a separate subject metatag is required for each
vocabulary. Each subject metatag must include the scheme label assigned to the vocabulary. The labels
are part of the vocabulary descriptions in the registry of controlled vocabularies at the NLC Web site
(http://www.nlc-bnc.ca/8/4/r4-281-e.html).
See the Developers section for an example.
4.4
Title
Encourage authors to assign information-rich titles to aid in the resource discovery process.
Develop a process to ensure that no two pages have identical titles.
4.5
Geographic Coverage
4.5.1 Why use coverage
Spatial information contained in the coverage element can be used to perform powerful searches that
allow a user to search on a specific location, exclude certain areas, sort by nearest location, and perform
many other operations. Coverage can be used to describe the geographic aspects of a resource not
normally included under <dc.subject> or <dc.description>.
Coverage should be implemented when the content of the resource is applicable to a place, as in the
following examples:
•
•
•
Employment opportunities for a certain region (information that is organized by area)
Environmental impact assessment for a watershed (information about an area)
Sites of historical significance (location information).
4.5.2 Values for coverage
Coverage should be populated using a controlled vocabulary, authority file or other registered scheme.
The most appropriate choice of vocabulary depends on the resource being documented. More than one
vocabulary may be used to describe a single resource.
GeoNames – The Canadian Geographic Names Database (CGNDB)
Canadian place names.
17
is the preferred vocabulary for
Controlled vocabularies or authority files authorized for use with <dc.coverage.spatial> are registered with
18
the National Library of Canada (NLC) . Departments are encouraged to register other standardized
vocabularies that meet their particular requirements with the NLC. Vocabularies are registered under a
unique scheme name that must be stated in every instance of <dc.coverage.spatial>.
17
18
-See http://geonames.nrcan.gc.ca/index_e.php
See http://www.nlc-bnc.ca/8/4/r4-281-e.html.
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Place names should be used in preference, or in addition to, geographic coordinates. Place names
convey more meaning to a person. Named geographies from known, controlled sources can be translated
to geographic coordinates via web services, such as the CGDI Gazetteer Service (http://cgdidev.geoconnections.org/prototypes/cgdigaz/).
Complete place names should be used in preference to abbreviations or symbols to improve
comprehension for the reader.
The language equivalency rule (see 4.3.4) applies for place names as well.
4.6
Description
Develop description-writing guidance for your local context, based on the Guidelines for Writing
Descriptions for Government of Canada Web Sites (http://www.cio-dpi.gc.ca/im-gi/references/metadescrip/meta-descrip00_e.asp).
4.7
Type
Organizations may request the inclusion of new resource types by contacting the Information
Management Division of the Treasury Board Secretariat at [email protected].
An organization may develop a local scheme (or schemes) to achieve a more precise description of
resource types for local indexing and retrieval needs. However, any such schemes must be registered
with the National Library. Contact the TBS Information Management Division for assistance in developing
local schemes.
4.8
References
The following is a selection of resources and style guides on writing for the Web to improve readability
and search engine results:
Guidelines for Abstracts. NISO: Bethesda, MD; 1997.
http://www.niso.org/standards/resources/Z39-14.pdf
Kilian, Crawford. Writing for the Web. Self Counsel Press, 2000.
Nielson, Jakob. Be Succinct! (Writing for the Web).
http://www.useit.com/alertbox/9703b.html
Nielson, Jakob. Microcontent: How to Write Headlines, Page Titles and Subject Lines.
http://www.useit.com/alertbox/980906.html
Nielson, Jakob. Writing for the Web. (Resource Page)
http://www.useit.com/papers/webwriting
Writing Meta Tags for Higher Search Engine Placement and Good Descriptions.
http://www.iprimus.ca/~iwb/search_engine/killer.html
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
Chapter 5
Updates To This Guide
The Metadata Implementation Guide will be updated with new information as often as needed.
Chapter 6
Staying On Top Of Government Of Canada Metadata Policies And
Developments
19
Visit the Information Management Resource Centre (IMRC) , a single-window source to information on
government metadata development, and follow the link to the metadata page.
The Treasury Board is developing a Government of Canada "application profile" to describe required and
optional metatags in greater depth than is provided in the Developers Guide. The profile should be
consulted when further guidance is necessary. The profile will be made available on the metadata page at
the IMRC site.
Chapter 7
For More Information
Metadata managers are invited to pose questions to their organization's representative on the GOL
Metadata Working Group (http://publiservice.cio-dpi.gc.ca/mwg-gtm/index_e.asp).
If the organization does not have a representative on the Working Group, the Information Management
Division, Treasury Board Secretariat should be contacted for advice on securing appropriate
representation ([email protected]). The Council of Federal Libraries may also be able to provide
assistance (http://www.nlc-bnc.ca/6/37/index-e.html).
Appendix A
Options For Linking Metadata To Web Pages
When the content of all five required Dublin Core metadata elements has been determined and approved,
it is added to the source code of the Web page it describes. An example of the source code appears in
Appendix B. The required elements have been highlighted in bold face.
This source code may reside on the server, filed permanently alongside the HTML code for the Web page
content, or it may reside in a separate file that is called into service only when the Web page is displayed
or when a search tool initiates a search. These two approaches to metadata incorporation are called
static and dynamic, respectively.
The static method involves coding the metadata directly into the Web page. There are several ways to do
this. The range of options includes:
19
•
Typing the metadata directly into the source of the Web page using a simple plain-text editor such
as Notepad.
•
Typing the metadata into a template and then using the cut-and-paste functions to transfer the
metadata into the source code of the appropriate Web page.
•
Inputting the metadata into a WYSIWYG (What You See Is What You Get) editor such as Front
Page or Dreamweaver. Consult your department's Information Technology section for options.
See http://www.cio-dpi.gc.ca/im-gi
Government of Canada Metadata Implementation Guide for Web Resources
2nd edition
If you choose to store the metadata separately from the HTML content source code of the Web page and
append it dynamically when the page is displayed, you will need to maintain a database or other external
file system for this purpose. For further guidance, consult your IT department.
Appendix B
Sample CLF Metadata Source Code
The Developers' section provides instructions on how to create and enter the required information (i.e.,
what appears between the quotes after [content =]).
Here's how the required metadata elements would appear in the source code for an English sub-site of
Health Canada:
<link rel="schema.dc" href="http://purl.org/dc/elements/1.1/">
<meta name="dc.title" content="Conferences - eHealth Resource
Centre">
<meta name="dc.creator" content="Government of Canada, Health
Canada, Office of Health and the Information Highway (OHIH)">
<meta name="dc.language" scheme="ISO639-2" content="eng">
<meta name="dc.date.created" content="2000-00-00">
<meta name="dc.date.modified" content="2003-03-17">
<meta name="dc.subject" scheme="gcicth" content="Health care;
Information and communications technologies; Telehealth; Health
informatics; Databases">
<meta name="dc.subject" scheme="gccore" content="Conferences;
Health care; Databases; Meetings">
<meta name="description" content="List of and links to various
conferences, tradeshows, and workshops related to information and
communications technologies in health">
<meta name="keywords" content="conferences, health care,
healthcare, information and communications technologies,
information and communications technology, ICT, ICTs, telehealth,
health informatics, databases, meetings, e-health, ehealth">
<title>Conferences - eHealth Resource Centre</title>
Note that <dc.title> and the HTML <title> are identical.