Guidelines for SGML Text Mark-up at the Electronic Text Center
University of Virginia
List of Tags in the TEILITE.DTD
Note: This list does not include tags in the header, which are covered in the TEI Header section.
This document is extracted from the TEI Guidelines.
The global attributes are as follows:
n="1" : a number or word/phrase
id="xyz" : a string of characters and/or numbers: must begin with a
character and must be unique within a work
rend="italics" : typographical rendering, e.g. italics
lang="eng" : the language of the material within the tag pair
For texts processed at Uva, all attribute values
are to be placed in double quotation marks.
<abbr> contains an abbreviation of any sort.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time title
xptr xref unclear
Attributes: global, plus the following:
expan: gives an expansion of the abbreviation. Value: any string of characters.
The address is Southmoor <abbr expan="road">Rd.
resp: signifies the editor or transcriber responsible for supplying the expansion of the
abbreviation. Must be an identifier declared in the document header.
The address is Southmoor <abbr expan="Road" resp="LB">Rd.
cert: signifies the degree of certainty ascribed to the expansion of the abbreviation.
type: allows the encoder to classify the abbreviation according to some convenient
typology. Legal values are:
suspension: the abbreviation provides the first letter(s) of the word or phrase,
omitting the remainder.
contraction: the abbreviation omits some letter(s) in the middle.
brevigraph: the abbreviation comprises a special symbol or mark.
superscription: the abbreviation includes writing above the line.
acronym: the abbreviation comprises the initial letters of the words of a phrase.
title : the abbreviation is for a title of address (Dr, Ms, Mr, ...)
organization : the abbreviation is for the name of an organization.
geographic : the abbreviation is for a geographic name.
The type="" attribute is provided for the sake of those who wish to classify abbreviations at
their point of occurrence; this may be useful in some circumstances, though usually the
same abbreviation will have the same type in all occurrences. As the sample values make
clear, abbreviations may be classified by the method used to construct them, the method
of writing them, or the referent of the term abbreviated; the typology used is up to the
encoder and should be carefully planned to meet the needs of the expected use.
In the full TEI guidelines (but not in TEILITE) there is also the <expan> tag. The
<abbr> tag is the mirror image of the <expan> tag; both allow the encoder
to transcribe both an abbreviation and its expansion. In <abbr>,
however, the original is transcribed as the content of the element and the expansion as an
attribute value; <expan> reverses this.
<add> contains letters, words, or phrases inserted in the text by an author, scribe,
annotator or corrector.
May contain: abbr add address anchor bibl biblFull cit corr date del emph figure foreign
formula gap gi gloss hi l label lang lg list mentioned name note num orig p ptr q ref reg
rs s seg sic soCalled sp stage table term text time title xptr xref
Attributes: global, plus the following:
place="" : if the the addition is written into the copy text, indicates where the additional text
is written.
Legal values are:
inline : addition is made in a space left in the witness by an earlier scribe
supralinear : addition is made above the line
infralinear : addition is made below the line
left : addition is made in left margin
right : addition is made in right margin
top : addition is made in top margin
bottom : addition is made in bottom margin
opposite : addition is made on opposite page
verso : addition is made on verso of sheet
mixed : addition is made somewhere, one or more of other values
resp: signifies the editor or transcriber responsible for identifying the hand of the
addition. Must be one of the identifiers declared in the document header.
cert: signifies the degree of certainty ascribed to the identification of the hand of the
addition.
hand : signifies the hand of the agent which made the
addition. Value: must be one of the hand identifiers declared in the document header.
Remarks: The <add> tag should not be used for additions made by editors or encoders.
In these cases, the <corr> tag should be used. [The full TEI tagset also has the
<supplied> tag for this purpose].
<address> Contains a postal or other address (i.e., publisher, organization, individual).
May contain: <addrline> for each line of an address
Attributes: global only
<addrline> : contains one line of a postal or other address.
May occur within: address
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref unclear
Attributes: global only
<anchor> [empty tag]: attaches an identifier to a point within a text, whether or not it
corresponds with a textual element.
Attributes: global
Note: id is required. It supplies an arbitrary name, unique within the document, used to
identify the point at which this element occurs.
Example:
<s>The anchor is he<anchor id="A234">re somewhere.</s>
<s>Help me find it.<ptr target="A234"></s>
<argument> : A formal list or prose description of the topics addressed by a
subdivision of a text. Often contains either a list or a paragraph.
May contain: bibl biblFull cit head l label lg list note p q sp stage
Attributes: global
<author> : in a bibliographic reference, contains the name of the author(s), personal
or corporate, of a work; the primary statement of responsibility for any bibliographic
item.
May occur within: bibl titleStmt
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num ptr ref reg rs s seg sic soCalled term time
title xptr xref
Attributes: global only.
<back> marks material usually found at the back of works, such as notes,
advertisements, indices. The content model of back matter is identical to that of
front matter.
May occur within: text
May contain: argument div docAuthor docDate opener titlePage
Attributes: global only
<bibl> Contains a (loosely structured) bibliographic citation of which the sub-components
may or may not be explicitly tagged. Contains either prose alone, or any
combination of elements from the citation class
May contain:
#PCDATA abbr add address anchor author biblScope corr date
del edition editor emph extent foreign formula gap gi gloss hi idno imprint mentioned name note
num orig ptr publisher pubPlace ref reg rs s seg sic soCalled term time title xptr xref unclear
Attributes: global
Example:
<bibl>Blain, Clements and Grundy: Feminist Companion to Literature in
English (Yale, 1990) </bibl>
Example:
<bibl><title>The Interesting story of the Children in the Wood<title>.
In <author>Victor E Neuberg</author>, <title>The Penny Histories</title>.
<publisher>OUP</publisher> <date>1968</date>.
<body> The wrapper for the main part of the <text>, excluding <front> and
<back> matter.
Must occur within text
Must include: div0
Can include: cit closer div1 docAuthor docDate label opener trailer
Attributes: global
boldface -- see <hi>
<byline> contains the primary statement of responsibility given for
a work on its title page (or at the head or end of the work).
The byline on a title page may include either the name or a
description for the document's author.
Can include: #PCDATA abbr add address anchor corr
date del docAuthor emph foreign formula gap gi gloss hi mentioned name
num orig ptr ref reg rs s seg sic soCalled term time title xptr xref unclear
Attributes: global
Example:
<byline>Written by a CITIZEN who continued all the while in
<place>London</place>. Never made publick before.</byline>
Example:
<byline>BY
<docAuthor>THOMAS PHILIPOTT,</docAuthor>
Master of Arts, (Somtimes) Of Clare-Hall in Cambridge.
</byline>
<cell> : contains one cell of a table.
May occur within: row
May contain: #PCDATA abbr add address anchor bibl biblFull biblStruct cit corr date
del emph figure foreign formula gap gi gloss hi label list mentioned name note num
ptr q ref reg rs s seg sic soCalled stage table term text time title xptr xref
Attributes: global plus the following:
role : indicates the kind of information held in the cell.
Legal values are:
label : labelling or descriptive information only.
data : data values.
The value specified overrides any default specified by the
role attribute of the parent <row> element, for this cell only.
rows : indicates the number of rows occupied by this cell. Value: A number; a value
greater than one indicates that this cell spans several rows. Where several cells span
several rows, it may be more convenient to use nested tables.
cols : indicates the number of columns occupied by this cell. Value: A number; a value
greater than one indicates that this cell spans several columns. Where an initial cell
spans an entire row, it should be treated as a heading.
Example:
<row>
<cell role="label">General conduct</cell>
<cell role="data">Not satisfactory, on account of his great
unpunctuality and inattention to duties</cell>
</row>
<cit> A quotation from some other document, together with a bibliographic
reference to its source. Must contain a single quote and a single bibliographic reference
in either order
May contain: bibl biblFull ptr q ref xptr xref
Attributes: global only
Example:
<cit> <q>and the breath of the whale is frequently attended with such an
insupportable smell, as to bring on disorder of the brain. </q>
<bibl>Ulloa's South America</bibl> </cit>
<closer> : groups together dateline, byline, salutation, and similar phrases appearing
as a final group at the end of a division, especially of a letter.
May occur within: body div div0 div1 div2 div3 div4 div5 div6 div7 group lg
May contain: #PCDATA abbr add address anchor corr date del emph foreign
formula gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic
soCalled term time title xptr xref unclear
Attributes: global only
Example:
...perhaps you will favour me with a sight of it when convenient.</p>
<closer><salute>I remain, &c. &c.</salute> <signed>H.
Colburn</signed> </closer> </div>
Example:
...and his heart was going like mad and yes I said yes I will Yes.</p> <closer>
<dateline> <rs>Trieste-Zürich-Paris,</rs> <date>1914-1921</date>
</dateline> </closer> </div> </text>
<corr> contains the correct form of a passage apparently erroneous in the copy text.
The corr tag is a mirror of sic: the latter leaves the original text untouched, giving the
correction as an attribute value; the former substitutes the correction, leaving the original
reading as an attribute value. The choice between them is up to the encoder.
May contain : #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi l label lang lg list listBibl mentioned name note
num orig p ptr q ref reg rs s seg sic soCalled sp stage table term text time title xptr xref
unclear
Attributes: global plus the following:
sic : gives the original form of the apparent error in the copy text.
Example
for his nose was as sharp as a pen, and <corr sic="a Table">
a 'babbled</corr>
of green fields.
resp : signifies the editor or transcriber responsible for suggesting the correction held as
the content of the <corr> element. Value: must be one of the identifiers declared in
the document header. If the correction was made in the source, this attribute should be
used to identify the hand of the corrector.
cert : signifies the degree of certainty ascribed to the correction held as the content
of the <corr> element.
Example:
If all that is desired is to call attention to the fact that the copy text has been
corrected, no attributes are required:
I don't know, Juan. It's so far in the past now — how <corr>can
we</corr> prove or disprove anyone's theories?
Example:
It is also possible to provide a correct reading and to identify the individual
responsible for the correction:
I don't know, Juan. It's so far in the past now — how
<corr sic="we can" resp="MSM">can we</corr> prove or disprove anyone's theories?
<date> Contains a date in any format. Usually the value is given in the yyyy-mm-dd
format.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time title
xptr xref unclear
Attributes: global plus the following:
calendar : indicates the system or calendar to which the date belongs. Recommended
values include: Gregorian, Julian, Roman, Mosaic, Revolutionary, Islamic.
He was born on <date calendar="Gregorian">Feb. 22, 1732</date> (<date
calendar="Julian" value="1732-2-22">Feb. 11, 1731/32, O.S.</date>).
value : gives the value of the date in some standard form, usually yyyy-mm-dd.
This list begins in the year 1632, more precisely on Trinity Sunday, i.e. the Sunday
after Pentecost, in that year the <date calendar="Julian" value="1632-6-6"> 27th
of May (old style)</date>.
certainty : indicates the degree of precision to be attributed to the date. Value: Any
appropriate value, e.g. ca., approx, after, before.
<date value="1980-02" certainty="approx">early February
1980</date>
<date value="1990-09">September 1990</date>
<del> contains a letter, word or passage deleted, marked as deleted, or otherwise
indicated as superfluous or spurious in the copy text by an author, scribe, annotator or
corrector.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref unclear
Attributes: global plus the following:
rend indicates how the deletion was indicated in the copy text.
Legal values are:
subpunction : dots below the line indicate matter to be deleted.
overstrike : lines through the text indicated matter to be deleted.
erasure : material to be deleted has been erased (but remains legible enough to
transcribe).
bracketed : brackets around the material indicate that it is spurious or
superfluous.
type : classifies the type of deletion using any convenient typology. To record the
manner in which the deletion is signaled, use rend, not type.
status : may be used to indicate faulty deletions, e.g. strikeouts which include too much
or too little text. Value: any description of flaws in the marking of a deletion, e.g.
``excess left'', ``excess right'', ``short left'', ``short right''. Status information on each
deletion is needed rather rarely except in critical editions from authorial manuscripts.
resp : signifies the editor or transcriber responsible for identifying the hand of the
deletion. Value: must be one of the identifiers declared in the document header,
associated with a person asserted as responsible for some aspect of the text's creation,
transcription, editing or encoding.
cert : signifies the degree of certainty ascribed to the identification of the hand of the
deletion.
hand : signifies the hand of the agent which made the deletion. Value: must be one of
the hand identifiers declared in the document header.
Example:
<l><del type="overtyped">Mein</del> Frisch
<del type="overstrike">schwebt</del> weht der Wind
The full TEI Guidelines provide a <delSpan> tag for longer sequences of text, for those
containing structural subdivisions.
The <del> tag should not be used for deletions made by editors or encoders. In these
cases, the <corr> tag should be used.
<div> contains a subdivision of the front, body, or back of a text.
May occur within: back body div front
May contain: argument bibl biblFull biblStruct byline cit closer div docAuthor docDate
entry epigraph head l label lg list note opener p q salute signed sp stage trailer
Note: at UVa, we use unnumbered <div>s only in <front> and <back> sections, not
in the <body> of a <text>.
Attributes: global only
<div0> : contains the largest possible subdivision of the body of a text.
<div1> <div2> <div3> <div4> etc: Wrappers for major
textual divisions (e.g. volumes, chapters, acts,
scenes, poems, letters) within the body.
May occur within: body
May contain: argument bibl biblFull byline cit closer div1 docAuthor
docDate epigraph head l label lg list note opener p q salute signed
sp stage trailer
Attributes: global, plus
type=""
part="" (Y|N|I|M|F) N
<docAuthor> marks off the author as named on the titlepage (often
included in the <byline>)
May occur within: back body byline div div0 div1 div2 div3 div4 div5 div6 div7
front group lg titlePage
May contain: #PCDATA abbr add address anchor corr date del distinct emph foreign
formula gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term
time title xptr xref
Attributes: global
<docdate>: contains the date of a document, as given (usually) on a title page.
May occur within: back body div div0 div1 div2 div3 div4 div5 div6 div7 docImprint
front group lg titlePage
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
Attributes: global plus the following:
value : gives the value of the date in a standard form. Recommended form is
``yyyy-mm-dd''.
This list begins in the year 1632, more precisely on Trinity Sunday, i.e. the
Sunday after Pentecost, in that year the <date type="Julian" value="1632-6-6"> 27th
of May (old style)</date>.
<docEdition> : Description: contains an edition statement as presented on a
title page
of a document.
May occur within: titlePage
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date
del emph figure foreign formula gap gi gloss hi label mentioned name note num orig ptr
q ref reg rs s seg sic soCalled stage table term text time title xptr xref
Attributes: global only
<docImprint> the imprint statement, which includes the place and date of
publication and the publisher's name
May occur within: titlePage
May contain: #PCDATA abbr add address anchor corr date del docDate emph foreign
formula gap gi gloss hi mentioned name num orig ptr publisher pubPlace ref reg rs s seg
sic soCalled term time title xptr xref
Attributes: global only
Example:
<docImprint>Oxford, Clarendon Press, 1987</docImprint>
<docImprint><pubPlace>London</pubPlace>
Printed for <name>E. Nutt, at <pubPlace>Royal Exchange; <name>J. Roberts in
<pubPlace>wick-Lane; <name>A. Dodd without <pubPlace>Temple-Bar; and
<name>J. Graves in <pubPlace>St. James's-street. <date>1722. </docImprint>
<docTitle> contains the title itself, as well as its subsections, as it
appears on the page
Must include: titlePart
Attributes: global
Example:
<docTitle>
<titlePart type="main">The DUNCIAD,
VARIOURVM.
<titlePart type="sub">WITH THE
PROLEGOMENA of SCRIBLERUS.
</docTitle>
<eg> : (in a dictionary) contains an example text containing at least one occurrence of
the word form, used in the sense being described; examples may be quoted from
(named) authors or contrived.
May occur within: attDef entry etym exemplum hom re sense trans
May contain: cit q
Attributes: global only
<emph> Marks words or phrases which are stressed or emphasized for linguistic or
rhetorical effect.
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi label list mentioned name note num orig ptr q ref
reg rs s seg sic soCalled stage table text time title xptr xref
Attributes: global only
Example:
You took the car and did <emph>what</emph>?!!
<epigraph>: a quotation or citation at the beginning of a work, a section or chapter,
or on a title page. Often indicates a sentiment, moral, or mood. This should contain
bibliographical information if possible.
May occur within: back body div div0 div1 div2 div3 div4 div5 div6 div7 front group
titlePage
May contain: bibl biblFull cit l label lg list listBibl note p q sp stage
Attributes: global only
Example:
<epigraph lang="lat">
<cit><bibl>Lucret.</bibl>
<q>
<l part="f">petere inde coronam,
<l>Vnde prius nulli velarint tempora Musae.</l> </q> </cit>
</epigraph>
<figdesc> : contains a brief prose description of the appearance or content of a
graphic. Useful as text to display instead of the graphic image, and is also used for
indexing or documentary purposes.
May occur within: figure
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi label mentioned name note num orig ptr q
ref reg rs s seg sic soCalled stage table text time title xptr xref
Attributes: global
<figure> : indicates the location of a graphic, illustration, or figure.
May contain: figDesc head p text
Attributes: global plus the following:
entity : names the external entity within which the graphic image of the figure is stored.
<figure entity="Fig1">
<head>Figure One: The View from the Bridge</head>
<figDesc>A Whistleresque view showing four
or five sailing boats in the foreground, and a
series of buoys strung out between them.</figDesc>
</figure>
Note: The contents of the entity="" attribute must be declared at the head of the file. See the
section on the teiHeader.
<foreign> Identifies a word or phrase as belonging to some language other than that
of the surrounding text. The lang="" attribute must be specified on all <foreign>
elements.
Can include: pcdata abbr address date name num rs time add corr del gap
orig reg sic unclear emph foreign gloss hi mentioned socalled term title ptr ref xptr xref
anchor s seg gi formula eg bibl biblfull cit q label list listbibl note figure stage table text
Attributes: global only
lang : identifies the language of the word or phrase marked.
Value: contains a language code (associated with some writing system declaration) for
the language in question; where applicable, the codes of
ISO 639 should be used.
Example:
This is heathen Greek to you still?
Your <foreign lang="lat">lapis philosophicus</foreign>?
<formula> : contains a mathematical or other formula.
May contain: figure note
Attributes: global plus the following:
notation : supplies the name of a previously defined notation used for the content of the
element. The name of a notation for which a formal SGML notation declaration has
been provided in the document type declaration.
<front> : the wrapper for all prefatory material (contents, introduction, preface, etc).
Can include: argument byline div docAuthor docDate epigraph head opener salute
signed titlePage
Attributes: global only
<gap> An empty tag. Indicates a point where material has been omitted in a
transcription, whether for editorial reasons described in the TEI header, as part of
sampling practice, or because the material is illegible or inaudible.
May contain: no other tags
Attributes: global plus the following:
desc : gives a description of the omitted text.
reason : gives the reason for omission. Sample values include ``sampling'', ``illegible'',
``inaudible'', ``irrelevant'', ``canceled'', ``canceled and illegible''.
resp : indicates the editor, transcriber or encoder responsible for the
decision not to provide any transcription of the text and hence the application of the <gap> tag.
Must be one of the identifiers declared in the document header, associated with a person
asserted as responsible for some aspect of the text's creation, transcription, editing or
encoding.
hand : In the case of text omitted from the transcription because of deliberate deletion
by an identifiable hand, signifies the hand which made the deletion. Must be one of the
hand identifiers declared in the document header.
agent : In the case of text omitted from the transcription because of damage or other
phenomenon resulting from an identifiable cause, signifies the causative agent.
extent : indicates approximately how much text has been omitted from the transcription,
in letters, minims, inches, or any appropriate unit.
<gi> : contains the name (generic identifier) of an SGML element.
Attributes: global plus the following
tei : indicates whether this element is part of the TEI encoding scheme or not.
Legal values are: yes, no.
Example
The element <gi>sourceDesc</gi> is missing.
<gloss> : identifies a phrase or word used to provide a gloss or definition for some
other word or phrase.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled time
term title xptr xref
Attributes: global plus the following:
target : identifies the associated <term> element in the current document
Example:
We may define <term rend="sc" id="tdpv">discoursal point of view</term> as <gloss
target="tdpv">the relationship, expressed through discourse structure, between the
implied author or some other addresser, and the fiction.</gloss>
<group>
May occur within: group text
May contain: argument byline closer docAuthor docDate epigraph group head opener
salute signed text trailer
Attributes: global only
<head> : marks off a heading to a <div>, <lg>, <list>, etc.
May occur within: argument back body div div0 div1 div2 div3 div4 div5 div6 div7
figure front group lg list table
May contain: #PCDATA abbr add address anchor bibl biblFull cit date del emph figure
foreign formula gap gi gloss hi label list mentioned name note num orig ptr q ref reg rs s
seg sic soCalled stage table term text time title xptr xref
Attributes: global plus the following
type : categorizes the heading in some way meaningful to the encoder.
<hi> marks a word or phrase as graphically distinct from the surrounding text, for
reasons concerning which no claim is made.
Attributes: global only
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi label list mentioned name note num orig ptr q
ref reg rs s seg sic soCalled stage table term text time title xptr xref
Example: <hi rend="gothic">And this Indenture further witnesseth</hi>
that the said <hi rend="italics">Walter Shandy</hi>, merchant, in consideration of the said
intended marriage ...
Note: for the sake of brevity, The Electronic Text Center uses the following shortened
forms in-house -- they are converted back to <hi> tags automatically, prior to parsing:
<i> italics
<b> boldface
<sc> small caps
<sup> superscript
<sub> subscript
<imprint> : groups information relating to the publication or distribution of a
bibliographic item.
May occur within: bibl
May contain: biblScope date publisher pubPlace
Attributes: global only
Example:
<imprint>Oxford, Clarendon Press, 1987 </imprint>
<imprint><pubPlace>Oxford</pubPlace>
<publisher>Clarendon Press</publisher>
<date>1987</date>
</imprint>
<index> : marks a location to be indexed
Global plus the following:
level1 (i.e. first-level index entry) : gives the form under which the index entry is to be
At least one level of entry is required.
level2 (i.e. second-level index
entry) : gives the second-level form, if any.
level3 (i.e. third-level index entry) : gives the third-level form, if any.
level4 (i.e. fourth-level index entry) : gives the fourth-level form, if any.
Example:
David's other principal backer, Josiah ha-Kohen
<index level1="Josiah ha-Kohen b. Azarya"
level2="cousin and backer of David b. Daniel">
b. Azarya, son of one of the last gaons of Sura,
<index level1="Azarya" level2="gaon of Sura">
<index level1="Sura"> was David's own first cousin.
<interp> : provides for an interpretative annotation which can be linked to a span of
text.
Attributes: global plus the following:
value : identifies the specific phenomenon being annotated.
<interpGrp resp="TMA" type="collection of structural units">
<interp value="introduction">
<interp value="conflict">
<interp value="climax">
<interp value="revenge">
<interp value="reconciliation">
<interp value="aftermath"></interpGrp>
<item> marks items in a <list>
May occur within: change list
May contain: #PCDATA abbr add address anchor bibl biblFull
cit corr date del emph figure foreign formula gap gi gloss hi l
label lg list mentioned name note num orig p pause ptr q ref reg rs s
seg sic soCalled sp stage term text time title xptr xref
Attributes: global only
Italics: see <hi>
<keywords> : contains a list of keywords or phrases identifying the topic or nature
of a text.
May contain: list term
Attributes: global plus the following:
scheme : identifies the controlled vocabulary within which the set of keywords concerned
is defined.
Example:
<keywords scheme="BL">
<list><item>Babbage, Charles
<item>Mathematicians - Great Britain - Biography
</list>
</keywords>
<l> contains a single, possibly incomplete, line of verse; usually within a <lg>.
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr
date del emph figure foreign formula gap gi gloss hi label list mentioned name note num orig ptr
q ref reg rs s seg sic soCalled stage table text time title xptr xref
part= : specifies whether or not the line is metrically complete. Legal values are:
Y : the line is metrically incomplete
N : either the line is complete, or no claim is made as to its completeness
The following values -- I, M, or F -- should be used only where it is clear how the line is
to be reconstituted:
I : the initial part of an incomplete line
M : a medial part of an incomplete line
F : the final part of an incomplete line
Attributes: global plus the following:
part="" specifies whether or not the line is metrically complete.
part="y" -- the line is metrically incomplete
part="n" -- either the line is complete, or no claim is made as to its completeness
part="i" -- the initial part of an incomplete line
part="m" -- a medial part of an incomplete line
part="f" -- the final part of an incomplete line
enjamb
met
real
rhyme
<label> Contains the label associated with an item in a list; in glossaries it marks the
term being defined. Labels are most commonly used for the headwords in glossary lists
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled time
title xptr xref
Attributes: global
In the example below, note the use of the global lang attribute to set the
default language of the glossary list to Middle English, and identify the glosses and headings as
modern English or Latin.
<label>nu <item lang="EN">now
<label>lhude <item lang="EN">loudly
<label>bloweth <item lang="EN">blooms
<label>med <item lang="EN">meadow
</list>
<lb> [empty tag] : marks the start of a new prose line in some edition or version of a
text. It should be carefully distinguished from the <l> element, used to mark lines of
verse.
Attributes: global plus the following:
ed : indicates the edition or version in which the line break is located at this point :
<lb ed="Riverside" n="123">
n : indicates the number or other value associated with the line which follows the point
of insertion of this <lb>.
<lg> Marks a group of lines functioning as a formal unit e.g. a stanza, refrain, verse
paragraph, etc.
May occur within: add argument body corr div div0 div1 div2 div3 div4 div5 div6 div7
epigraph item lg note q sic sp stage
May contain: argument byline closer docAuthor docDate epigraph head l lg opener
salute signed trailer
Attributes: global, plus the following:
type="" [Characterizes the line group in some respect (i.e., verse paragraph, stanza, verse, etc.)]
org="" composite|uniform
sample="" initial|medial|final|unknown|complete
part="" y|n|i|m|f
met
real
rhyme
<list> : contains any sequence of items organized as a list.
May contain: head item label
Attributes: global plus the following:
type= : describes the form of the list.
Legal values are:
ordered : list items are numbered or lettered.
bulleted : list items are marked with a bullet or other typographic device.
simple : list items are not numbered or bulleted.
gloss : each list item glosses some term or concept, which is given by a
<mentioned> : marks words or phrases mentioned, not used.
Attributes: global only
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled time
title xptr xref
Example:
There is thus a striking accentual difference between a verbal form like <mentioned
lang="EL" id="c234">eluthemen</mentioned> <gloss target="c234">we were
released,</gloss> accented on the second syllable of the word, and its participial
derivative <mentioned lang="EL" id="c235">lutheis</mentioned> <gloss
target="c235">released,</gloss> accented on the last.
<milestone> [Empty tag] Marks the boundary between sections of text, as indicated
by changes in a standard reference system.
Attributes: global plus the following:
ed : indicates which edition or version the milestone applies to.
unit : indicates what kind of section is changing at this milestone.
Legal values are:
page : page breaks in the reference edition.
column : column breaks.
line : line breaks.
book : any units termed book, liber, etc.
poem : individual poems in a collection.
canto : cantos or other major sections of a poem.
stanza : stanzas within a poem, book, or canto.
act : acts within a play.
scene : scenes within a play or act.
section : sections of any kind.
absent : passages not present in the reference edition.
<name> Contains a proper noun or noun phrase. Proper nouns referring to people,
places, and organizations may be tagged instead with <person>, <place>, or <org>.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled time title xptr
xref
Attributes: global plus the following:
type : indicates the type of the object which is being named by the phrase. Values such
as person, place, institution, product, acronym.
<note> : contains a note or annotation.
Attributes:
n : indicates the symbol or number used to mark the note's point of attachment to the
main text.
Global plus the following:
type : describes the type of note; e.g. annotation, gloss, citation, digression, preliminary,
temporary.
resp : indicates who is responsible for the annotation: author, editor, translator, etc.
Legal values are:
auth[or] : note originated with the author of the text.
ed[itor] : note added by the editor of the text.
comp[iler] : note added by the compiler of a collection.
tr[anslator] : note added by the translator of a text.
transcr[iber] : note added by the transcriber of a text into electronic form.
(initials) : note added by the individual indicated by the initials.
place : indicates where the note appears in the source text.
Legal values are:
foot : note appears at foot of page.
end : note appears at end of chapter or volume.
inline : note appears as a marked paragraph in the body of the text.
left : note appears in left margin.
right : note appears in right margin.
interlinear : note appears between lines of the text.
app[aratus] : note appears in the apparatus at the foot of the page.
anchored : indicates whether the copy text shows the exact place of reference for the
note.
Legal values are:
yes : copy text indicates the place of attachment for the note.
no : copy text indicates no place of attachment for the note.
target : indicates the point of attachment of a note, or the beginning of the span to which
the note is attached.
If target and targetEnd are to be used to indicate where notes attach to the text, then
elements at the appropriate locations (<anchor> elements if necessary) must be given id
values to be pointed at.
<num : contains a number, written in any form.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time title
xptr xref
Attributes: global plus the following:
type : indicates the type of numeric value.
Legal values are:
cardinal : absolute number, e.g. ``21'', ``21.5''
ordinal : ordinal number, e.g. ``21st''
fraction : fraction, e.g. ``one half'' or ``three halves'' percentage e.g. ``ten percent''
value : supplies the value of the number in an application-dependent standard form.
Example:
<num type="cardinal" value="21">twenty-one</num>
<num type="cardinal" value="1.5">1.5</num>
He stands <num type="cardinal" value="1.9">1 · 90</num>m. high.
<opener> : groups together dateline, byline, salutation, and similar phrases appearing
as a preliminary group at the start of a division, especially of a letter.
May contain: #PCDATA abbr add address anchor corr date del emph foreign
formula gap gi gloss hi mentioned name num orig ptr ref reg rs s
salute seg sic signed soCalled term time title xptr xref
Attributes: global only
Example:
<opener>
<dateline><place>Great Marlborough Street</place>
<date>November 11, 1848</date>
</dateline>
<salute>My dear Sir,</salute>
</opener>
<p>I am sorry to say that absence from town and other
circumstances have prevented me from earlier enquiring...
<orig>
May contain: pcdata abbr address date name num rs time add corr del
gap orig reg sic unclear emph foreign gloss hi mentioned socalled term title ptr ref xptr xref
anchor s seg gi formula
Attributes: global plus the following:
reg
resp
<p> marks a paragraph in prose.
Attributes: global only
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi label list mentioned name note num orig ptr q ref
reg rs s seg sic soCalled stage table term text time title xptr xref
<pb> empty tag : marks the boundary between one page of a text and the next in a
standard reference system.
Attributes: global plus the following:
ed : indicates the edition or version in which the page break is located at this point
<pb ed="Riverside" n="123">
By convention, <pb> elements should appear at the start of the page to which they
refer.
<ptr> : a pointer to another location in the current document.
Attributes: global plus the following:
target= Specifies the destination of the pointer as one or more SGML identifiers.
type= Categorizes the pointer in some respect; it should indicate the function of the
pointer, or the rhetorical relationship between its source and target.
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
figure foreign formula gap gi gloss hi l label lg list mentioned name note num orig p ptr
q ref reg rs s seg shift sic soCalled sp stage table term text time title xptr xref
Use <q type="direct"> (if needs be) to mark direct speech.
Attributes: global plus the following:
type= [May be used to indicate whether the quoted matter is spoken or thought, or to
characterize it more finely.]
spoken : representation of direct speech, usually marked by quotation marks.
thought : representation of thought, e.g. internal monologue.
direct : used to indicate whether the quoted matter is regarded as direct or
indirect speech.
who= [Identifies the speaker.]
<ref> defines a reference to another location in the current document, in terms of
one or more identifiable elements, possibly modified by additional text or comment.
Attributes: global plus the following:
target = Specifies the destination of the reference as one or more SGML identifiers.
type=
crdate=
targtype=
targorder= Y|n|u
evaluate= All|one|none
<refsdecl> : specifies how canonical references are constructed for this text.
Occurs within: encodingDesc
Contains: p
Attributes: global plus the following:
doctype : identifies the document type within which this reference declaration is used.
<reg> : contains a reading which has been regularized or normalized in some sense.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi lang mentioned name num orig ptr ref reg rs s seg sic soCalled tag term
time title xptr xref
Attributes:
orig : gives the unregularized form of the text as found in the source copy.
<reg orig="auctoritee">Authority</reg>
resp: identifies the individual responsible for the regularization of the word or phrase.
Value: any string of characters, typically the initials of the individual involved, or a role
identifier like "editor" if not known by name.
Remarks: The <reg> tag is mirrored by the <orig> tag, which allows the unnormalized
form of the original to be retained as the content of the element, while still providing the
opportunity to record the regularized or normalized form postulated by a researcher. The
choice between the two elements is up to the encoder.
<rendition> : supplies information about the intended rendition of one or more
elements.
Attributes: [None: global and inherited attributes only.]
May contain: #PCDATA abbr add address anchor att bibl biblFull
biblStruct c caesura caption castList cit cl corr date
del distinct emph expan figure foreign formula
gap gi gloss hi label lang link list listBibl m measure
mentioned move name note num orig ptr q
ref reg rs s seg sic soCalled stage table tag tech term text
time title xptr xref
<resp> : contains a phrase describing the nature of a person's intellectual
responsibility.
May occur within: respStmt
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi lang mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
br> Attributes: global only
Example:
<respStmt><resp>compiler</resp> <name>Edward Child</name></respStmt>
<respstmt> : supplies a statement of responsibility for someone responsible for the
intellectual content of a text, edition, recording, or series, where the specialized elements
for authors, editors, etc. do not suffice or do not apply.
May occur within: bibl change editionStmt series seriesStmt titleStmt
May contain: name resp
Attributes: global
Example:
<respStmt><resp>transcribed from original ms</resp>
<name>Claus Huitfeldt</name>
</respStmt>
<revisiondesc> : summarizes the revision history for a file. Record changes with most
recent changes at the top of the list.
May occur within: teiHeader
May contain: change list
Attributes: global
Example:
<revisionDesc>
<change><date>11 Nov 91</date>
<name>EB </name>
<what>Deleted chapter 10 </what>
</revisionDesc>
<row>: contains one row of a table.
May occur within: table
May contain: cell table
Attributes: global plus the following:
role
indicates the kind of information held in the cells of this row.
Legal values are:
label
data
Remarks: The value specified is the default for all cells in this row.
Example:
<row role="data">
<cell role="label">Classics
<cell>Idle listless and unimproving </row>
<rs> : contains a general purpose name or referring string.
May contain: #PCDATA abbr add address anchor att c caesura cl corr
date dateRange dateStruct del distinct emph expan foreign formula gap
gi gloss handShift hi lang link m measure mentioned name num orig oRef
oVar phr ptr pRef pVar ref reg rs s seg sic soCalled tag term time
timeRange timeStruct title val w xptr xref
Attributes: global plus the following:
key
reg
type : indicates more specifically the object referred to by
the referencing string. Values might include
"person", "place", "ship", "element" etc.
<q>My dear <rs type="person">Mr. Bennet</rs>, </q>
said <rs type="person">his
lady</rs> to him one day,
<q>have you heard that <rs type="place">Netherfield
Park</rs> is let at last?</q>
<s> : Contains any arbitrary phrase-level unit of text, such as a sentence.
Can include: date emph foreign i
name place s title q socalled ref mention term gloss address q list
note stage bibl
Attributes: global plus the following:
type= Indicates what sort of segment is marked.
Function= <s>: contains a sentence-like division of a text -- orthographic sentences,
or any other segmentation of a text, provided that the segmentation is end-to-end,
complete, and non-nesting. For other kinds of segmentation, the <seg> element should
be used.
May contain: #PCDATA abbr add address anchor corr date del emph expan foreign
formula gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term
time title xptr xref
Attributes: global
<s>When are you leaving?</s>
<s>Tomorrow.</s>
<salute> : contains a salutation or greeting prefixed to a foreword, dedicatory epistle
or other division of a text, or the salutation in the closing of a letter, preface, etc.
May contain: #PCDATA abbr add address anchor corr date del emph expan foreign
formula gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled tag
term time title xptr xref
Attributes: global
<seg> : contains any arbitrary phrase-level unit of text (including other <seg>
elements).
Remarks: The <seg> element may be used at the encoder's discretion to mark any
segments of the text of interest for processing. One use of the element is to mark text
features for which no appropriate markup is otherwise defined --- i.e. as a simple
extension mechanism. May contain anything which may appear within a paragraph.
May contain: #PCDATA abbr add address anchor att bibl biblFull cit corr date del
emph expan figure foreign formula gap gi gloss hi label lang link list listBibl m measure
mentioned move name note num orig ptr q ref reg rs s seg sic soCalled stage table term
text time title xptr xref
Type
Function
subtype : provides a sub-categorization of the segment is marked. The subtype attribute
may be used to provide any classification for the <seg> elements tagged in a text
suitable for the type given.
part : specifies whether or not the segment is complete. Legal values are:
Y : the segment is incomplete
N : either the segment is complete, or no claim is made as to its completeness
The following values -- I, M, or F -- should be used only where
it is clear how the segment is to be reconstituted.
I : the initial part of an incomplete segment
M : a medial part of an incomplete segment
F : the final part of an incomplete segment
<sic> : contains text reproduced although apparently incorrect or inaccurate.
Corr cdata #implied
Resp idref #implied
Cert cdata #implied
<seriesStmt> : groups information about the series, if any, to which a publication
belongs.
May occur within: biblFull fileDesc
May contain: idno p respStmt title
Attributes: global
Example:
<seriesStmt>
<title>Machine-Readable Texts for the Study of Indian Literature</title>
<respStmt>
<resp>ed. by</resp> <name>Jan Gonda</name>
</respStmt>
<idno type=vol>1.2</idno>
<idno type=ISSN>0 345 6789</idno>
</seriesStmt>
<sic> : contains text reproduced although apparently incorrect or inaccurate.
The <sic> tag is a mirror of <corr>: the former leaves the original text untouched,
giving the correction as an attribute value; the latter substitutes the correction, leaving
the original reading as an attribute value. The choice between them is up to the encoder.
May contain: #PCDATA abbr add address anchor bibl biblFull cit corr date del emph
expan figure foreign formula gap gi gloss graph hi l label lang lg list mentioned name
note num orig p ptr q ref reg rs s seg sic soCalled sp table term text time title xptr xref
Attributes: global plus the following:
corr : gives a correction for the apparent error in the copy text.
for his nose was as sharp as a pen, and a Table of
green fields.
resp : signifies the editor or transcriber responsible for suggesting the correction held as
the value of the corr attribute. Must be one of the identifiers declared in the document
header, associated with a person asserted as responsible for some aspect of the text's
creation, transcription, editing or encoding.
cert : signifies the degree of certainty ascribed to the correction held as the value of the
corr attribute.
If all that is desired is to call attention to the apparent problem in the copy text, no
attributes are required:
I don't know, Juan. It's so far in the past now —
how <sic>we can</sic> prove or disprove anyone's theories?
It is also possible to provide a correct reading and to identify the individual responsible
for the correction:
I don't know, Juan. It's so far in the past now —
how <sic corr=can we resp=MSM>we can</sic> prove or
disprove anyone's theories?
<signed> : contains the closing salutation, etc., appended to a foreword,
dedicatory epistle, or other division of a text.
May occur within: back body closer div div0 div1 div2 div3 div4 div5 div6 div7
epilogue front group lg opener
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
Attributes: global
Example: <signed>Thine to command <name>Humph.
Moseley</name></signed>
For <sc> small caps, see <hi>
<socalled> : contains a word or phrase for which the author or narrator indicates a
disclaiming of responsibility, for example by the use of scare quotes or italics.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
Attributes: global
Example: To edge his way along the crowded paths of life, warning all human
sympathy to keep its distance, was what the knowing ones call <soCalled>nuts</soCalled>
to Scrooge.
<sourceDesc> : supplies a bibliographic description of the copy text(s) from which an
electronic text was derived or generated.
May occur within: biblFull fileDesc
May contain: bibl biblFull biblStruct p
Attributes: global
Example:
<sourceDesc>
<p>No source: created in machine-readable form.</p>
</sourceDesc>
<sp> : An individual speech in a performance text, or a passage presented as such in
a prose or verse text.
May occur within: add argument body corr div div0 div1 div2 div3 div4 div5 div6 div7
epigraph item note q sic stage
May contain: l lg p seg speaker stage
Attributes: global plus the following:
who : identifies the speaker of the part by supplying an ID.
Remarks: The who="" attribute on this element may be used either in addition to the
<speaker> element or as an alternative.
<speaker> : A specialized form of heading or label, giving the name of one or more
speakers in a dramatic text or fragment.
Marks a speaker within a <sp>
Must be within:<sp>
Attributes: global
The who attribute on the <sp> element may be used either in addition to this element
or as an alternative.
May occur within: sp [May not appear within <speaker>]
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
<stage> Contains any kind of stage direction within a dramatic text or fragment.
May contain: #PCDATA abbr add address anchor bibl biblFull biblStruct cit corr date
del emph figure foreign formula gap gi gloss graph hi l label lang lg list mentioned name
note num orig p ptr q ref reg rs s seg shift sic soCalled sp stage table term text time title
xptr xref [May not include at any level: stage]
Attributes: global plus the following:
type="" : indicates the kind of stage direction
(i.e., setting, entrance, exit, business,
novelistic, delivery, modifier, location, mixed, etc.)
Legal values are:
setting : describes a setting.
entrance : describes an entrance.
exit : describes an exit.
business : describes stage business.
novelistic : is a narrative, motivating stage direction.
delivery : describes how a character speaks.
modifier : gives some detail about a character.
location : describes a location.
mixed : more than one of the above
Example:
<stage type="setting">A curtain being drawn.</stage>
<stage type="setting">Music</stage>
<stage type="entrance">Enter Husband as being thrown off his
horse.</stage>
<stage type="exit">Exit pursued by a bear.</stage>
<stage type="business">He quickly takes the stone out.</stage>
<stage type="delivery">To Lussurioso.</stage>
<stage type="novelistic">Having had enough, and embarrassed for
the family.</stage>
<stage type="modifier">Disguised as Ansaldo.</stage>
<stage type="location">At a window.</stage>
<stage type="delivery rend="inline">Aside.</stage>
For
<sup> superscript, see <hi>
For
<sub> subscript, see <hi>
<table> :
contains text displayed in tabular form, in rows and columns.
May occur within: add admin camera caption case cell corr
country damage def desc descrip docEdition emph equiv etym figDesc
foreign form head hi
item l lang lbl meeting mood note number p
q ref region row seg set sic
stage title titlePart
trans unclear xref
May contain: head row
Attributes: global plus the following:
rows
indicates the number of rows in the table.
cols
indicates the number of columns in each row of the table.
Remarks: Any rendition information should be supplied using the global
rend attribute, at the table, row, or cell level as appropriate.
<targetEnd> : points to the end of the span to which the note is attached, if the note
is not embedded in the text at that point.
May contain: #PCDATA abbr add address anchor att bibl biblFull
biblStruct c caesura camera caption castList cit cl corr date
dateRange dateStruct del distinct emph entry expan
figure foreign formula gap gi gloss graph hi l
label lang lg lg1 list m measure mentioned move name
note num orig p ptr q ref reg rs s
seg sic soCalled sp stage table tag tech term
text time title xptr xref
Attributes: global plus the following:
type= [Describes the type of note (i.e., annotation, gloss, citation,
digression, preliminary, temporary, etc.).] "ed" "au" "en"
place= [Indicates where the note is in the source text.] "f" "e" "l" "r" "n"
target= [indicates the point of attachment of a note, or the beginning of the span to
which the note is attached.]
anchored= (yes | no)
Resp
Targetend
<taxonomy : defines a typology used to classify texts either implicitly, by means of a
bibliographic citation, or explicitly by a structured taxonomy.
May occur within: classDecl
May contain: bibl biblFull biblStruct category
Attributes: global
Example:
<taxonomy id=B>
<bibl>Brown Corpus</bibl>
<category id=B.A><catdesc>Press Reportage
<category id=B.A1><catdesc>Daily</category>
<category id=B.A2><catdesc>Sunday</category>
<category id=B.A3><catdesc>National</category>
<category id=B.A4><catdesc>Provincial</category>
<category id=B.A5><catdesc>Political</category>
<category id=B.A6><catdesc>Sports</category>
</category>
<category id=B.D><catDesc>Religion
<category id=B.D1><catdesc>Books</category>
<category id=B.D2><catdesc>Periodicals and tracts</category>
</category>
</taxonomy>
<term;
A computational device that infers structure from grammatical strings of words is known
as a <term>parser</term>, and much of the history of NLP over the last 20 years has
been occupied with the design of parsers.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time
title xptr xref
type : classifies the term using some typology.
<text> : contains a single text of any kind, whether unitary or composite, for example
a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample.
Attributes: global
May contain: back body front group
<textclass> : groups information which describes the nature or topic of a text in terms
of a standard classification scheme, thesaurus, etc.
Attributes: global
<time> : contains a phrase defining a time of day in any format.
May contain: #PCDATA abbr add address anchor corr date del emph foreign formula
gap gi gloss hi mentioned name num orig ptr ref reg rs s seg sic soCalled term time title
xptr xref
value : gives the value of the time in a standard form; recommended form is ``hhmm'',
using the 24 hour clock.
<time value=1600>four o' clock</time>
type : indicates something about the type of temporal expression being tagged.
Legal values are:
am : indicates a temporal expression made on the basis of a twelve-hour clock
and referring to a time between midnight and noon.
pm : indicates a temporal expression made on the basis of a twelve-hour clock
and referring to a time between noon and midnight.
24hour : indicates a temporal expression made on the basis of a twenty-four-hour
clock.
descriptive : indicates a temporal expression made in descriptive terms, e.g.
``noon''.
zone : indicates time zone or place name wherever this is necessary to evaluate a
temporal expression. Contains a word or phrase such as GMT or 'Eastern Standard
Time' which might be helpful in evaluating the temporal expression.
<title> : contains the title of a work, whether article, book, journal, or series, including
any alternative titles or subtitles.
May contain: #PCDATA abbr add address anchor bibl biblFull biblStruct cit corr date
del emph figure foreign formula gap gi gloss hi label list mentioned name note num orig
ptr q ref reg rs s seg sic soCalled stage table term text time title xptr xref
level : indicates whether this is the title of an article, book, journal, series, or
unpublished material.
Legal values are:
a : analytic title (article, poem, or other item published as part of a larger item)
m : monographic title (book, collection, or other item published
as a distinct item,
including single volumes of multi-volume works)
j : journal title
s : series title
u : title of unpublished material (including theses and dissertations unless
published by a commercial press)
type : classifies the title according to some convenient typology.
Legal values are:
main : main title
subordinate : subtitle, title of part
parallel : alternate title, often in another language, by which the work is also
known
abbreviated : abbreviated form of title
This attribute is provided for convenience in analysing titles and processing them
according to their type; where such specialized processing is not necessary, there is no
need for such analysis, and the entire title, including subtitles and any parallel titles, may
be enclosed within a single <title> element.
<titlepage> : contains the title page of a text, appearing within the front or back
matter.
May occur within: back front
May contain: byline docAuthor docDate docEdition docImprint
docTitle epigraph titlePart
Attributes:
type : classifies the title page according to any convenient typology.
Value: any string, e.g. ``full'', ``half'', ``Series'', etc.
This attribute allows the same element to be used for volume title pages, series title
pages, etc., as well as for the main title page of a work.
<titlePart> marks a particular subsection of the title of a work,
as indicated on a title page.
May occur within: docTitle titlePage
May contain: #PCDATA abbr add address anchor att bibl biblFull
biblStruct c caesura camera caption cit cl corr date
del distinct emph expan figure foreign formula
gap gi gloss handShift hi label lang link list listBibl m measure
mentioned move name note num orig ptr q
ref reg rs s seg sic soCalled stage table tag tech term text
time title xptr xref
Attributes: global, plus the following:
type="main" the main title of the work
type="sub" a subtitle of the work
type="alt" alternative title of the work
type="desc" descriptive paraphrase of the work included in the title
Example:
<docTitle>
<titlePart type=main>THE
FORTUNES
AND
MISFORTUNES
Of the FAMOUS
Moll Flanders, &c.
<titlePart type=desc>Who was BORN in
NEWGATE,
And during a Life of continu'd Variety for
Threescore Years, besides her Childhood, was
Twelve Year a {Whore}, five times a {Wife} (wherof
once to her own Brother) Twelve Year a {Thief,}
Eight Year a Transported {Felon} in {Virginia}, at last
grew {Rich}, liv'd {Honest}, and died a {Penitent}.
</docTitle>
<trailer> Contains a closing title or footer appearing at the end of
a division of a text.
May occur within: body castGroup div div0 div1 div2 div3 div4 div5
div6 div7 epilogue group lg listBibl performance prologue
May contain: #PCDATA abbr add address anchor att c caesura cl corr
date dateRange dateStruct del distinct emph expan foreign formula gap
gi gloss handShift hi lang link m measure mentioned name num orig oRef
oVar phr ptr pRef pVar ref reg rs s seg sic soCalled tag term time
timeRange timeStruct title val w xptr xref
Attributes: global
Example:
<trailer>Explicit pars tertia
<unclear>
Description: contains a word, phrase, or passage which cannot be
transcribed with certainty because it is illegible or inaudible in the
source.
The same element is used for all cases of uncertainty in the
transcription of element content, whether for written or spoken material.
The <damage>, <del>, <unclear> and <supplied> elements may be
closely allied in use.
May contain: pcdata abbr address date name num rs time‚ add corr del gap orig
reg sic unclear emph foreign gloss hi mentioned socalled term title ptr ref xptr xref anchor ‚s
seg gi formula eg bibl biblfull cit q label list listbibl note figure stage table text
reason
Value: any phrase describing the difficulty, e.g. ``faded'',
``ambient noise'', ``passing truck'', ``ill formed'', ``eccentric
ductus''.
resp: indicates the individual responsible for the
transcription of the letter, word or passage contained with the
<unclear> element. Must be one of the identifiers declared in the document
header, associated with a person asserted as responsible for some
aspect of the text's creation, transcription, editing or encoding
cert:
signifies the degree of certainty ascribed to the
transcription of the text contained within the <unclear> element
hand : Where the difficulty in transcription arises from
action (partial deletion, etc.) assignable to an identifiable
hand, signifies the hand responsible for the action.
Must be one of the hand identifiers declared in the
document header.
agent : Where the difficulty in transcription arises from an
identifiable cause, signifies the causative agent.
<xptr>
(i.e. extended pointer)
defines a pointer to another location in the current
document or an external document.
Attributes:
type
resp
crdate
targtype
targorder (Y n u) u
evaluate (All one none)
doc
from
to
May occur within: abbr activity actor add addrLine author bibl byline
caption cell cit cl closer corr creation damage date
def del desc docAuthor docDate docEdition docImprint edition
editor emph equiv expan extent figDesc
foreign funder gloss head hi item l label lang language measure mentioned name note num
opener orig p pubPlace q re ref
resp rs s
salute seg signed soCalled
speaker sponsor tagUsage term time title tr
trailer trans u unclear xref
May contain: [none]
Xref
<xref> (i.e. extended reference)
: defines a reference to another location in the current
document, or an external document, using an extended pointer notation,
possibly modified by additional text or comment.
May contain: pcdata abbr address date name num rs time add corr del gap orig reg
sic unclear emph foreign gloss hi mentioned socalled term title ptr ref xptr xref anchor s seg gi
formula eg bibl biblfull cit q label list listbibl note figure stage table text