Full list of encoding topics

Text structure

Advertisements
Encoding of two different kinds of advertisements: advertisements for other books, and preliminary addresses to the reader

Arguments and summaries
Definition of argument, and use of argument element

Back matter
General notes on encoding back matter, including the various types of concluding material

Closers and trailers
Using closer to group together information at the bottom of a division (especially in letters); the usual contents of closer; difference between closer and trailer

Dedications
Encoding dedications using div type="dedication"; distinguishing between dedications and other prefatory material

Embedded texts

Epigraphs
Encoding of epigraphs with either epigraph or div type="epigraph" depending on their structural location.

Essays
Identification and encoding of essays

Figures
Encoding of figures and illustrations using figure; handling of text within figures; discussion of the WWP’s changes to the content model of figure

Frontispieces
Use of div type="frontispiece"

Front matter
General notes on encoding front matter, including the various types of prefatory material

group and text

Headings
Use of head to encode headings, and permissible values for type attribute

Heads and labels
Comparison of headings and labels, and the use of head and label

Indexes
Differences between tables of contents (ordered by location in the book) and indexes (ordered by topic).

Indexes and tables of contents
Differences between tables of contents (ordered by location in the book) and indexes (ordered by topic).

Journal and diary entries
Encoding of diary and journal entries using div type="entry"

Letters: general notes
General information on encoding letters in three different contexts: within collections, embedded in other works, and as prefatory material

Letters embedded within other works
More details on encoding embedded letters, a special case of embedded texts.

Lists: errata lists
Encoding of errata lists with div type="errata" and a nested list type="errata". Within each list item, further encoding captures the functional components such as the error, the page number, etc.

Lists: general notes
Encoding lists, including discussion of criteria for identifying lists

Lists: sequencing errors
Encoding lists, including discussion of criteria for identifying lists

Lists: subscriber lists
Encoding subscriber lists using list type="subscriber", including handling of ditto used for repeated items

Numbered and unnumbered div

Numbering: general
Use of the n attribute and handling of line numbering

Openers and closers
Using opener to group together information at the top of a division (especially in letters); the usual contents of opener

Overlapping and fragmented elements: q, quote, l, and lg
Strategies for handling overlapping textual features, particularly quotations and poetry

Paragraphs and prose components

Postscripts, ps
Encoding of postscripts using the WWP element ps

Prefatory material
Use of div type="prefatory" and other

Quotations: direct speech
Encoding of quotations, distinction between use of q and quote, treatment of quotation marks

Quotations: distinction between q and quote
Encoding of quotations, distinction between use of q and quote, treatment of quotation marks

Quotations: material from other sources
Encoding of quotations, distinction between use of q and quote, treatment of quotation marks

Recognizing subdivisions

Salutes and signatures
Encoding of salutations and signatures in letters using salute and signed

Subdivisions of the text

Tables of contents
Encoding of tables of contents with list inside div type="contents", with internal encoding to capture the functional parts of the table of contents information, such as page numbers and titles.

Text structure

Text Structure: Overview
An outline view of the topics covered within the discussion of text structure

The internal structure of div

The type attribute on div

Verse

Additional verse features

Line Groups

Stanzas and repeating forms

TEI special tag set for verse

The poem as a whole

Verse

Drama

Cast lists
Encoding of cast lists using castList, situations where the original cast list is missing or incomplete, discussion of WWP changes to the TEI DTD

Cast lists
Use of the who attribute to represent the speaker in a dramatic text, and to link individual speeches to the cast list.

Cast lists: special cases
Encoding of cast lists using castList, situations where the original cast list is missing or incomplete, discussion of WWP changes to the TEI DTD

Dramatic speakers
Encoding of dramatic speeches and speakers using sp and speaker, use of the who attribute. Use of full TEI encoding for drama to encode speech and speakers in both dramatic and non-dramatic texts. This entry discusses the differences and what encoding features are best used in each context.

Stage directions
Encoding of stage directions in drama and verse dialogues, position of stage directions, identification of speakers within stage directions

The type attribute of stage
Categorization of stage directions using the type attribute, with a list of permissible values

Letters

Collections of letters
Encoding collections of letters using div type="letter", and description of internal structure

Letters as prefatory material
Encoding letters which serve as prefatory material

Phrase-level

Abbreviations
Encoding of abbreviations using abbr, including a list of common abbreviations which are not tagged, and treatment of punctuation

Authors in the main text
Encoding of authors in bibliographic entries, using author and persName

Combined dates and times
Encoding of combined date and time data, using the value attribute for date

Dates, errors in
Encoding errors in dates

Dates: complex encoding
Encoding dates using date and the value attribute, including detailed instructions on the ISO8601 standard for date values

Dates: date ranges
Encoding date ranges using the date element rather than dateRange

Dates: dealing with calendars
Encoding of old-style dates and dates expressed in the Julian calendar

Dates: format for the value and when attributes
Encoding dates using date and the value (P4) or when (P5) attribute, including detailed instructions on the ISO 8601 standard for date values

Dates: simple encoding
Encoding dates using date and the value (P4) or when (P5) attribute

Emphasis
The emph element should be used for linguistic emphasis, where that can be distinguished from casual or decorative highlighting and from other motivating factors such as titles, foreign words, and so forth.

Encoding unknown features
Use of seg or ab as a placeholder to flag textual features for which the correct encoding is uncertain

Foreign words and phrases
Encoding foreign-language words and phrases using the lang or xml:lang attribute on existing elements, and the foreign element when necessary

Interaction of various emendation types
Discussion of how different kinds of textual emendation interact at the encoding level

Measures and numbers
Encoding of numbers and measurements using measure and num

Name keys: encoding unique name references
Use of the key attribute on persName to uniquely identify individuals

Names: abbreviations
Encoding of abbreviated versions of names

Names: general notes
Overview of the encoding of names, including personal names, place names, organizational names, and the names of objects

Names: problems of multiple reference
Discussion of encoding personal names that refer to more than one person

Names: problems of nested reference
Discussion of the encoding of human names using persName, including criteria for identifying creatures as human, and guidelines for nesting name elements

Names of collectivities and organizations
Discussion of encoding the names of collectivities and organizations using name and orgName, including distinctions between collectivities and organizations

Names of humans: encoding details
Discussion of the encoding of human names using persName, including criteria for identifying creatures as human, and guidelines for nesting name elements

Names of humans: general notes
Discussion of the encoding of human names using persName, including criteria for identifying creatures as human, and guidelines for nesting name elements

Names of non-humans and things
Discussion of the encoding of the names of non-human creatures, things, and events using name

Names of places
Discussion of encoding the names of places using placeName, including definition of place and relationship between place names and personal names

Phrase-level encoding: Overview
An outline view of the topics covered within the discussion of phrase-level encoding

Referencing strings (the rs element)
Use of the rs element

Simple highlighting
Encoding of simple renditional highlighting using hi

Special terminology, irony, and other forms of textual highlighting
Encoding of specialized language, including technical terminology, ironic usage, and words which are being discussed as words rather than used

Time
Encoding of time using time and the value attribute; our recommended usage limited to cases which are used to structure a set of entries in a journal or log

Titles of other texts
Encoding of titles in bibliographic entries and in running prose, using title, including criteria for identifying titles

Rendition

Alignment and justification
Use of the align keyword to encode horizontal alignment of elements whose position on the page is vertically constrained

Braces and the braced keyword
Use of the braced keyword to encode bracing used to group together multiple lines (e.g. lines of poetry)

Case
Use of the case keyword to capture case, and approaches to transcription

Columns
Encoding of multi-column layouts using the columns keyword in the renditional ladder

Decorative capitalization
Decorative capitalization should be encoded with hi, with an optional type attribute if categorization is useful.

Encoding document appearance: general points
General notes on what kinds of renditional information we do and do not capture

Encoding document appearance: rendition ladders
General notes on the use of the rendition ladder in the rend attribute, overview of keyword/value structure

Forme work (metawork): general
Encoding various types of forme work (including page numbers, line numbers, catchwords, press figures, signatures, and a few other features) using the mw element

Indentation
Overview of the encoding of indentation, including absolute and relative indentation, first-line indentation, and negative indentation

Inverted characters
Treatment of characters which are printed upside down in the source

Leaders
Transcription of leaders (separators within column-formatted lists such as tables of contents)

Line breaks: general
Line breaks in general are encoded with lb, with the exception of verse lines.

Quotation marks
Quotation marks should be captured where possible as renditional information modifying the element that motivates their appearance.

Renditional defaults
Methods of setting renditional defaults, using the tagsDecl in the TEI header

Renditional prefixes and postfixes
Use of the pre and post keywords to capture characters printed before or after an element (used as delimiters)

Rendition ladders: common keywords and values
Overview of keywords and values used in the rendition ladder

Ruled lines, borders, and ornaments
Transcription of rules and ornaments using an entity reference

Rules and ornaments: use as delimiters
Encoding of rules and ornaments as delimiters on elements, using the rend attribute

Slant and weight
Use of the slant and weight keywords to capture italicization and bold type

Space
Regularization of sizing and spacing, including regularization of vertical and horizontal space and of type size

Specialized renditional keywords: bestow and bequeath
Use of the bestow and bequeath keywords to propagate renditional information from an element to its children or descendants

Specialized renditional keywords: get
Use of the get keyword to duplicate the renditional features of a given element on other elements

Superscripted and subscripted letters
Use of the sub and sup keywords to encode subscription and superscription of letters

The break keyword and line break defaults
Use of the break keyword to capture line breaks between elements, including defaults assumed in WWP practice

The place keyword and text positioning
Use of the place keyword to encode the vertical and horizontal position of elements whose position on the page is unconstrained

Turnunders and the pos keyword
Use of the pos keyword to indicate the position of the remainder of a verse line which is printed on the line above or below

Type size and face
Treatment of type size and type face, information the WWP does and does not record

Typographic detail
General notes on what aspects of typography the WWP does and does not capture

Physical document

Catchwords
Encoding of catchwords using mw type="catch", including how to handle discrepancies between the catchword and the matching word in the text

Encoding information about the physical document: overview

Forme work, encoding within
Discussion of the types of encoding which may appear within the mw element

Forme work: renditional issues
Encoding renditional distinctions within mw

Line numbers
Encoding line numbers that are printed in the original text using mw type="lineNum".

Milestones, mw, and div boundaries
Discussion of the order and location of elements associated with page breaks (catchwords, milestones, page numbers, etc.)

Page breaks and page numbering
Encoding of page breaks and page numbering using the pb element and its n attribute, including guidelines for creating idealized page number sequences

Signatures
Encoding of the collation of the document, recording both printed signatures as they appear on the page using mw type="signature" and also an idealized signature sequence using milestone unit="sig".

Notes and annotations

Notes and annotations: overview

Reference structures

Acrostics
Encoding acrostics, and in particular capturing the encrypted word or words from the acrostic in a searchable form, using the WWP acrostic element

Bibliographic references
Encoding of bibliographic references using bibl, including guidelines for identifying bibliographic references and when not to encode them.

Endnotes
Encoding of endnotes using note, within a separate div type="endnotes"

hyperDiv
Use of the WWP hyperDiv element as a container for notes and other hypertextual components of the text, such as acrostics and supplemental cast lists

Inline notes
Encoding of inline notes, using note rend="place(inline)"

Links and cross-references
Encoding of links and cross-references using ref and xref

Notes, endnotes, and footnotes: overview
General overview of encoding notes, including footnotes, endnotes, marginal notes, and inline notes, and giving a summary of how notes are linked to the main text

Notes, page breaks within
Encoding page breaks within footnotes, using a second pb element which points to the main pb element

Notes: encoding the note itself
Encoding the text of a note, including details of the WWP’s changes to the content model of note, and discussion of the various things that appear in notes

Notes: linking the note and the text
Notes (including footnotes, endnotes, and marginal notes) should be linked to their anchor point using a bi-directional link which explicitly identifies both the anchor point and the note.

Notes: resp and type
Use of the resp and type attributes on note to capture the authorship of the note

Notes in the TEI header
Encoding of textual notes in the notesStmt of the TEI header

Repeated information: ibid, ditto, and other abbreviations
Encoding repetitions signalled with ditto, ibid, and similar markers, using the sameAs attribute

Transcription

Early typography and letter substitutions

Ellipsis

Encoding brevigraphs

Errors in the original

Features omitted from transcription

Handwriting: additions and deletions

Handwriting: General notes

Handwriting: the hand attribute and the hand element

Punctuation: general

Punctuation: hyphens

Punctuation and elements

Regularization: orig and reg

Regularization: silent

Special characters: ordinary characters requiring special treatment

Tagging at the letter or word level

Transcription of primary sources