<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="../stylesheets/yaps-tei.css"?>
<?oxygen RNGSchema="../schema/yaps.rnc" type="compact"?>
<?oxygen SCHSchema="../schema/yaps.sch"?>
<TEI xmlns="http://www.wwp.brown.edu/ns/yaps/1.0" xmlns:xi="http://www.w3.org/2001/XInclude">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title>Manuscript and Physical Document Encoding</title>
        <author xml:id="JHF">Julia Flanders</author>
      </titleStmt>
      <xi:include href="./boilerplate_publicationStmt.xml">
        <xi:fallback>
        <publicationStmt status="restricted">
          <note type="auto">WARNING: XInclude processing failed &#x2014; this file should not be copied or
            used (and is invalid) as a result.</note>
        </publicationStmt>
        </xi:fallback>
        </xi:include>
      <sourceDesc>
        <p>This is the source.</p>
      </sourceDesc>
    </fileDesc>
    <revisionDesc>
      <change when="2006-08-09" who="#SB">Added encoding to show
          <soCalled>source</soCalled> for <gi>addSpan</gi> &amp;
          <gi>delSpan</gi> examples.</change>
      <change when="2006-03-13" who="#SB">automatically converted
        from presentation.odd conforming to yaps.odd conforming
        using p2y.xslt and p2y.perl</change>
      <change when="2006-02-15" who="#JHF">Reordered slides, added
        intro section on philosophical issues</change>
    </revisionDesc>
  </teiHeader>
  <text>
    <presentation>
      <section>
        <head>Some philosophical issues</head>
        <slide>
          <p>How to represent both the material and logical nature
            of the text</p>
          <p>How to accommodate materiality in a representational
            system that gives primacy to information</p>
          <p>How to formalize the material facts of the text</p>
        </slide>
        <lectureNote>
          <p>Note that these are really two separate, though closely
            related issues: <list>
              <item>All documents are physical; manuscripts are just
                a specific category whose physicality is
                particularly inescapable</item>
              <item>Special problems of manuscripts include issues
                having to do with handwriting and with the fact that
                the format of the page is essentially unregulated</item>
              <item>Special problems of print documents include
                chiefly issues having to do with collation and
                pagination</item>
            </list></p>
          <p>There are some aspects of the encoding of physical
            document structures which are common to print and MS
            documents, so we treat them together</p>
          <p>Similarly, there are some issues having to do with our
            perception of the physical exemplar (esp. having to do
            with legibility and conjecture) that are common to both.</p>
          <p>By and large, the TEI is focused, methodologically, on
            the text as linguistic rather than material information:
            its encoding provisions for genre, language, content are
            rich and detailed, while its provisions for material
            information are fairly minimal. Textual materiality
            poses some interesting conceptual problems for markup
            systems: <list>
              <item>it is not so readily representable as a logical
                structure</item>
              <item>its descriptors are less well codified</item>
              <item>it has typically been theorized as secondary to
                the text's logical structure</item>
              <item>representing materiality in formal terms is
                challenging</item>
            </list></p>
          <p>We raise these here mostly as signposts to issues that
            may be interesting, rather than trying to give an
            adequate treatment here; there's a lot of interesting
            debate on this topic and if you're interested we can
            provide some pointers</p>
          <p>For now, we're just going to cover some practical
            encoding points.</p>
        </lectureNote>
      </section>
      <section>
        <head>Encoding the physical document</head>
        <slide>
          <p>TEI makes the logical structures primary. </p>
          <p>The physical document can also be represented,
            typically as milestones: <list>
              <item><gi>pb</gi>, <gi>cb</gi>, <gi>lb</gi></item>
              <item>also <gi>milestone</gi> for other kinds of
                boundaries: for instance, signatures</item>
            </list>
          </p>
          <p><gi>fw</gi> for aspects of the printed forme work: page
            numbers, running heads, catchwords, signatures, etc.</p>
        </slide>
        <lectureNote>
          <p>In TEI, the primary emphasis of the encoding is on the
            text stream (paragraphs, divisions, and so forth)<list>
              <item>the physical document can also be represented,
                but since physical document structures overlap with
                logical document structures, the physical structures
                are typically represented by milestone elements of
                various sorts, as we saw a moment ago:</item>
              <item><gi>pb</gi>,<gi>cb</gi>, <gi>lb</gi></item>
              <item>also <gi>milestone</gi> for other kinds of
                boundaries: for instance, signatures</item>
              <item><gi>fw</gi> for aspects of the printed forme
                work: page numbers, running heads, catchwords,
                signatures, etc.</item>
              <item>note where they go in relation to structural
                elements: where they belong</item>
            </list></p>
        </lectureNote>
      </section>
      <section>
        <head>Sample page break</head>
        <slide>
          <eg><![CDATA[<fw type="catchword">the</fw>
<pb n="33" facs="./images/pg0033.jpg"/>
<milestone unit="sig" n="E1r"/>   
<fw type="pageNum">XXXIII</fw>]]></eg>
        </slide>
      </section>
      <section>
        <head>An Alternate View of the Universe</head>
        <slide>
          <eg><![CDATA[<div type="signature" n="A">                    
  <div type="folio" n="A1">              
    <div type="page" n="A1r"><!-- [contents of page] --></div>                    
    <div type="page" n="A1v"><!-- [contents of page] --></div>                   
  </div>
  <div type="folio" n="A2">              
    <div type="page" n="A2r"><!-- [contents of page] --></div>                    
    <div type="page" n="A2v"><!-- [contents of page] --></div>                   
  </div>
</div>]]></eg>
        </slide>
        <lectureNote>
          <p>Note that the physical structure can also be
            represented as the primary structure, for instance using
              <tag>div type="page | signature | etc."</tag></p>
        </lectureNote>
      </section>
      <section>
        <head>Illegibility and unclarity</head>
        <slide>
          <list>
            <item><gi>unclear</gi> for places where you can read but
              aren’t certain, want to express uncertainty</item>
            <item><gi>supplied</gi> for places where you can’t read,
              but can supply a reading from another source, or from
              editorial intuition</item>
            <item><gi>gap</gi> for places where you can’t read and
              can’t surmise either</item>
            <item><gi>damage</gi> for places that have been damaged
              (e.g. torn, burned, eaten by bugs)</item>
          </list>
        </slide>
        <lectureNote>
          <list>
            <item><gi>unclear</gi> for places where you can read but
              aren’t certain, want to express uncertainty</item>
            <item><gi>supplied</gi> for places where you can’t read,
              but can supply a reading from another source, or from
              editorial intuition</item>
            <item><gi>gap</gi> for places where you can’t read and
              can’t surmise either</item>
            <item><gi>damage</gi> for places that have been damaged
              (e.g. torn, burned, eaten by bugs)</item>
            <item>these elements may nest to represent more complex
              situations: e.g. a damaged portion part of which is
              illegible, part of which can be deciphered with
              uncertainty, part of which can’t be read but can be
              surmised.</item>
          </list>
        </lectureNote>
      </section>
      <section>
        <head>Additions, deletions</head>
        <slide>
          <p>A few simple elements:</p>
          <list>
            <item><gi>add</gi>: for handwritten additions to a
              printed text, or insertions/revisions in a manuscript
              text</item>
            <item><gi>del</gi>: for hand deletions from a printed or
              manuscript text</item>
            <item><gi>addSpan</gi>: for additions which overlap
              element boundaries</item>
            <item><gi>delSpan</gi>: for deletions which overlap
              element boundaries</item>
          </list>
          <p>Note that for <gi>addSpan</gi> and <gi>delSpan</gi> you
            need to include the TEI module on Transcription of
            Primary Sources.</p>
        </slide>
        <lectureNote>
          <p><gi>add</gi> and <gi>del</gi>: these contain brief
            additions or deletions in the text; provide a way to
            indicate where located, whose handwriting</p>
          <p>
            <gi>addSpan</gi> and <gi>delSpan</gi>: these are empty
            elements which mark the start of an addition or
            deletion, and point (with the <att>spanTo</att>
            attribute) to the end of the span, which is marked by an
              <gi>anchor</gi> that carries an <att>xml:id</att>.</p>
          <p>Note that for spans and handwriting identification,
            need to declare Chapter 18 (<val>transcr</val>)</p>
        </lectureNote>
      </section>

      <section>
        <head>Example: <gi>add</gi> and <gi>del</gi></head>
        <slide>
          <p rend="CSS( padding: 6pt;             font-style: roman;             font-size: normal;             margin-left: 1em;             margin-right: 1ex;             margin-top: 6pt;             )">What a hot <hi rend="CSS(vertical-align: super;)">and
              muggy</hi> day! Such <hi rend="CSS(text-decoration: line-through;)">beastly</hi> weather!</p>
          <eg><![CDATA[What a hot <add hand="#jflanders" 
place="supralinear">and muggy</add> day! Such 
<del hand="#sbauman" rend="crossout">beastly</del> 
weather!]]></eg>
        </slide>
      </section>
      <section>
        <head>Example: <gi>addSpan</gi></head>
        <slide>
          <p rend="CSS( padding: 6pt;             font-style: roman;             font-size: normal;             margin-left: 1em;             margin-right: 1ex;             margin-top: 6pt;             )">Today I sent a letter to <hi rend="CSS(vertical-align: super;)">my dear friend John
            </hi>Smith, to whom I owe so much.</p>
          <eg><![CDATA[<p>Today I sent a letter to <addSpan spanTo="#a01"/>my dear 
friend <name>John <anchor xml:id="a01"/>Smith</name>, 
to whom I owe so much.
</p>]]></eg>
        </slide>
      </section>
      <section>
        <head>Example: <gi>delSpan</gi></head>
        <slide>
          <p rend="CSS( padding: 6pt;             font-style: roman;             font-size: normal;             margin-left: 1em;             margin-right: 1ex;             margin-top: 6pt;             )">Today I sent a letter to <hi rend="CSS(text-decoration: line-through;)">my dear
              friend John </hi>Smith, which I now regret.</p>
          <eg><![CDATA[<p>Today I sent a letter to <delSpan spanTo="#a01"/>my dear 
friend <name>John <anchor xml:id="a01"/>Smith</name>, 
which I now regret.
</p>]]></eg>
        </slide>
      </section>

      <section>
        <head>Handwriting Identification</head>
        <slide>
          <eg><![CDATA[<teiHeader>
  <profileDesc>
    <handNotes>
      <handNote xml:id="jflanders"
                script="florid"
	        medium="ink-brown"
	        scope="major"/>
      <handNote xml:id="sbauman"
                script="printing"
	        medium="crayon-blue"
	        scope="minor"/>
    </handNotes>     
  </profileDesc>
</teiHeader>]]></eg>
        </slide>
        <lectureNote>
          <p>To identify the person whose handwriting is present,
            and also to provide information about the handwriting
            (what style, what ink, etc.)<list>
              <item><gi>hand</gi> in the TEI header gives a place to
                locate this information</item>
            </list></p>
        </lectureNote>
      </section>
      <section>
        <head>Handwriting Identification: in the text</head>
        <slide>
          <eg><![CDATA[<p><handShift new="#jflanders"/>We are really suffering.
There has been no respite. <handShift new="#sbauman"/>
What a hot <add hand="#jflanders">and muggy</add> day!                        
Such <del hand="#sbauman">beastly</del> weather!
<handShift new="#sbauman" writing="shaky"/>If it goes
on much longer we shall have to emigrate to Alaska.</p>]]></eg>
        </slide>
        <lectureNote>
          <list>
            <item>the <att>hand</att> attribute on <gi>add</gi> and
                <gi>del</gi> points to the <gi>hand</gi> element</item>
            <item>the <gi>handShift</gi> element indicates a place
              where the handwriting starts or changes, and carries
              attributes that allow you to identify the new
              handwriting, the old handwriting (if applicable), and
              various other aspects (e.g. the ink, the writing
              style, etc.)</item>
          </list>
        </lectureNote>
      </section>
    </presentation>
  </text>
</TEI>
