<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="../stylesheets/yaps-tei.css"?>
<?oxygen RNGSchema="../schema/yaps.rnc" type="compact"?>
<?oxygen SCHSchema="../schema/yaps.sch"?>
<TEI xmlns="http://www.wwp.brown.edu/ns/yaps/1.0" xmlns:xi="http://www.w3.org/2001/XInclude">
  <teiHeader>
    <fileDesc>
      <titleStmt>
	<title>
	  <hi rend="CSS(color:red;)">O V E R L</hi>
	  <hi> </hi>
	  <hi rend="CSS(color:blue;)">H</hi>
	  <hi> </hi>
	  <hi rend="CSS(color:purple;)">A P</hi>
	  <hi> </hi>
	  <hi rend="CSS(color:blue;)">P E N S</hi>
	</title>
	<author xml:id="SB">Syd Bauman</author>
	<author xml:id="JF">Julia Flanders</author>
      </titleStmt>
      <xi:include href="./boilerplate_publicationStmt.xml">
        <xi:fallback>
          <publicationStmt status="restricted">
            <note type="auto">WARNING: XInclude processing failed &#x2014; this file should not be copied or
            used (and is invalid) as a result.</note>
          </publicationStmt>
        </xi:fallback>
      </xi:include>
      <sourceDesc>
	<p>None.</p>
      </sourceDesc>
    </fileDesc>
    <revisionDesc>
      <change when="2008-01-10" who="#JF">Revised version for seminar audience</change>
      <change when="2006-03-13" who="#SB">automatically converted
      from presentation.odd conforming to yaps.odd conforming
      using p2y.xslt and p2y.perl</change>
    </revisionDesc>
  </teiHeader>
  <text>
    <presentation>
      <section>
        <head>Examples of overlapping structures</head>
        <slide>

          <list>
            <item>Physical features like pages, columns, and lines, and textual things like paragraphs or names</item>
           <item>Verse lines and quotations, names, other phrasal elements</item>
           <item>Verse lines and linguistic features</item>
           <item>Dramatic speeches and verse lines</item>
           <item>Handwritten additions or deletions and other structures</item>
           <item>Typographical features and linguistic features</item>
          </list>
        </slide>
       <lectureNote>
        
         <p>This is a potential problem for all XML encoding</p>
         <p>Particularly common when encoding extant older
          texts: <list>
           <item>the encoder does not control structure</item>
          </list></p>
         <p>generally less common when creating documents</p>
        <p>There are a variety of approaches to representing these kinds of overlapping structures in TEI:
        <list>
         <item>sometimes you just have one option</item>
         <item>sometimes there are a few different ways to approach the problem</item>
         <item>we'll look at the various possibilities; some we've already seen without realizing it</item>
        </list>
        </p>
        
       </lectureNote>
      </section>
      <section>
        <head>Empty elements used as milestones</head>
        <slide>

          <eg><![CDATA[<pb n="249"/>
<milestone unit="sig" n="R5r"/>
<lb/>digested. Its long trunk, as seen slanting down from
<lb/>out of the building across the wharf and into the ship,
<lb/>is a mere wooden pipe; but this pipe is divided within.
<lb/>It has two departments; and as the grain-bearing 
<lb/>troughs pass up the one on a pliable band, they pass
<lb/>empty down the other. The system therefore is that
<lb/>of an ordinary dredging machine; only that corn, and
<lb/>not mud is taken away, and that the buckets or 
<lb/>troughs are hidden from sight. Below, within the
<lb/>stomach of the poor bark, three or four labourers are
<lb/>at work, helping to feed the elevator. They shovel
<lb/>the corn up towards its maw, so that at every swallow
<lb/>he should take in all that he can hold...
<lb/>...The transit of the bushels 
<lb/>of corn from the larger vessel to the smaller will have
<lb/>taken less than a minute, and the cost of that transit
<lb/>will have been—a farthing.</p>
<pb n="250"/>
<milestone unit="sig" n="R5v"/>]]></eg>
        </slide>
       <lectureNote>
        <p>Simplest option: instead of encoding the feature by enclosing it in an element, instead just mark its boundaries with empty elements</p>
        <p>The most common case of this is with milestone elements:
        <list>
         <item>Elements that divide the text into segments according to some system: pages, columns, lines</item>
         <item>works perfectly for an information structure which
         is completely flat and divides up the whole text into parts: page breaks, signatures, reels of a movie</item>
         <item>i.e. there's nothing in the text that isn't on some page; there's nothing in a paragraph that's not on some line</item>
         <item>in these cases, you mark the boundaries between segments, so each boundary element marks the end of one segment and the start of the next.</item>
         
        </list>
        </p>
       </lectureNote>
      </section>
     <section>
      <head>Empty elements used as endpoints</head>
      <slide>
       <eg><![CDATA[<addSpan spanTo="#addEnd01"/>
<p>An elevator is as ugly a monster as has been yet
<lb/>produced. In uncouthness of form it outdoes those
<lb/>obsolete old brutes who used to roam about the semi-
<lb/>acqueous world, and live a most uncomfortable life
<lb/>with their great hungering stomachs and huge un-
<lb/>satisfied maws. The elevator itself consists of a big
<lb/>moveable trunk,—moveable as is that of an elephant,
<lb/>but not pliable, and less graceful even than an ele-
<lb/>phant's. This is attached to a huge granary or barn...</p>
<anchor xml:id="addEnd01"/>

]]></eg>
       <eg><![CDATA[<p>...for the elevator is an amphibious insti-
<lb/>tution, and flourishes only on the banks of navigable
<lb/>waters. When its head is ensconced within its box,
<lb/>and the beast of prey is thus nearly hidden within
<lb/>the building, the unsuspicious vessel is brought up
<lb/>within reach of the creature's trunk, and down it
<lb/>comes, like a mosquito's proboscis, right through the
<lb/>deck, in at the open aperture of the hold, and so into
<lb/>the very vitals and bowels of the ship. <delSpan spanTo="#spanEnd01">When there,
<lb/>it goes to work upon its food with a greed and
<lb/>avidity that is disgusting to a beholder of any taste
<lb/>or imagination.</p>
<p>And now I must explain the anatomical
<lb/>arrangement by which the elevator still
<lb/>devours and continues to devour, till the corn within
<lb/>its reach has all been swallowed, masticated, 
      and digested.<anchor xml:id="spanEnd01"/></p>]]></eg>
      </slide>
      <lectureNote> <p>But in addition there are other cases where it's handy to be able to mark the ends of an element at arbitrary places, rather than having to fit the element neatly into the document hierarchy
      <list>
       <item>classic example is additions and deletions: authors often add large chunks of stuff, or delete parts of things that don't match the textual structure</item>
      </list>
      </p>
          <p>For these, as we saw briefly yesterday, we can mark them much more effectively by putting an empty element at each end, sort of like marking the boundaries of an impromptu soccer field by putting your shoes at each end</p>
       <p>Then create a link between the two, using the pointing system we talked about yesterday...</p>
       </lectureNote>
      </section>
      <section>
        <head>Fragmentation</head>
        <slide>
<eg><![CDATA[<sp>
   <speaker>Leo.</speaker>
      <l part="F">Go on, go on:</l>
      <l>Thou canst not speake too much, I have deserv'd</l>
      <l part="I">All tongues to talk their bittrest.</l>
</sp>
<sp>
   <speaker>Lord.</speaker>
      <l part="F">Say no more;</l>
      <l>How ere the business goes, you have made fault</l>
      <l part="I">I'th boldnesse of your speech.</l>
</sp>
<sp>
   <speaker>Pauline.</speaker>
      <l part="F">I am sorry for't;</l>
      <l>All faults I make, when I shall come to know them</l>
      <!-- ... -->
</sp>]]>
</eg>
         
        </slide>
       <lectureNote>
        <p>Take what is logically a single content object <list>
         <item>encode it as multiple separate XML elements</item>
         <item>indicate that each XML element is only a
          <soCalled>partial</soCalled> element</item>
         <item>optionally have each partial element indicate
          which is the next piece of the whole content
          object</item>
        </list></p>
        <p>TEI provides 2 methods for doing this; the first is the part= attribute...
        </p>
        <p>The <att>part</att> attribute can be used for
         <soCalled>serial</soCalled> cases: <list>
          <item>all fragments are in sequential order</item>
          <item>no intervening occurrence of same element type
           that is <emph>not</emph> part of the aggregate
           element</item>
          <item>e.g., good for <gi>l</gi> but sometimes not
           <gi>q</gi></item>
          <item>available on <gi>l</gi>, <gi>lg</gi>,
           <gi>div</gi>, <gi>seg</gi>, <gi>ab</gi>, <gi>s</gi>,
           <gi>cl</gi>, <gi>phr</gi>, <gi>w</gi>, <gi>m</gi>,
           <gi>c</gi></item>
         </list>
        </p>       </lectureNote>
      </section>
      <section>
        <head>Another approach to fragmentation</head>
        <slide>
         <eg><![CDATA[Mortal, she said, "I'm sent to you,
Then hold my precepts fast;
Remember earth's best joys are few,
And can't for ever last."]]></eg>
         <eg><![CDATA[<lg type="stanza">
  <l>Mortal, she said, <said xml:id="s01" next="#s01">I'm sent to you,</said></l>
  <l><said xml:id="s02" next="#s03" prev="#s01">Then hold my precepts fast;</said></l>
  <l><said xml:id="s03" prev="#s02" next="#s04">Remember earth's best joys are few,</said></l>
  <l><said xml:id="s04" prev="#s03">And can't for ever last.</said></l>
</lg>]]>
</eg>
         
        </slide>
       <lectureNote>
        <p>The <att>next</att> and <att>prev</att> attributes can
         be used for any cases: <list>
          <item>available on <emph>every</emph> element when
           additional tagset for segmentation &amp;
           alignment is used</item>
          <item>each fragment must bear either <att>next</att>
           or <att>prev</att></item>
          <item>probably better if each fragment bears
           both</item>
         </list>
        </p>
        
       </lectureNote>
      </section>
    </presentation>
  </text>
</TEI>
