<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="../stylesheets/yaps-tei.css"?>
<?oxygen RNGSchema="../schema/yaps.rnc" type="compact"?>
<?oxygen SCHSchema="../schema/yaps.sch"?>
<TEI version="5.0"
  xmlns="http://www.wwp.brown.edu/ns/yaps/1.0"
  xmlns:xi="http://www.w3.org/2001/XInclude">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <!-- title>Overlap and Multiple Hierarchies</title -->
        <title>
          <hi rend="CSS(color:red;)">O V E R L</hi>
          <hi>&#xA0;</hi>
          <hi rend="CSS(color:blue;)">H</hi>
          <hi>&#xA0;</hi>
          <hi rend="CSS(color:purple;)">A P</hi>
          <hi>&#xA0;</hi>
          <hi rend="CSS(color:blue;)">P E N S</hi>
        </title>
        <author>Syd Bauman</author>
      </titleStmt>
      <xi:include href="./boilerplate_publicationStmt.xml">
        <xi:fallback>
          <publicationStmt status="restricted">
            <note type="auto">WARNING: XInclude processing failed
              &#x2014; this file should not be copied or used (and
              is invalid) as a result.</note>
          </publicationStmt>
        </xi:fallback>
      </xi:include>
      <sourceDesc>
        <p>None.</p>
      </sourceDesc>
    </fileDesc>
    <revisionDesc>
      <change when="2006-03-13" who="#SB">automatically converted from
        presentation.odd conforming to yaps.odd conforming using
        p2y.xslt and p2y.perl</change>
    </revisionDesc>
  </teiHeader>
  <text>
    <presentation>
      <section>
        <head>The Problem</head>
        <slide>
          <list>
            <item>XML elements may not overlap</item>
            <item>document structures often do</item>
          </list>
          <list>
            <item>potential problem for all XML encoding</item>
            <item>particularly common when encoding extant older texts <list>
                <item>encoder does not control structure</item>
              </list></item>
            <item>generally less common when creating documents</item>
          </list>
        </slide>
      </section>
      <section>
        <head>The Size of The Problem</head>
        <slide>
          <list>
            <item>there exist dozens of <soCalled>solutions</soCalled></item>
            <item>up to half a dozen papers a year</item>
            <item>an entire day-long pre-conference
	    <ref target="http://www.extrememarkup.com/extreme/overlap/index.html">event</ref> in 2007</item>
            <item>an <emph>entire
	    <ref target="http://www.tei-c.org/release/doc/tei-p5-doc/html/NH.html">chapter</ref></emph>
	    of the TEI Guidelines</item>
            <item>frustration has caused many to investigate non-XML
              encodings</item>
          </list>
        </slide>
      </section>
      <section>
        <head>Solutions</head>
        <slide>
          <list>
            <item>milestones</item>
            <item>fragmentation</item>
            <item>HORSE</item>
          </list>
          <p>There are roughly half a dozen others which I will not be
            addressing</p>
        </slide>
      </section>
      <section>
        <head>Milestones</head>
        <slide>
          <list>
            <item>work perfectly for an alternate hierarchy which is
              completely flat and tessellates <list>
                <item>page breaks</item>
                <item>signatures</item>
                <item>reels of a movie</item>
              </list></item>
            <item>can work well for similar constructs <list>
                <item>line breaks</item>
                <item>column breaks</item>
              </list>
            </item>
          </list>
        </slide>
      </section>
      <section>
        <head>Fragmentation</head>
        <slide>
          <p>Take what is logically a single content object <list>
              <item>encode it as multiple separate XML elements</item>
              <item>indicate that each XML element is only a
                  <soCalled>partial</soCalled> element</item>
              <item>optionally have each partial element indicate
                which is the next piece of the whole content
              object</item>
            </list></p>
          <p>TEI provides 2 methods for doing this</p>
        </slide>
      </section>
      <section>
        <head>Fragmentation: aggregation by hunting/gathering with
            <att>part</att></head>
        <slide>
          <p>The <att>part</att> attribute can be used for
              <soCalled>serial</soCalled> cases: <list>
              <item>all fragments are in sequential order</item>
              <item>no intervening occurrence of same element type
                that is <emph>not</emph> part of the aggregate element</item>
              <item>e.g., good for <gi>l</gi> but sometimes not
                <gi>q</gi></item>
              <item>available on <gi>l</gi>, <gi>lg</gi>,
                <gi>div</gi>, <gi>seg</gi>, <gi>ab</gi>, <gi>s</gi>,
                  <gi>cl</gi>, <gi>phr</gi>, <gi>w</gi>, <gi>m</gi>,
                  <gi>c</gi></item>
            </list>
          </p>
        </slide>
      </section>
      <section>
        <head>Fragmentation: aggregation by pointing with
          <att>next</att> and <att>prev</att></head>
        <slide>
          <p>The <att>next</att> and <att>prev</att> attributes can be
            used for any cases: <list>
              <item>available on <emph>every</emph> element when
                additional tagset for segmentation &amp; alignment
                is used</item>
              <item>each fragment must bear either <att>next</att> or
                  <att>prev</att></item>
              <item>probably better if each fragment bears both</item>
            </list>
          </p>
          <eg>&lt;l&gt;I found at a conference C M Sperberg-McQueen&lt;/l&gt;
&lt;l&gt;Sang &lt;q&gt;closing, keynoting, I'm speaking&lt;/q&gt;&lt;/l&gt;
&lt;l&gt;And I said to him, &lt;q xml:id="q1" next="#q2"&gt;Superman, have you not seen&lt;/q&gt;,&lt;/l&gt;
&lt;l&gt;&lt;q xml:id="q2" prev="#q1"&gt;The embarrassment havoc I'm wreaking?&lt;/q&gt;&lt;/l&gt;</eg>
        </slide>
      </section>
      <section>
        <head>HORSE</head>
        <slide>
          <list>
            <item>not currently available in any system I know of</item>
            <item>special case of typed segment-boundary delimiter</item>
            <item>rather than giving an element content, use an empty
              element for both start and end</item>
            <item>special attributes point from the start to the end,
              and from the end to the start </item>
          </list>
          <eg>&lt;l&gt;I found at a conference C M Sperberg-McQueen&lt;/l&gt;
&lt;l&gt;Sang &lt;q&gt;closing, keynoting, I'm speaking&lt;/q&gt;&lt;/l&gt;
&lt;l&gt;And I said to him, &lt;q sID="q2"/&gt;Superman, have you not seen,&lt;/l&gt;
&lt;l&gt;The embarrassment havoc I'm wreaking?&lt;q eID="q2"/&gt;&lt;/l&gt;</eg>
        </slide>
      </section>
    </presentation>
  </text>
</TEI>
