<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="../stylesheets/yaps-tei.css"?>
<?oxygen RNGSchema="../schema/yaps.rnc" type="compact"?>
<?oxygen SCHSchema="../schema/yaps.sch"?>
<TEI xmlns="http://www.wwp.brown.edu/ns/yaps/1.0" xmlns:xi="http://www.w3.org/2001/XInclude">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>Digital Research Materials</title>
    <author xml:id="JF">Julia Flanders</author>
   </titleStmt>
      <editionStmt>
        <edition>Texas A &amp; M University</edition>
      </editionStmt>
      <publicationStmt>
        <distributor>Women Writers Project (via website)</distributor>
        <address>
          <addrLine>wwp@Brown.edu</addrLine>
        </address>
        <date when="2009-04-17"/>
        <availability status="restricted">
          <p>Copyright 2007 Syd Bauman, Julia Flanders, and Brown WWP</p>
	  <p>This TEI-encoded XML file is available under the terms of
	  the <ref target="http://creativecommons.org/licenses/by-sa/3.0/">Creative
	  Commons Attribution-ShareAlike 3.0 (Unported)</ref>
	  license.</p>
        </availability>
        <pubPlace>Providence, RI  USA</pubPlace>
      </publicationStmt>
   <sourceDesc>
    <p>This is the source.</p>
   </sourceDesc>
  </fileDesc>
  <revisionDesc>
   <change when="2007-09-18" who="#JF">updated for UCSB</change>
   <change when="2007-03-14" who="#JF">made new version based on old slides for
    use with NEH seminars: removed TEI materials and expanded discussion of
    markup</change>
   <change when="2006-03-13" who="#SB">automatically converted from
    presentation.odd conforming to yaps.odd conforming using p2y.xslt and
    p2y.perl</change>
   <change when="2006-03-02" who="#JF">removed SGML &amp; P5 -specific
    slides</change>
   <change when="2006-02-13" who="#JF">Added slides on What is XML and more on
    P5. </change>
   <change when="2006-03-05" who="#JF">Added more detail on TEI</change>
  </revisionDesc>
 </teiHeader>
 <text>
  <presentation>
   <section>
    <head>What is distinctive about digital research materials?</head>
    <slide>
     <p>Some comparanda...</p>
     <list>
      <item>
       <ref target="../gfx/gray_concordance.png">Concordances</ref>
      </item>
      <item>
       <ref target="../gfx/image_frame_donne_dots.html">Visualizations</ref>
      </item>
      <item>
       <ref target="../gfx/image_frame_nvs.html">Textual apparatus</ref>
      </item>
      <item><ref target="../gfx/bacon_facsimile.png">Facsimiles</ref> and
            <ref target="../gfx/image_frame_askew.html">page images</ref></item>
      <item>
       <ref target="http://www.natcorp.ox.ac.uk/">Data sets</ref>
      </item>
      <item><ref target="http://www.imdb.com/">Metadata</ref></item>
     </list>
    </slide>
    <lectureNote>
     <p>What I want to do in this first session is situate text encoding within
      a larger frame of reference: as a specific way of representing scholarly
      information <list>
       <item>partly as a way of understanding text encoding (and other forms
        of digital research tools) </item>
       <item>and partly as a way of understanding how we represent and use
        scholarly information, research materials, more generally </item>
       <item>because one crucial question being asked in the digital
        humanities domain is essentially "what changes?"; "does anything
        change?" </item>
       <item>i.e. are we radically altering how humanities research is done?
        or the kinds of arguments we make? </item>
       <item>put another way, is it all worth the effort?</item>
      </list>
     </p>
     <p> In order to do this, I think it will be helpful if we can lay out in
      front of ourselves the kinds of research materials we're familiar with,
      and talk about how we're used to thinking with them: what do they do for
      us, informationally? how do they present the source materials to us? </p>
     <p> So: what kinds of research sources have you used in the past year? <list>
       <item>primary sources? in what media? originals, facsimiles, reprints,
        editions, microfilm reproductions, digital transcriptions (what kind?)</item>
       <item>secondary sources? </item>
       <item>derived data? in what form?</item>
       <item>metadata (library catalogues, finding aids, etc.) </item>
      </list>
     </p>
     <p>How would we characterize the types of information found in these
      sources? <list>
       <item> sources that show us visual evidence</item>
       <item> sources that give us derived analysis (quantitative, qualitative)</item>
       <item> sources that give us a description, in language</item>
       <item> sources that give us a description, in formal terms</item>
       <item> sources that give us an argument, in language</item>
       <item> sources that give us an argument, in other forms?</item>
      </list>
     </p>
     <p> How do we evaluate these various types of sources? how are they
      successful or unsuccessful? <list>
       <item> Visual evidence: level of granularity, fineness of detail,
        accuracy (of color, etc.)</item>
       <item> Derived analysis: the intellectual basis of the analysis: the
        accuracy and relevance of its disciplinary assumptions, the usefulness
        of the insight it gives us</item>
       <item> Prose description: the richness of detail, the persuasiveness
        (i.e. how it convinces us of the author's trustworthiness and usefulness
        as a witness), also its comparability to other descriptions (i.e. using
        descriptive terms consistently)</item>
       <item> Formal description: consistency, appropriate granularization of
        the data </item>
      </list>
     </p>
     <p> Is any of this inflected by discipline? <list>
       <item>what kinds of sources do historians use?</item>
       <item>what kinds of sources do literary scholars use?</item>
       <item>linguists?</item>
       <item>ontologists?</item>
       <item> --other groups?</item>
      </list></p>
    </lectureNote>
   </section>
   <section>
    <head>What is text encoding? Where does it fit in?</head>
    <slide>
     <list>
      <item><quote>Thick description</quote>?</item>
      <item>Scholarly editorial analysis?</item>
      <item>Interpretive commentary?</item>
      <item>Data modelling?</item>
     </list>
    </slide>
    <lectureNote>
     <p>So we can try to situate the activity of text encoding in this
      intellectual space: <list>
       <item>From the viewpoint of the humanities scholar, text encoding looks
        as if it's coming over from computer science: as an activity that takes
        place on computers and requires some technical knowledge (of software,
        of data standards, of encoding languages)</item>
       <item>in fact, there are some other lines of connection that make it
        clearer why it should be of interest to us</item>
       <item>anthropological: the text encoder is an observer and
        documenter of the textual world, and the encoding he/she produces has
        (at least potentially) something of the quality of a <quote>thick
         description</quote>: a contextualized, interpretative account of the
        details of the textual landscape.</item>
       <item>editorial: the text encoder is also very much like a critical editor, creating
        an analytical representation of the text which provides systematic,
        expert knowledge about it</item>
       <item>interpretive, critical: the encoder can also act as an interpretative commentator, using
        markup to add context, layers of interpretive information</item>
      </list>
     </p>
     <p>Perhaps most importantly, text encoding is a modelling activity: a
      process of creating an analytical representation of an object (e.g. a
      document) or an information system</p>
    </lectureNote>
   </section>
   <section>
    <head>Sampling and modelling</head>
    <slide>
     <table>
      <row>
       <cell>
        <figure>
         <graphic height="400px" url="./gfx/caterpillar_8.jpg"/>
        </figure>
       </cell>
       <cell>
        <figure>
         <graphic height="400px" url="./gfx/caterpillar_100.jpg"/>
        </figure>
       </cell>
      </row>
     </table>
    </slide>
    <lectureNote>
     <p>It may be useful to talk more about the concept of data modelling at this point...</p>
     <list>
      <item>increasingly common concept in digital humanities, lots of discussion about what it means</item>
     </list>
     <p>I'm using the term <term>modelling</term> here as distinct from a concept
      like <term>sampling</term>: <list>
       <item><term>Sampling</term> takes slices or samples of the world:
        visually (like a digital camera), sonically (like a digital sound
        recorder), or in some other way</item>
       <item>the classic example is the bitmap image: a matrix of colored dots
        that represent an image at some resolution: high or low</item>
       <item><term>Modelling</term> creates an analytic representation of the
        world: as a function, a formalization, a mathematical representation, a
        conceptual model, some kind of surrogate</item>
      </list></p>
     <p>Sampling produces what I would tentatively call a
      <term>depiction</term>: a version that aspires to <emph>be</emph> the
      source: <list>
       <item>Measured in terms of fidelity</item>
       <item>Example: a high-resolution photograph (higher resolution = better
        depiction)</item>
      </list></p>
     <p>Modelling produces a version that aspires to <emph>yield
      information</emph> about the source for a specific purpose: <list>
       <item>Measured in terms of functionality against the purpose in question</item>
       <item>A topographical map: functional for understanding geographical
        features</item>
       <item>A road map: functional for navigating in a car</item>
       <item>A satellite map: functional for viewing weather systems</item>
      </list>
     </p>

    </lectureNote>
   </section>

   <section>
    <head>Varieties of digital formats</head>
    <slide>
     <table>
      <row>
       <cell>
        <figure>
         <graphic height="200px" url="../gfx/ms_original.jpg"/>
        </figure>
       </cell>
      </row>
      <row>
       <cell>
        <figure>
         <graphic height="200px" url="../gfx/ms_rendered.png"/>
        </figure>
       </cell>
      </row>
      <row>
       <cell>
        <eg><![CDATA[<lg>
<head>After <del>an</del><add>the <del>unsolv'd</del></add> argument</head>
<l><del>The</del><add><del>Coming in,</del> A group of</add> little children, and their
   <lb/>ways and chatter, flow in <del>upon me</del></l>
<l>Like <add>welcome</add> rippling water o'er my
   <lb>heated <add>nerves and</add> flesh.</l>
</lg>]]></eg>
       </cell>
      </row>
     </table>
    </slide>
    <lectureNote>

     <p>What do the examples here show us? what does each version let you see?
      how does each representation convey information about the original
      artifact? (Pause for discussion)</p>
    </lectureNote>
   </section>

   <section>
    <head><soCalled>Text encoding</soCalled> vs. <soCalled>information
     modelling</soCalled></head>
    <slide>
     <table>
      <row>
       <cell>
        <figure>
         <graphic height="400px" url="./gfx/text_encoding.jpg"/>
        </figure>
       </cell>
      </row>
      <row>
       <cell>
        <figure>
         <graphic height="400px" url="./gfx/information_modelling.jpg"/>
        </figure>
       </cell>
      </row>
      <row>
       <cell>
        <figure>
         <graphic height="400px" url="./gfx/data_modelling.jpg"/>
        </figure>
       </cell>
      </row>
     </table>
    </slide>
    <lectureNote>
     <p>Note that <soCalled>text encoding</soCalled> may not even be the best or
      most evocative term for all of this: <list>
       <item>it describes the markup of text streams</item>
       <item>but for things like modelling contextual information (e.g.
        personography example) the emphasis is on the structures created, not on
        the text itself</item>
       <item>somewhere in between is the terrain of text which is itself
        regularly structured</item>
       <item>when we think of what all of these kinds of markup are really
        doing, we might better describe it as <soCalled>information
        modelling</soCalled> or <soCalled>data modelling</soCalled></item>
      </list>
     </p>
     <p>And note as well that text markup is not the only way to model data, or
      text data: <list>
       <item>databases have been used for a long time for this purpose</item>
       <item>with a greater emphasis on structure and less on the nuances of
        text</item>
       <item>not fundamentally different, just a different emphasis: on what is
        consistent vs. on what is variable</item>
       <item>databases tend to cease modelling at the point where the text
        becomes highly variable: within paragraphs, within lines of poetry: the
        markup of individual words</item>
       <item>though now with XMl databases, we're starting to see approaches
        that are really hybrids</item>
      </list>
     </p>
    </lectureNote>
   </section>

   <section>
    <head>What do humanities scholars need?</head>
    <slide>
     <p>What is the distinctive function of modelling for humanities
      scholarship?</p>
     <p>What is the distinctive function of depiction for humanities
      scholarship?</p>
     <p>When do we need each one? What role do they play in our research
      practices?</p>
     <p>What resources do we currently use (in print or digitally) that
      emphasize one or the other?</p>
    </slide>
   </section>




  </presentation>
 </text>
</TEI>
