<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="../../_utils/stylesheets/admin-tei.css"?>
<?oxygen RNGSchema="../../_utils/schema/handout.rnc" type="compact"?>
<!-- To generate XHTML for browsing use the stylesheet at
 ../../_utils/stylesheets/admin.xslt. The commands on golf would be
golf$ cd [to venue directory]
golf$ saxon.bash handouts/document_analysis.tei stylesheets/admin.xslt > handouts/document_analysis.xhtml 
 -->
<TEI xmlns="http://www.tei-c.org/ns/1.0" version="1.01" xml:lang="en-US">
 <teiHeader>
  <fileDesc>
   <titleStmt>
    <title>Document Analysis Worksheet</title>
    <author xml:id="jf">Julia Flanders</author>
    <author xml:id="sb">Syd Bauman</author>
   </titleStmt>
   <publicationStmt>
    <p>An unpublished document used for training.</p>
    <p>Available under GFDL, no invariant sections</p>
    <p>Copyleft 2007 Syd Bauman &amp; Julia Flanders</p>
   </publicationStmt>
   <sourceDesc>
    <p>No source</p>
   </sourceDesc>
  </fileDesc>
  <revisionDesc>
   <change when="2008-04-25" who="#jf">Expanded and reorganized</change>
  </revisionDesc>
 </teiHeader>
 <text>
  <body>
   <div type="section">
    <head>Project and Document Analysis</head>
    <p>Consider the sample document(s) you’ve chosen to focus on, and
     imagine the audience for your digital version of it.
     Then briefly answer the following questions.</p>
    
    <div type="subsection">
     <head>About the document(s)</head>
     <list type="ordered">
      <item>What genres do your documents contain?</item>
      <item>What are the significant chunks or subdivisions of your
       documents?</item>
      <item>List as many as possible of your documents’ significant features
       that you would want to encode, and provide a justification for encoding
       these features. Think about audience, likely uses of the information, and
       the balance of cost and benefit.</item>
      <item>What are the significant presentational features of your
       document (formatting, layout)? How much of this information do you consider important to
       capture? What is its effect on a reader's understanding of the text?</item></list>
     
    </div>
    <div type="subsection">
     <head>About the encoding</head>
     <list type="ordered">
      <item>What kinds of regularization of your document—if any—would be
       useful and appropriate? Would you regularize silently or preserve the
       original reading? Think about audience and probable use (including
       long-term use) of the data.</item>
      <item>What kinds of controlled vocabularies and terminology will be useful to constrain your encoding and help you maintain consistency? (For instance, values for the <att>type</att> attribute on <gi>div</gi>.)</item>
      <item>What kinds of contextual information are needed to make the documents intelligible to your audience? (For instance, glossaries, or biographical information about the authors.)</item>
     </list>
    </div>
     <div type="subsection">
       <head>About the project</head>
     <list type="ordered">
       <item>Who is the primary audience? Do they have special needs that can
         be supported through the encoding of the document?</item>
       <item>List the key features of the document that your audience will need</item>
       </div>
   </div>
  </body>
 </text>
</TEI>
