11 Representation of Primary Sources
Contents
This chapter defines a module intended for use in therepresentation of primary sources, such as manuscripts or otherwritten materials. Section 11.1 Digital Facsimiles provides elementsfor the encoding of digital facsimiles or images of such materials,while the remainder of the chapter discusses ways of encoding detailedtranscriptions of such materials. It is expected that this module willalso be useful in the preparation of critical editions, but the moduledefined here is distinct from that defined in chapter 12 校本, and may be used independently of it. Detailed metadatarelating to primary sources of any kind may be recorded using theelements defined by the manuscript description module discussed inchapter 10 Manuscript Description, but again the present module may be usedindependently if such data is not required.
It should be noted that, as elsewhere in these Guidelines, thischapter places more emphasis on the problems of representing thetextual components of a document than on those relating to thedescription of the document's physical characteristics such as thecarrier medium or physical construction. These aspects, of particularimportance in codicology and the bibliographic study of incunables,are touched on in the chapter on Manuscript Description (10 Manuscript Description) and also form the subject of ongoing work in the TEIPhysical 参考文献 workgroup.
Although this chapter discusses manuscript materials morefrequently than other forms of written text, most of therecommendations presented are equally applicable mutatismutandis in the encoding of printed matter or indeed anyform of written source, including monumental inscriptions. Similarly,where in the following descriptions terms such as‘scribe’, ‘author’,‘editor’, ‘annotator’ or‘corrector’ are used, these may be re-interpretedin terms more appropriate to the medium being transcribed. In printedmaterial, for example, the ‘compositor’ plays arole analogous to the ‘scribe’, while in anauthorial manuscript, the author and the scribe are the same person.
11.1 Digital FacsimilesTEI: Digital Facsimiles¶
These Guidelines are mostly concerned with the preparation ofdigital texts, in which a pre-existing text is transcribed orotherwise converted into character form, and marked up inXML. However, it is also very common practice to make a different formof ‘digital text’, which is instead composed ofdigital images of the original source, typically one per page, orother written surface. We call such a resource a digitalfacsimile. A digital facsimile may, in the simplest case, justconsist of a collection of images, with some metadata to identify themand the source materials portrayed. It may sometimes contain avariety of images of the same source pages, for example of differentresolutions, or of different kinds. Such a collection may form part ofany kind of document, for example a commentary of a codicological orpaeleographic nature, where there is a need to align explanatory textwith image data. And it may also be complemented bya transcribed or encoded version of the original source, which may belinked to the page images. In this section we present elementsdesigned to support these various possibilities and discuss theassociated mechanisms provided by these Guidelines.
- att.global.facs elements which can be associated with an image or a surface within a facsimile element.
facs (facsimile) points directly to an image, or to a part of a facsimile element which corresponds with this element.
<teiHeader>
<!--...-->
</teiHeader>
<text>
<pb facs="page1.png"/>
<!-- text contained on page 1 is encoded here -->
<pb facs="page2.png"/>
<!-- text contained on page 2 is encoded here -->
</text>
</TEI>
The recommended approach to encoding facsimiles is instead to usethe facs attribute in conjunction with the elementsfacsimile, surface, and zone, which arealso provided by this module. These elements make it possible toaccommodate multiple images of each page, as well as to recordarbitrary planar coordinates of textual elements on any kind ofwritten surface and to link such elements with digital facsimileimages of them. Typical applications include the provision of fulltext search in ‘digital facsimile editions’, andways of annotating graphics, for example so as to identify individualsappearing in a group portraits and link them to data about the personrepresented.
- facsimile contains a representation of some written source in the form ofa set of images rather than as transcribed or encoded text.
- surface defines a written surface in terms of a rectangularcoordinate space, optionally grouping one or more graphic representations ofthat space, and rectangular zones of interestwithin it.
start points to an element which encodes the starting position of the text corresponding to theinscribed part of the surface. - zone defines a rectangular area contained within a surfaceelement.
- a TEI Header and a text element
- a TEI Header and a facsimile element
- a TEI Header, a facsimile element, and a text element
Like the text element, a facsimile element mayalso contain an optional front or back element, usedin the same way as described in sections 4.5 前付け and4.7 後付.
<graphic url="page1.png"/>
<graphic url="page2.png"/>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile>
<graphic url="page1.png"/>
<surface>
<graphic url="page2-highRes.png"/>
<graphic url="page2-lowRes.png"/>
</surface>
<graphic url="page3.png"/>
<graphic url="page4.png"/>
</facsimile>
The surface element provides a way of indicating that thetwo images of page2 represent the same physical surface within thesource material. A surface might be a sheet of paper orparchment, a face of a monument, a billboard, a membrane of a scroll,or indeed any two-dimensional surface, of any size.
- att.coordinated elements which can be positioned within a two dimensionalcoordinate system.
ulx gives the x coordinate value for the upper left corner of arectangular space. uly gives the y coordinate value for the upper left corner of arectangular space. lrx gives the x coordinate value for the lower right corner of arectangular space. lry gives the y coordinate value for the lower right corner of arectangular space.
The same coordinate space is used for a surface and forall of its child elements. 34 It may be most convenientto derive a coordinate space from a digital image of the surface inquestion such that each pixel in the image corresponds with a wholenumber of units (typically 1) in the coordinate space. In other casesit may be more convenient to use units such as millimetres; in neithercase is any specific mapping to the physical dimensions of the objectrepresented implied.
Each surface can contain one or more zoneelements, each of which represents a rectangular region orbounding box defined in terms of the same coordinatespace as that of its parent surface element. This provides aunit of analysis which may be used to define any rectangular region ofinterest, such as a detail or illustration, or some part of thesurface which is to be aligned with a particular text element. Theatt.coordinated attributes listed aboveare also used to supply the coordinates of a zone.
As we have seen, a surface will usually correspond with the wholeof a written surface. A zone, by contrast, defines any arbitraryrectangular area of interest using the same coordinate system. Itmight be bigger or smaller than its parent surface, or might overlapits boundaries. The only constraint is that it must be defined usingthe same coordinate system.
When an image of some kind is supplied within either a zone or asurface, the implication is that the whole of the image representsthe zone or surface containing it. In the simple case therefore, wemight imagine a surface defining a page, within which there is agraphic representing the whole of that page, and a number of zonesdefining parts of the page, each with its own graphic. If one of thoseimages actually represents an area larger than the page (for exampleto include a binding or the surface of adesk on which the page rests), then it might be enclosed by a zonewith coordinates smaller or larger than those of the parentsurface.
Note that this mechanism does not provide any way of addressing anon-rectangular area, nor of coping with distortions introduced byperspective or parallax; if this is needed, the more powerfulmechanisms provided by the Standard Vector Graphics (SVG) languageshould be used to define an overlay, as further discussed in 16.4.3 A Three-way Alignment.
<surface
ulx="50"
uly="20"
lrx="400"
lry="280">
<!-- ... -->
</surface>
</facsimile>
<surface
ulx="50"
uly="20"
lrx="400"
lry="280">
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
</facsimile>
If desired, the binaryObject element described in 3.9 図等の非テキスト内容 (or any other element from themodel.graphicLike class) may be used instead of agraphic element.
<surface
ulx="50"
uly="20"
lrx="210"
lry="280">
<desc>left hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
<surface
ulx="240"
uly="25"
lrx="400"
lry="280">
<desc>right hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
</surface>
</facsimile>
<surface
ulx="50"
uly="20"
lrx="210"
lry="280">
<desc>Left hand page</desc>
<zone
ulx="0"
uly="0"
lrx="500"
lry="321">
<graphic
url="http://upload.wikimedia.org/wikipedia/commons/5/50/Handschrift.karlsruhe.blb.jpg"/>
</zone>
<zone
ulx="90"
uly="40"
lrx="200"
lry="225">
<desc>Written part of left hand page</desc>
</zone>
</surface>
</facsimile>
<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile>
<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
<zone
ulx="25"
uly="25"
lrx="180"
lry="60">
<desc>contains the title</desc>
</zone>
<zone
ulx="28"
uly="75"
lrx="175"
lry="178"/>
<!-- contains the paragraph in italics -->
<zone
ulx="105"
uly="76"
lrx="175"
lry="160"/>
<!-- contains the figure -->
<zone
ulx="45"
uly="125"
lrx="60"
lry="130"/>
<!-- contains the word "pendans" -->
</surface>
</facsimile>
ulx="105"
uly="76"
lrx="175"
lry="160">
<graphic url="Bovelles49r-detail.png"/>
</zone>
<surface
ulx="0"
uly="0"
lrx="200"
lry="300">
<zone
ulx="0"
uly="0"
lrx="200"
lry="300">
<graphic url="Bovelles-49r.png"/>
</zone>
<zone
ulx="105"
uly="76"
lrx="175"
lry="160">
<graphic url="Bovelles49r-detail.png"/>
</zone>
<zone
xml:id="B49rHead"
ulx="25"
uly="25"
lrx="180"
lry="60"/>
<!-- contains the title -->
<zone
xml:id="B49rPara2"
ulx="28"
uly="75"
lrx="175"
lry="178"/>
<!-- contains the paragraph in italics -->
<zone
xml:id="B49rFig1"
ulx="105"
uly="76"
lrx="175"
lry="160"/>
<!-- contains the figure -->
<zone
xml:id="B49rW457"
ulx="45"
uly="125"
lrx="60"
lry="130"/>
<!-- contains the word "pendans" -->
</surface>
</facsimile>
<fw>De Geometrie 49</fw>
<head facs="#B49rHead">DU SON ET ACCORD DES CLOCHES ET <lb/> des alleures des chevaulx,
chariotz &amp; charges, des fontaines:&amp; <lb/> encyclie du monde,
&amp; de la dimension du corps humain.</head>
<head>Chapitre septiesme</head>
<div n="1">
<p>Le son & accord des cloches pendans en ung mesme <lb/> axe, est
faict en contraires parties.</p>
<p rend="it" facs="#B49rPara2">LEs cloches ont quasi fi<lb/>gures de rondes
pyra<lb/>mides imperfaictes &amp; <lb/> irregulieres: &amp; leur
accord se <lb/> fait par reigle geometrique. Com<lb/>me si les deux
cloches C &amp; D <lb/> sont <w facs="#B49rW457">pendans</w> à ung
mesme axe <lb/> ou essieu A B: je dis que leur ac<lb/>cord se fera en
co<ex>n</ex>traires parties<lb/> co<ex>m</ex>me voyez icy
figuré. Car qua<ex>n</ex>d <lb/> lune sera en hault, laultre
declinera embas. Aultrement si elles decli<lb/>nent toutes deux
ensembles en une mesme partie, elles seront discord, <lb/> &amp; sera
leur sonnerie mal plaisante à oyr.<figure facs="#B49rFig1">
<graphic url="Bovelles49r-detail.png"/>
</figure>
</p>
</div>
Further discussion of the encodingchoices made in the above transcription is provided in the remainderof this chapter.
<surface start="#PB49R">
<graphic url="Bovelles-49r.png"/>
</surface>
</facsimile>
<text>
<!-- ... -->
<pb xml:id="PB49R"/>
<fw>De Geometrie 49</fw>
<!-- ... -->
</text>
11.2 Scope of TranscriptionsTEI: Scope of Transcriptions¶
- first, methods of recording editorial or other alterations tothe text, such as expansion of abbreviations, corrections, conjectures,etc. (section 11.3 Altered, Corrected, and Erroneous Texts)
- then, methods of describing important extra-linguistic phenomenain the source: unusual spaces, lines, page and line breaks, change ofmanuscript hand, etc. (section 11.4 Hands and Responsibility)
- finally, a method of recording material such as running heads,catch-words, and the like (section 11.7 Headers, Footers, and Similar Matter)
These recommendations are not intended to meet everytranscriptional circumstance likely to be faced by any scholar.Rather, they should be regarded as a base which can be elaborated if necessary by different scholars in different disciplines.
As a rule, all elements which may be used in the course of atranscription of a single witness may also be used in a criticalapparatus, i.e. within the elements proposed in chapter 12 校本.This can generally be achieved by nesting aparticular reading containing tagged elements from a particular witnesswithin the rdg element in an app structure.
Just as a critical apparatus may contain transcriptional elementswithin its record of variant readings in various witnesses, one mayrecord variant readings in an individual witness by use of the apparatusmechanisms app and rdg. This is discussed insection 12.3 転記中の校合要素.
11.3 Altered, Corrected, and Erroneous TextsTEI: Altered, Corrected, and Erroneous Texts¶
In the detailed transcription of any source, it may prove necessaryto record various types of actual or potential alteration of the text:expansion of abbreviations, correction of the text (either by author,scribe, or later hand, or by previous or current editors or scholars),addition, deletion, or substitution of material, and the like. Thesections below describe how such phenomena may be encoded using eitherelements defined in the core module (defined in chapter 3 コアモジュール) or specialized elements available only when the moduledescribed in this chapter is available.
11.3.1 Core elements for Transcriptional WorkTEI: Core elements for Transcriptional Work¶
- abbr (abbreviation) contains an abbreviation of any sort.
- add (addition) contains letters, words, or phrases inserted in the text by anauthor, scribe, annotator, or corrector.
- choice groups a number of alternative encodings for the same point in a text.
- corr (correction) contains the correct form of a passage apparently erroneous in the copy text.
- del (deletion) contains a letter, word, or passage deleted, marked as deleted,or otherwise indicated as superfluous or spurious in the copy text by anauthor, scribe, annotator, or corrector.
- expan (expansion) contains the expansion of an abbreviation.
- gap indicates a point where material has been omitted in atranscription, whether for editorial reasons described in the TEIheader, as part of sampling practice, or because the material isillegible or inaudible.
- sic (latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.
- att.editLike provides attributes describing the nature of a encoded scholarly intervention or interpretation of any kind.
cert (certainty) signifies the degree of certainty associated with the intervention or interpretation. resp (responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber. source contains a list of one or more pointers indicating the sources which support the given reading. - att.typed provides attributes which can be used to classify or subclassify elements in any way.
type characterizes the element in some sense, using any convenient classification scheme or typology. subtype provides a sub-categorization of the element, if needed
The following sections describe how the core elements just named maybe used in the transcription of primary source materials.
11.3.2 Abbreviation and ExpansionTEI: Abbreviation and Expansion¶
The writing of manuscripts by hand lends itself to the use ofabbreviation to shorten scribal labour. Commonly occurring letters,groups of letters, words, or even whole phrases, may be represented bysignificant marks. This phenomenon of manuscript abbreviation is sowidespread and so various that no taxonomy of it is here attempted.Instead, methods are shown which allow abbreviations to be encoded usingthe core elements mentioned above.
A manuscript abbreviation may be viewed in two ways. One maytranscribe it as a particular sequence of letters or marks upon thepage: thus, a ‘p with a bar through the descender’, a‘superscript hook’, a ‘macron’. One may also interpret theabbreviation in terms of the letter or letters it is seen as standingfor: thus, ‘per’, ‘re’, ‘n’. Both of these views aresupported by these Guidelines.
In many cases the glyph found in the manuscript source also existsin the Unicode character set: for example the common Latin brevigraph⁊, standing for et and often known asthe ‘Tironian et’ can be directly represented inany XML document as the Unicode character with code pointU+204A (see further Character References and vi.i Language identification). In cases where it does not, these Guidelinesrecommend use of the g element provided by the gaiji module described in chapter 5 Representation of Non-standard Characters and Glyphs. This module allows the encoder great flexibility bothin processing and in documenting non-standard characters or glyphs,including the ability to provide detailed documentation and images forthem.
ladder
<abbr>
<g ref="#per">per</g>sone
</abbr>
...
<expan>persone</expan> ...
<abbr>eu<g ref="#er">er</g>y</abbr>
<expan>euery</expan>
</choice>
- ex (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation.
- am (abbreviation marker) contains a sequence of letters or signs present in an abbreviation which are omitted or replaced in the expanded form of the abbreviation.
<g ref="#er"/>
</am>y</abbr>
<abbr>
<am>
<g ref="#per"/>
</am>sone
</abbr> ...
<expan>
<ex>per</ex>sone
</expan> ...
<am>
<g ref="#er"/>
</am>
<ex>er</ex>
</choice>y
<choice>
<am>
<g ref="#per"/>
</am>
<ex>per</ex>
</choice>sone ...
As implied in the preceding discussion, making decisions aboutwhich of these various methods of representing abbreviation to usewill form an important part of an encoder's practice. As a rule, theabbr and am elements should be preferred where it is wished tosignify that the content of the element is an abbreviation, withoutnecessarily indicating what the abbreviation may stand for. Theex and expan elements should be used where it is wished to signify thatthe content of the element is not present in the source but has beensupplied by the transcriber, without necessarily indicating theabbreviation used in the original. The decision as to which course ofaction is appropriate may vary from abbreviation to abbreviation;there is no requirement that the one system be used throughout atranscription, although doing so will generally simplifyprocessing. The choice is likely to be a matter of editorial policy.If the highest priority is to transcribe the text literatim, whileindicating the presence of abbreviations, the choice will be to useabbr or am throughout. If the highest priority is to present areading transcription, while indicating that some letters or words arenot actually present in the original, the choice will be to useex or expanthroughout.
plural ending (-es, -is, -ys>) but the singular <hi rend="it">good</hi> was used with the meaning <q>property</q>,
<q>wealth</q>, at this time (v. examples quoted in OED, sb. Good,
C. 7, b, c, d and 8 spec.)</note>
good<ex resp="#mp" cert="high">e</ex> I was welbeloued
<resp>Editorial emendations</resp>
<name>Malcom Parkes</name>
</respStmt>
<choice>
<sic>good<abbr>ɽ</abbr>
</sic>
<expan resp="#mp" cert="high">good<ex>e</ex>
</expan>
</choice>
I was welbeloued
If more than one expansion for the same abbreviation is to berecorded, multiple notes may be supplied. It may also be appropriateto use the markup for critical apparatus; an example is given insection 12.3 転記中の校合要素.
11.3.3 Correction and ConjectureTEI: Correction and Conjecture¶
has been modified by James to begin ‘But One must...’, without the inital capital O having been reduced tolowercase. This non-standard orthography could be recorded thus:Onemust have lived longer with this system, to appreciate itsadvantages.
must have lived ...
have lived ...
<choice>
<sic>One</sic>
<corr>one</corr>
</choice> must have lived
...
et <choice>
<sic>angues</sic>
<corr>augens</corr>
</choice>.
Note that the corr element is used to provide a correctedform which is not present in the source; in the case of acorrection made in the source itself, whether scribal, authorial, orby some other hand, the add, del,and subst elements described in 11.3.4 Additions and Deletions should beused.
As with expan and abbr, thechoice as to whether to record simply that there is an apparent error,or simply that a correction has been applied, or to record bothpossible readings within a choice element isleft to the encoder. The decision is likely to be a matter of editorial policy,which might be applied consistently throughout or decided case by case.If the highest priority is to present an uncorrected transcription whilenoting perceived errors in the original, the choice will typicallybe to use only sic throughout. If the highest priority is topresent a reading transcription, while indicating that perceived errorsin the original have been corrected, the choice will be to useonly corr throughout.
Further information may be attached to instances of these elements bythe note element and resp and certattributes. Instances of these elements may also be classifiedaccording to any convenient typology using the typeattribute.
Were membres maad, of generacioun
And of so parfit wis a
<choice xml:id="corr117">
<sic>wight</sic>
<corr>wright</corr>
</choice>
ywroght?
<!-- ... -->
<note target="#corr117">This emendation of the Hengwrt copy text,
based on a Latin source and on the reading of three late
and usually unauthoritative manuscripts, was proposed
by E. Talbot Donaldson in <bibl>
<title>Speculum</title> 40 (1965)
626–33.</bibl>
</note>
<!-- somewhere in the header ... --><name xml:id="ETD">E Talbot Donaldson</name>
<!-- ... -->
And of so parfit wis a
<choice>
<sic>wight</sic>
<corr resp="#ETD" cert="medium">wright</corr>
</choice>
ywroght?
<sic>mens</sic>
<corr>iners</corr>
</choice> que nutu dei
gesta sunt ... unde esset uiriliter
<choice xml:id="sic-2">
<corr>uegetata</corr>
<sic>negata</sic>
</choice>
graphically what the scribe should be copying but which does not make
sense in the context.</note>
<choice>
<sic>mens</sic>
<corr type="graphSubs">iners</corr>
</choice> que nutu dei
gesta sunt ... unde esset uiriliter
<choice>
<corr type="graphSubs">uegetata</corr>
<sic>negata</sic>
</choice>
<choice>
<sic>mens</sic>
<corr type="graphSubs">iners</corr>
<corr type="reversal">inres</corr>
</choice> que nutu dei
gesta sunt ...
<p>The following codes are used to categorise corrections identified
in this transcription:
<list type="gloss">
<label>graphSubs</label>
<item>Substitution of a more familiar word which resembles
graphically what the scribe should be copying but which does not make
sense in the context.</item>
<!-- ... -->
</list>
</p>
</correction>
For a given project, it may well be desirable to limit the possiblevalues for the type or subtype attributes automatically. This is easilydone but requires customization of the TEI system using techniquesdescribed in 23.2 Personalization and Customization, in particular 23.2.1.4 Modification of Attribute and AttributeValue リスト, which should be consulted for further informationon this topic.
<choice>
<sic>wight</sic>
<corr resp="#mp" source="#Gg">wyf</corr>
</choice>
ywroght?
<msIdentifier>
<settlement>Cambridge</settlement>
<repository>University Library</repository>
<idno>Gg.1. 27</idno>
</msIdentifier>
<!-- further description of the manuscript here -->
</msDesc>
parfit wis a
<app>
<rdg wit="#Hg">wight</rdg>
<rdg wit="#Ln #Ry2 #Ld">wright</rdg>
<rdg wit="#Gg">wyf</rdg>
</app>
parfit wis a
<app>
<rdg wit="#Hg">wight</rdg>
<rdg wit="#Ln #Ry2 #Ld">
<corr resp="#ETD">wright</corr>
</rdg>
<rdg wit="#Gg">
<corr resp="mp">wyf</corr>
</rdg>
</app>
Like the resp attribute, the cert attributemay be used with both corr and rdg elements. Whenused on the rdg element, these attributes indicate confidencein and responsibility for identifying the reading within the sourcesspecified; when used on the corr element they indicateconfidence in and responsibility for the use of the reading to correctthe base text. If no other source is indicated (either by thesource attribute, or by the wit attribute of aparent rdg), the reading supplied within a corr hasbeen provided by the person indicated by the respattribute.
If it is desired to express aspects of certainty and responsibilityfor some other aspect of the use of these elements, then themechanisms discussed in chapter 21 確信度・責任 may be founduseful. See also 11.4.2 Hand, Responsibility, and Certainty Attributes for further discussion of theissues of certainty and responsibility in the context oftranscription.
11.3.4 Additions and DeletionsTEI: Additions and Deletions¶
- add (addition) contains letters, words, or phrases inserted in the text by anauthor, scribe, annotator, or corrector.
- addSpan/ (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also add).
- del (deletion) contains a letter, word, or passage deleted, marked as deleted,or otherwise indicated as superfluous or spurious in the copy text by anauthor, scribe, annotator, or corrector.
- delSpan/ (deleted span of text) marks the beginning of a longer sequence of text deleted,marked as deleted, or otherwise signaled as superfluous or spurious by anauthor, scribe, annotator, or corrector.
- att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it.
spanTo indicates the end of a span initiated by the element bearing this attribute.
- att.transcriptional provides attributes specific to elements encoding authorial or scribal intervention in a text when transcribing manuscript or similar sources.
seq (sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred. status indicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present. hand signifies the hand of the agent which made the intervention.
sight. Others — and here is one of them — <add hand="#mb">do
ever</add> improve by recognition ....
<handNote xml:id="mb">Max Beerbohm holograph</handNote>
...
<handNote xml:id="dhl">D H Lawrence holograph</handNote>
If deletions are classified systematically, the typeattribute may be useful to indicate the classification; when they areclassified by the manner in which they were effected, or by theirappearance, however, this will lead to a certain arbitrariness indeciding whether to use the type or the rendattribute to hold the information. In general, it is recommended thatthe rend attribute be used for description of theappearance or method of deletion, and that the typeattribute be reserved for higher level or more abstractclassifications.
precedents <del hand="RG">in the</del>: current,
obsolete, <add hand="RG" place="supralinear">cant,</add>
cataphretic and nonce-words are all included.
<del>for an abridgement</del>
</add>in
explanation...
are all included. <del hand="RG">It is</del>
<subst>
<add>T</add>
<del>t</del>
</subst>he expressed
The add and del elements defined in the core modulesuffice only for the description of additions and deletions which fitwithin the structure of the text being transcribed, that is, whicheach deletion or addition is completely contained by the structuralelement (paragraph, line, division) within which it occurs. Where thisis not the case, for example because an individual addition or deletioninvolves severaldistinct structural subdivisions, such as poems or prose items, orotherwisecrosses a structural boundary in the text being encoded, specialtreatment is needed. The addSpan anddelSpan elements are provided by this module for thatpurpose. (For a general discussion of the issue see further 20 Non-hierarchical Structures).
<!-- ... -->
<body>
<div>
<!-- text here -->
</div>
<addSpan n="added gathering" hand="#heol" spanTo="#p025"/>
<div>
<!-- text of first added poem here -->
</div>
<div>
<!-- text of second added poem here -->
</div>
<div>
<!-- text of third added poem here -->
</div>
<div>
<!-- text of fourth added poem here -->
</div>
<anchor xml:id="p025"/>
<div>
<!-- more text here -->
</div>
</body>
<delSpan spanTo="#EPdelEnd" resp="#EP" rend="strikethrough"/>
<l>To where Saint Mary Woolnoth kept the time,</l>
<l>With a dead sound on the final stroke of nine.</l>
<anchor xml:id="EPdelEnd"/>
<l>There I saw one I knew, and stopped him, crying "Stetson!</l>...
<delSpan rend="verticalStrike" spanTo="#delend01"/>
Tis moonlight <del>upon</del>
<add>over</add> Oman's sky
</l>
<l>Her isles of pearl look lovelily<anchor xml:id="delend01"/>
</l>
The text deleted must be at least partially legible, in order forthe encoder to be able to transcribe it. If it is not legible at all,the gap element should be used to signal that the text wasnot transcribed, because it could not be; the reasonattribute can give the cause of the omission from the transcription as‘deletion, illegible’. If the deletedtext is partially legible, the unclear element described insection 11.5.1 Damage, Illegibility, and Supplied Text may be used to indicate areas oftext which cannot be read with confidence. See further section 11.3.7 Text Omitted from or Supplied in the Transcription and section 11.5.1 Damage, Illegibility, and Supplied Text.
11.3.5 SubstitutionsTEI: Substitutions¶
Substitution of one word or phrase for another is perhaps the mostcommon of all phenomena requiring special treatment in transcriptionof primary textual sources. It may be simply one word overwritinganother, or deletion of one word and its replacement by anotherwritten above it by the same hand at the one time; the deletion andreplacement may be done by different hands at different times; theremay be a long chain of substitutions on the one stretch of text, withuncertainty as to the order of substitution and as to which of manypossible readings should be preferred.
- subst (substitution) groups one or more deletions with one or more additions when the combination is to be regarded as a single intervention in the text.
<delSpan rend="verticalStrike" spanTo="#delend02"/>
Tis moonlight <subst>
<del>upon</del>
<add>over</add>
</subst> Oman's sky
</l>
<l>Her isles of pearl look lovelily<anchor xml:id="delend02"/>
</l>
with <subst>
<del seq="1">this</del>
<del seq="2">
<add seq="1">such
a</add>
</del>
<add seq="2">a</add>
</subst> system, to appreciate its
advantages.
<l>
<subst>
<del>Helping the worst amongst us</del>
<add>Dragging the
worst amongt us</add>
</subst>, who'd no boots
</l>
<l>But limped on, blood-shod. All went lame;
<subst>
<del status="shortEnd">half-</del>
<add>all</add>
</subst> blind;</l>
<l>Drunk with fatigue ; deaf even to the hoots</l>
<l>Of tired, outstripped <del>fif</del> five-nines that dropped behind.</l>
- the false start fif in the last line is simply marked asa deletion;
- the other two authorial corrections are marked assubstitutions, each combining a deletion and an addition.
- the authorial slip (amongt foramongst) is retained without comment.
<app>
<rdg varSeq="1">
<del>this</del>
</rdg>
<rdg varSeq="2">
<del>
<add>such a</add>
</del>
</rdg>
<rdg varSeq="3">
<add>a</add>
</rdg>
</app>
system, to appreciate its advantages.
11.3.6 Cancellation of Deletions and Other MarkingsTEI: Cancellation of Deletions and Other Markings¶
- restore indicates restoration of text to an earlier state bycancellation of an editorial or authorial marking or instruction.
This element bears the same attributes as the other transcriptionalelements. These may be used to supply further information such asthe hand in which the restoration is carried out, the type ofrestoration, and the personrsponsible for identifying the restoration as such, in the same way aselsewhere.
<restore hand="#dhl" type="marginalStetNote">
<del>my</del>
</restore>
body
Another feature commonly encountered in manuscripts is the use ofcircles, lines, or arrows to indicate transposition of material fromone point in the text to another. No specific markup for thisphenomenon is proposed at this time. Such cases are most simplyencoded as additions at the point of insertion and deletions at thepoint of encirclement or other marking.
11.3.7 Text Omitted from or Supplied in the TranscriptionTEI: Text Omitted from or Supplied in the Transcription¶
- gap indicates a point where material has been omitted in atranscription, whether for editorial reasons described in the TEIheader, as part of sampling practice, or because the material isillegible or inaudible.
reason gives the reason for omission. Sample values include sampling, illegible, inaudible, irrelevant, cancelled, cancelled and illegible. hand in the case of text omitted from the transcription because ofdeliberate deletion by an identifiable hand, signifies the hand whichmade the deletion. agent In the case of text omitted because of damage, categorizes the cause of the damage, if it can be identified. - supplied signifies text supplied by the transcriber or editor for any reason, typically because the original cannot be read because of physical damage or loss to the original.
reason indicates why the text has had to be supplied.
As noted above, the gap element should only be used wheretext has not been transcribed; if partially legible text has beentranscribed, one of the elements damage and unclearshould be used instead. These elements are described in section 11.5.1 Damage, Illegibility, and Supplied Text.
am dr Sr yr <supplied reason="illegible" resp="#msm" source="#AmCo">very humble Servt</supplied> Sydney Smith
11.4 Hands and ResponsibilityTEI: Hands and Responsibility¶
This section discusses in more detail the representation of aspectsof responsibility perceived or to be recorded for the writing of aprimary source. These include points at which one scribe takes overfrom another, or at which ink, pen, or other characteristics of thewriting change. A discussion of the usage of the hand,resp, and cert attributes is also included.
11.4.1 Document HandsTEI: Document Hands¶
For many text-critical purposes it is important to signal theperson responsible (the hand) for the writing of a wholedocument, a stretch of text within a document, or a particular featurewithin the document. A hand, as the name suggests, need notnecessarily be identified with a particular known (or unknown) scribeor author; it may simply indicate a particular combination of writingfeatures recognized within one or more documents. The examples givenabove of the use of the hand attribute with coding ofadditions and deletions illustrate this.
- handNote (note on hand) describes a particular styleor hand distinguished within a manuscript.
A handNote element, with an identifier given by itsxml:id attribute, may appear in either of two places in theTEI Header, depending on which modules are included in a schema. Whenthe transcr module defined by the presentchapter is used, the element hand注釈 is available, withinthe profileDesc element of the Header, to hold one or morehandNote elements. When the msdescription module defined in chapter 10 Manuscript Description is included, the handDesc element described in10.7.2 Writing, Decoration, and Other Notations also becomes available as part of a structuredmanuscript description. The encoder may choose to placehandNote elements identifying individual hands in eitherlocation without affecting their accessibility since the element isalways addressed by means of its xml:id attribute. ThehandDesc element may be more appropriate when a fullcataloguing of each manuscript is required; the hand注釈element if only a brief characterization of each hand is needed. Itis also possible to use the two elements together if, for example, thehandDesc element contains a single summary describing all thehands discursively, while the hand注釈 element givesspecific details of each. The choice will depend on individualencoders' priorities.
- handShift/ marks the beginning of a sequence of text written in a newhand, or the beginning of a scribal stint.
- att.handFeatures provides attributes describing aspects of the hand in which a manuscript is written.
scribe gives a standard name or other identifier for the scribebelieved to be responsible for this hand. script characterizes the particular script or writing style used bythis hand, for example secretary, copperplate, Chancery, Italian, etc.. medium describes the tint or type of ink, e.g. brown, or otherwriting medium, e.g. pencil scope specifies how widely this hand is used in the manuscript.
A single hand may employ different writing styles and inks within adocument, or may change character. For example, the writing stylemight shift from ‘anglicana’ to ‘secretary’, or the ink fromblue to brown, or the character of the hand may change. Simplechanges of this kind may be indicated by assigning a new value to theappropriate attribute within the handShift element. It isfor the encoder to decide whether a change in these properties of thewriting style is so marked as to require treatment as a distincthand.
Where such a change is to be identified, the newattribute is used to indicate the hand applicable to the materialfollowing the handShift. This will ordinarily, but notnecessarily, be the order in which the material was originallywritten.
As might be expected, one hand may employ different renditionswithin the one writing style, for example medieval scribes oftenindicate a structural division by emboldening all the words within aline. These should be indicated by use of the rendattribute on an element, in the same manner as underlining,emboldening, font shifts, etc. are represented in transcription of a printed text,rather than by introducing a new handShift element.
<handShift medium="greenish-ink"/>
<l>And if the cattes skynne be slyk <handShift medium="black-ink"/> and gaye</l>
<handNote xml:id="h1" script="copperplate" medium="brown-ink">Carefully written with regular descenders</handNote>
<handNote xml:id="h2" script="print" medium="pencil">Unschooled scrawl</handNote>
</hand注釈>
may be once more introduced and Established in this
Parish according to the Rules and Ceremonies of the
Church of England and as under a good Consciencious
and sober Curate there would and ought to be
<handShift new="#h2" resp="#das"/>
and for that purpose the parishioners pray
11.4.2 Hand, Responsibility, and Certainty AttributesTEI: Hand, Responsibility, and Certainty Attributes¶
<choice>
<sic>One</sic>
<corr resp="#FB">one</corr>
</choice> must have
lived ...
<!-- elsewhere -->
<respStmt xml:id="FB">
<resp>editorial changes</resp>
<name>Fredson Bowers</name>
</respStmt>
<respStmt xml:id="WJ">
<resp>authorial changes</resp>
<name>William James</name>
</respStmt>
The resp attribute, by contrast, indicate the person responsible for deciding to apply the elementcarrying it to this part of the text, and hence has a slightlydifferent interpretation. In the case of theadd element, for example, the resp attribute willindicate the responsibility for identifying that the addition isindeed an addition, and also (if the hand attribute issupplied) to which hand it should be attributed. In this case, Bowers is credited with identifying the hand as that of WilliamJames. In the case of the corr element, the respattribute indicates who is responsible for supplying theintellectual content of the correction reported in the transcription:here, Bowers' correction of ‘One’ to ‘one’. In the case of adeletion, the resp attribute will similarly indicate whobears responsibility for identifying or categorising the deletion itself,while other attributes (hand most obviously) attributeresponsibility for the deletion itself.
As these examples show, the field of application of theresp attributes varies from element to element. In somecases, it applies to the content of the element (corr, ex, and supplied); in others it applies to the value of a particularattribute (sic, abbr, del, etc.). In allcases where both the resp and cert attributes aredefined for a particular element, the two attributes refer to the sameaspect of the markup. The one indicates who is intellectuallyresponsible for some item of information, the other indicates the degreeof confidence in the information. Thus, for acorrection, the resp attribute signifies the personresponsible for supplying the correction, while the certattribute signifies the degree of editorial confidence felt in thatcorrection. For the expansion of an abbreviation, theresp attribute signifies the person responsible for supplyingthe expansion and the cert attribute signifies the degree ofeditorial confidence felt in the expansion.
This close definition of the use of the resp andcert attributes with each element is intended to provide forthe most frequent circumstances in which encoders might wish to makeunambiguous statements regarding the responsibility for and certainty ofaspects of their encoding. The resp and certattributes, as so defined, give a convenient mechanism for this.However, there will be cases where it is desired to state responsibilityfor and certainty concerning other aspects of the encoding. Forexample, one may wish in the case of an apparent addition to state theresponsibility for the use of the add element, rather than theresponsibility for identifying the hand of the addition. It may also bethat one editor may make an electronic transcription of another editor'sprinted transcription of a manuscript text — here, one will wish toassign layers of responsibility, so as to allow the reader to determineexactly what in the final transcription was theresponsibility of each editor. In these complex cases of dividededitorial responsibility for and certainty concerning the content,attributes, and application of a particular element, the more generalmechanisms for representing certainty and responsibility described inchapter 21 確信度・責任 should be used.
<sic>wight</sic>
<corr resp="#ETD" cert="medium">wright</corr>
</choice>
<corr xml:id="c117">wright</corr>
<sic>wight</sic>
</choice>
<certainty target="#c117" locus="transcribedContent" degree="0.7"/>
<respons target="#c117" locus="transcribedContent" resp="#ETD"/>
The above discussion supposes that in each case an encoder is able tospecify exactly what it is that one wishes to state responsibility forand certainty about. Situations may arise when an encoder wishes tomake a statement concerning certainty or responsibility but is unable orunwilling to specify so precisely the domain of the certainty orresponsibility. In these cases, the note element may be usedwith the type attribute set to ‘cert’ or ‘resp’and the content of the note giving a prose description of the state ofaffairs.
11.5 Damage and ConjectureTEI: Damage and Conjecture¶
The carrier medium of a primary source may often sustain physicaldamage which makes parts of it hard or impossible to read. In thissection we discuss elements which may be used to represent suchsituations and give recommendations about how these should be used inconjunction with the other related elements introduced previously inthis chapter.
11.5.1 Damage, Illegibility, and Supplied TextTEI: Damage, Illegibility, and Supplied Text¶
- damage contains an area of damage to the text witness.
- damageSpan/ (damaged span of text) marks the beginning of a longer sequence of text which is damaged in some way but still legible.
- att.damaged provides attributes describing the nature of any physical damage affecting a reading.
extent indicates approximately how much text is in the damaged area,in letters, minims, inches, or any appropriate unit, where thiscannot be deduced from the contents of the tag. hand In the case of damage (deliberate defacement, etc.) assignableto an identifiable hand, signifies the hand responsible for thedamage. agent categorizes the cause of the damage, if it can be identified. degree Signifies the degree of damage according to a convenient scale.The damage tag with the degree attribute shouldonly be used where the text may be read with some confidence; textsupplied from other sources should be tagged as supplied. group assigns an arbitrary number to each stretch of damage regarded as forming part of the same physical phenomenon.
- att.spanning provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it.
spanTo indicates the end of a span initiated by the element bearing this attribute.
The following examples all refer to the recto of folio 5 of the uniquemanuscript of the Elder Edda. Here, themanuscript of Vóluspá has been damagedthrough irregular rubbing so that letters in various places are obscuredand in some cases cannot be read at all.
<!-- ... -->
<pb n="5r"/>
<damageSpan agent="rubbing" extent="whole leaf" spanTo="#damageEnd"/>
</p>
<p> .... </p>
<p> ....
<pb n="5v" xml:id="damageEnd"/>
</p>
<l>Moves <damage agent="water" group="1">on: nor all your</damage> Piety nor Wit</l>
<l>
<damageSpan agent="water" group="1" spanTo="#washOut"/>Shall lure it back to cancel half a Line,
</l>
<l>Nor all your Tears wash <anchor xml:id="washOut"/> out a Word of it</l>
A more general solution to this problem is provided by thejoin element discussed in 16.7 Aggregation which may beused to link together arbitrary elements of any kind in thetranscription. Where, as here, several phenomena of illegibility and conjectureall result from the one cause, an area of damage to the text — rubbingat various points — which is not continuous in the text, affecting itat irregular points, the join element may beused to indicate which tagged features are part of the same physicalphenomenon.
<unclear>aga</unclear>
</damage> yndisniota
neþan <gap
reason="illegible"
agent="rubbing"
extent="4"
unit="letters"/>
11.5.2 Use of the gap, del, damage, unclear, and supplied Elements in CombinationTEI: Use of the gap, del, damage, unclear, and supplied Elements in Combination¶
- where the text has been rendered completely illegible bydeletion or damage and no text is supplied by the editor in place ofwhat is lost: place an empty gap element at the point ofdeletion or damage. Use the reason attribute to state thecause (damage, deletion, etc.) of the loss of text.
- where the text has been rendered completely illegible bydeletion or damage and text is supplied by the editor in place ofwhat is lost: surround the text supplied at the point of deletion ordamage with the supplied element. Use the reasonattribute to state the cause (damage, deletion, etc.) of the loss oftext leading to the need to supply the text.
- where the text has been rendered partly illegible by deletionor damage so that the text can be read but without perfectconfidence: transcribe the text and surround it with theunclear element. Use the reason attribute to statethe cause (damage, deletion, etc.) of the uncertainty in transcriptionand the cert attribute to indicate the confidence in thetranscription.
- where there is deletion or damage but the text can be read withperfect confidence: transcribe the text and surround it with thedel element (for deletion) or the damage element (fordamage). Use appropriate attribute values to indicate the cause andtype of deletion or damage. Observe that the degreeattribute on the damage element permits the encoding to showthat a letter, word, or phrase is not perfectly preserved, though itmay be read with confidence.
- where there is an area of deletion or damage and parts of thetext within that area can be read with perfect confidence, otherparts with less confidence, other parts not at all: in transcription,surround the whole area with the del element (for deletion; orthe delSpan element where it crosses a structural boundary); orthe damage element (for damage). Text within the damaged areawhich can be read with perfect confidence needs no further tagging.Text within the damaged area which cannot be read with perfectconfidence may be surrounded with the unclear element. Placeswithin the damaged area where the text has been rendered completelyillegible and no text is supplied by the editor may be marked withthe gap element. For each element, one may use appropriateattribute values to indicate the cause and type of deletion or damageand the certainty of the reading.
- if one add element (with identifier ADD1)contains another (with identifier ADD2), thenthe addition ADD1 was firstmade to the text, and later a second addition (ADD2) wasmade within that added text:This is the text
<add xml:id="ADD1">with some added
<add xml:id="ADD2">(interlinear!)</add>
material</add>
as written. - if one del element contains another, and theseq attribute does not indicate otherwise, it should beassumed that the inner deletion was made before the enclosing one. In the following example,the words redundant was deleted before a secondsecond deletion removed the entire passage:<del>This sentence contains
some <del>redundant</del> unnecessary
verbiage.</del> - if a del element contains an add element, the normalinterpretation will be that an addition was made within a passagewhich was laterdeleted in its entirety:<del>This sentence was deleted
<add>originally</add> from the text.</del> - if an add element contains a del element, thenormal interpretation will be that adeletion was made from a passage which had earlier been added:<add>This sentence was added
<del>eventually</del> to the text.</add>
11.6 Aspects of LayoutTEI: Aspects of Layout¶
Finally in this chapter we present elements which may be used tocapture aspects of the layout of material on a page where this isconsidered important. Methods for recording page breaks, column breaks, and line breaks in thesource are described in section 3.10 参照システム.
- » 11.6.2 Lines
- Home | 目次
11.6.1 SpaceTEI: Space¶
- space/ indicates the location of a significant space in the copy text.
resp (responsible party) indicates the individual responsible for identifying and measuringthe space.
As <space quantity="7" unit="char"/> han within her oratoryes
As <supplied reason="space" resp="#ES" source="Hg">preestes</supplied>
han within her oratoryes
- « 11.6.1 Space
- Home | 目次
11.6.2 LinesTEI: Lines¶
<del rend="strikethrough" hand="#dhl">my</del> body,
which is so dear to me
by law — hindered a man's proceedings who
<hi rend="underline">had obtained all the letters
to Mr Boyd</hi>
by law — hindered a man's proceedings who
<hi xml:id="cstart1" rend="underline">had obtained all
the letters to Mr Boyd</hi>
<!-- ... -->
<certainty target="#cstart1" locus="startLoc" degree="0.70">
<desc>may begin with previous word</desc>
</certainty>
Where the area of text marked overlaps other areas of text, forexample crossing a structural division, one of the spanning mechanismsmentioned above must be used; for example where the line is thought tomark a deletion, the delSpan element may be used. Where it isdesired simply to record the marking of a span of text in circumstanceswhere it is not possible to surround the text with a hielement, the span element may be used with the rendattribute indicating the style of line-marking.
More work needs to be done on clarifying the treatment of othertextual features marked by lines which might so overlap or nest. Forexample, in many Middle English manuscripts (e.g. the Jesus and Digbyverse collections) marginal sidebars may indicate metrical structure:couplets may be linked in pairs, with the pairs themselves linkedinto stanzas. Or, marginal sidebars may indicate emphasis, or maypoint out a region of text on which there is some annotation: in manymanuscripts of Chaucer's Wife of Bath's Prologue lines655–8 are marked with nesting parentheses against which the scribehas written nota.
At the lowest level, all such features could be captured by use ofthe note element, containing a prose description of themanuscript at this point, enhanced by a link to a visualrepresentation (or facsimile) of the feature in question. It is not yet clear how best to mark up suchphenomena so as toobtain more usefully structured encodings. For example,in the Chaucer example just cited, one may wish to record that thenota is written in the Hengwrt manuscript in the rightmargin against a single large left parenthesis bracketing the fourlines, with two right parentheses in the right margin bracketing twooverlapping pairs of lines: the first and third, the second and fourth.The note element allows us to record that the scribe wrotenota, but is not well-adapted to show that thenota points both at all four lines and at two pairs oflines within the four lines.
11.7 Headers, Footers, and Similar MatterTEI: Headers, Footers, and Similar Matter¶
- fw (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page.
- running heads (whether repeated on every page, or changing onevery page)
- running footers
- page numbers
- catch-words
- other material repeated from page to page, which falls outside thestream of the text
<fw type="pageNum" place="top-right">29</fw>
<fw type="sig" place="bot-centre">E3</fw>
<fw type="catch" place="bot-right">TEMPLE</fw>
11.8 Other Primary Source Features not Covered in these GuidelinesTEI: Other Primary Source Features not Covered in these Guidelines¶
We repeat the advice given at the beginning of this chapter, thatthese recommendations are not intended to meet every transcriptionalcircumstance ever likely to be faced by any scholar. They are intendedrather as a base to enable encoding of the most common phenomena foundin the course of scholarly transcription of primary source materials.These guidelines particularly do not address the encoding of physicaldescription of textual witnesses: the materials of the carrier, themedium of the inscribing implement, the organisation of the carrier materials themselves (asquiring, collation, etc.), authorial instructions or scribal markup,etc. except insofaras these are involved in the broader question ofmanuscript description, as addressed by the msdescription module described in chapter 10 Manuscript Description.
11.9 転記モジュールTEI: 転記モジュール¶
↑ Contents « 10 Manuscript Description » 12 校本