imported>Marcus |
imported>Marcus |
行 145: |
行 145: |
| |- | | |- |
| |'''Character(s) Overwrite other Character(s): 覆蓋字''' | | |'''Character(s) Overwrite other Character(s): 覆蓋字''' |
| 被覆蓋的字若不清楚則使用<del unit="chars" extent="...">,清楚則轉錄出來,不確定則使用<unclear>。If the overwritten character is illegible use <del unit="chars" extent="...">, if legible give character, if unsure use <unclear>. | | 被覆蓋的字若不清楚則使用<del unit="chars" extent="..."/>,清楚則轉錄出來,不確定則使用<unclear>。If the overwritten character is illegible use <del unit="chars" extent="..."/>, if legible give character, if unsure use <unclear>. |
| |Ex.1: S-4272-0021: | | |Ex.1: S-4272-0021: |
|
| |
|
於 2014年8月6日 (三) 21:18 的修訂
敦煌漢文佛教寫卷點校本工作手冊
Critical Editions of Chinese Buddhist Dunhuang Manuscripts(Markup manual)
Date: 2014-8-4 Author: Zhang Boyong 張伯雍, Marcus Bingenheimer 馬德偉
本計畫為中華佛研所、馬德偉博士(Dr. Marcus Bingenheimer, Temple University)、太史文博士(Dr. Stephen F. Teiser, Princeton University)與大英圖書館國際敦煌專案(International Dunhuang Project, IDP)的合作專案。專案說明詳見計畫書。
寫卷數位化
數位化的寫卷標題範例:lengqieszj-S-4272.xml。S-4272 為敦煌文獻號碼。
所使用的標記規範為 TEI P5
寫卷結構與標記
- 以敦煌寫卷「件」為單位,即每一個檔案即為一件敦煌寫卷(如 S.4272)
行號標記例
|
<lb xml:id="S-4272-0001"/>
|
空格標記例
|
<space unit="char" extent="1"/>
|
S-4272-0008
|
異寫字標記例
|
<reg>偽</reg>
|
S-4272-0005
|
<choice><orig><g ref="#A04441-003"/></orig><reg>障</reg></choice>
|
S-4272-0013
|
取代標記例
|
<subst><del>无</del><add>有</add></subst>
|
S-4272-0005
|
<subst><del unit="chars" extent="1"/><add>心</add></subst>
|
S-4272-0021
|
<subst><del><choice><orig><g ref="#A03580-001"/></orig><reg>薩</reg></choice></del><add place="inline-right">提</add></subst>
|
P-3436-0037
|
插入標記例
|
<add place="inline-right">性</add>
|
S-4272-0009
|
倒乙標記例
|
出<add place="inline-right">㆑</add>不
|
(不出)
P-3436-0037
|
<choice><orig><g ref="#A04688-002"/></orig><reg>魏</reg></choice>朝<add place="inline-right"><note resp="hand2">向上</note></add>苐二
|
(苐二魏朝)
P-3436-0110
|
補充修改標記例
|
<add place="margin-bottom">軰</add>
|
P-3436-0056
|
破損標記例
|
<damage>使鬼神</damage>
|
P-3436-0057
|
<damage unit="chars" extent="1"/>
|
P-3436-0011
|
字跡不清標記例
|
<unclear>斷</unclear>
|
P-3436-0070
|
難辨字標記例
|
<gap unit="char" extent="1"/>
|
P-3703-0011
|
省書例 Abbreviations
|
<choice>卄卄<expan>菩薩</expan></choice>
|
P-2634-0010
|
重文例 Repeat sign
|
<choice>窮々逹々法々性々<expan>窮逹法性窮逹法性</expan></choice>
|
P-3664-0360
|
進階說明──文字迻錄原則
Non-Unicode Variants - attested 萬國碼未收之異體字──已確認
- 萬國碼未收之字形。The variant character is not in Unicode.
- 教育部異體字字典有收錄者。It is attested in the "Dictionary of Chinese Character Variants" 教育部異體字字典 (Ministry of Education, RoC, 2012). Current Query Interface: http://dict2.variants.moe.edu.tw/variants/.
- 教育部異體字字典對該字的編號會記錄在<g>@ref標記中。 <g>@ref points to a header item which references the character number of the variant in the MoE Dictionary.
- 能夠以所對應的正字表達出來。 It can be represented by a semantically equivalent common character (通用字).
|
Ex.1: S-4272-0002:
為除忘相<choice><orig><g ref="#A03335-005"/></orig><reg>修</reg></choice>行六度
|
|
Non-Unicode Variants - unattested 萬國碼未收之異體字──未確認
- 萬國碼與教育部異體字字典均未收錄。The character is neither in Unicode nor in the MoE Dictionary.
- 但字形結構上能夠分辨者。 Use this only for characters where the stroke count is clearly legible.
|
Ex.1: S-4272-0022:
度眾生過去<reg>逢</reg>无量恒
|
|
"Unclear" Characters 模糊字
- <unclear> 是一個較鬆散的解釋,此類字多受到摹寫字跡以及古代字形的影響。<unclear> is much open to interpretation. It is influenced strongly by the quality of the facsimile and the level of paleographic skills.
- 標記此類文字時通常需借助其他版本的文獻,而不能由文本直接辨認出來。We use it in this project when the character and its stroke structure are not recognizable on their own, but only by comparing with other versions.
- 所有的<unclear>都能理解為某個正字,但與<reg>不同的是<unclear>文字結構模糊,而<reg>的文字結構清晰。All <unclear> are understood as 通用字, this form of regularization differes from <reg>, however, because with <unclear> the intended variant is unknown. With <reg> the shape/stroke structure of the variant character is seen.
|
Ex.1: P-3703-0001:
無有邊<unclear>畔坐</unclear>
|
|
Significant spaces 文中的空格
- 另起一段或徵引文獻時。Intentional, significant space before new sections (Ex. 1) or quotations (Ex.1).
- 文獻末尾空格不標記。No <space> needed at end of a Ms folio.
|
Ex.1: S-4272-0008 - S-4272-0010:
為中道<space unit="char" extent="2"/>苐三齊朝
人年十四遇達摩禪師
真登佛果<space unit="char" extent="1"/>楞伽経云
|
|
Character(s) added in the Ms. 插入字
- 文中有人插入文字。Character(s) added by a scribe in the Ms.
- 大致描述插入字位置。@place gives rough description where to find it.
|
Ex.1: S-4272-0009:
禪師俗<add place="inline-right">性</add>姖武窂人
|
|
Character(s) Overwrite other Character(s): 覆蓋字
被覆蓋的字若不清楚則使用,清楚則轉錄出來,不確定則使用<unclear>。If the overwritten character is illegible use , if legible give character, if unsure use <unclear>.
|
Ex.1: S-4272-0021:
為是知眾生識<subst><del unit="chars" extent="1"/><add>心</add></subst>自度
|
|
Damaged but recognizable characters 破損字
<damage>與<unclear>近似,標記中直接使用正字。(範例中的字也可以識別為「忕」或「𢗗」,此處依另一版本。)<damage> is similar to <unclear> in that the text provided should be considered 通用字 as the variant can not be distinguished clearly.
|
Ex.1: P-3703-0001 :
時<damage>狀</damage>𠰥
|
|
Unrecognizable characters due to accidental damage (tearing, breaking, smearing, blotting, smudging etc.) with later annotation 因意外而造成無法判讀(如撕裂、破損、磨滅、髒汙等),後來新增者
- 背面墨水透出使「法」字部分不清,另一個字則完全不清。Seeping ink renders the character 法 partially illegible and another character completely illegible.
- 可以推論第二個字可能被 (hand="1") 塗改為「有」,但又暈墨。Probably the latter character was originally deleted, and the first scribe (hand="1") had added a 有 next to the line, which, however, too became blotted as the ink seeped through, but is still inferable.
- 不清的「法」字旁潦草寫了一個「法」。(hand="2") 又另在前次暈墨的「有」下方再寫一個「有」,這必然是在背面抄寫後才發生的,這份手卷發現數次這樣因背面的墨透背後,才進行的補救。 Later someone adds a quickly written 法 next to the partially damaged 法, and a 有 below the damaged first addition inline-right. This probably was someone else (@hand="2") because it must have occurred after the verso text had been written and there are several other cases of clarifying damaged characters elsewhere in the Ms.
- 假定這髒污是由背面的墨透過來的,那事情發生的順序應是:先抄寫了正面,而背面又抄寫了其他文稿,結果導致墨暈至背面。後來在讀正面時(的人),又將模糊的字重書在右方。Assuming the blotting is due to ink seeping through the paper the series of events was: someone wrote the text, then something else was written on verso, and the ink seeping through blotted the recto text. A later reader clarifies unclear characters recto with a dry brush.
|
Ex.1: P-3703-0007:
In the header: <profileDesc> <creation> <listChange> <change xml:id="stage1">The manuscript is written, corrections were made by the scribe.寫卷抄錄時的修正</change><change xml:id="stage2">The verso is written. Ink seeps through blotting some characters.背面抄寫時的墨透背後所汙染者</change><change xml:id="stage3">A later hand clarifies characters that were blotted out.在汙處外再次訂正</change> </listChange> </creation> </profileDesc>
非<unclear>離</unclear>生<damage change="#stage2">法</damage><add change="#stage3" hand="2" place="inline-right">法</add><damage change="#stage2"><del change="#stage1" hand="1"><gap unit="chars" extent="1"/></del><add change="#stage1" place="inline-right" hand="1">有</add></damage><add change="#stage3" hand="2" place="inline-right">有</add>无生龍
Ex.2: P-3703-0010:
一切圡木<damage change="#stage2">瓦</damage><add place="inline-right" change="#stage3">瓦</add>石
- The original character (probably 瓦) becomes illegible by ink seeping thorugh. 原來的字(應是「瓦」)被透背的墨所暈。
- A later hand clarifies the illegible section and writes 瓦 next to it. 後來的讀者重書「瓦」在右方。
|
|
Reverse Mark 倒乙符號 (レ-点)
- 以萬國碼「雁點」(レ点 U+3191)為倒乙符號。Use Unicode Character 'IDEOGRAPHIC ANNOTATION REVERSE MARK' (U+3191) within <add place="inline-right"> </add>
參見:敦煌古代的標點符號 維基百科:訓読
|
Ex.1: :
亦出<add place="inline-right">㆑</add>不扵有
|
|
字型工具
- 最好安裝 Unicode Super-CJK Fonts v6.0