敦煌漢文佛教寫卷點校本工作手冊
敦煌漢文佛教寫卷點校本工作手冊
Critical Editions of Chinese Buddhist Dunhuang Manuscripts(Markup manual)
Date: 2014-2-14 Author: Zhang Boyong 張伯雍
本計畫為中華佛研所、馬德偉博士(Dr. Marcus Bingenheimer, Temple University)、太史文博士(Dr. Stephen F. Teiser, Princeton University)與大英圖書館國際敦煌專案(International Dunhuang Project, IDP)的合作專案。專案說明詳見計畫書。
寫卷數位化
數位化的寫卷標題範例:lengqieszj-S-4272.xml。S-4272 為敦煌文獻號碼。
所使用的標記規範為 TEI P5
寫卷結構與標記
- 以敦煌寫卷「件」為單位,即每一個檔案即為一件敦煌寫卷(如 S.4272)
文字轉錄原則
- 不論原文使用何種字體(楷書、行書、草書等),皆轉錄為楷書(楷化)。
- Unicode 有提供字型者,按原字形轉錄。如:㘴,不轉錄為教育部標準字體(正字)「坐」。
Non-Unicode Variants - attested 萬國碼未收之異體字──已確認 1.萬國碼未收之字形。The variant character is not in Unicode. |
Ex.1: S-4272-0002: 為除忘相<choice><orig><g ref="#A03335-005"/></orig><reg>修</reg></choice>行六度 |
|
Non-Unicode Variants - unattested 萬國碼未收之異體字──未確認 1.萬國碼與教育部異體字字典均未收錄。The character is neither in Unicode nor in the MoE Dictionary. |
Ex.1: S-4272-0022: 度眾生過去<reg>逢</reg>无量恒 |
1. 2. "Unclear" Characters 模糊字 Ex.1: P-3703-0001: 無有邊<unclear>畔坐</unclear> 1. <unclear> is much open to interpretation. It is influenced strongly by the quality of the facsimile and the level of paleographic skills. <unclear> 是一個較鬆散的解釋,此類字多受到摹寫字跡以及古代字形的影響。 2. We use it in this project when the character and its stroke structure are not recognizable on their own, but only by comparing with other versions. 標記此類文字時通常需借助其他版本的文獻,而不能由文本直接辨認出來。 3. All <unclear> are understood as 通用字, this form of regularization differes from <reg>, however, because with <unclear> the intended variant is unknown. With <reg> the shape/stroke structure of the variant character is seen. 所有的<unclear>都能理解為某個正字,但與<reg>不同的是<unclear>文字結構模糊,而<reg>的文字結構清晰。 Ex.1: S-4272-0022: 度眾生過去<reg>逢</reg>无量恒 1. The character is neither in Unicode nor in the MoE Dictionary. 萬國碼與教育部異體字字典均未收錄。 2. Use this only for characters where the stroke count is clearly legible. 但字形結構上能夠分辨者。 字型工具
|