「CBETA XML P5 Elements」:修訂間差異

出自DILA Wiki
imported>Ray
(新頁面: =Elements= ==ref== T43n1829, p. 1a03 <syntaxhighlight lang="XML"> <note place="inline"><ref target="../T30/T30n1579.xml#xpath2(//0279a03)">論本第一第二</ref></note> </syntaxhighlight>)
 
imported>Ray
無編輯摘要
行 1: 行 1:
=Elements=
=Elements=
==char==
缺字宣告
T01n0001
<syntaxhighlight lang="XML">
<char xml:id="CB00178">
  <charName>CBETA CHARACTER CB00178</charName>
  <charProp>
   <localName>composition</localName>
   <value>[木*奈]</value><!-- CBETA 組字式 -->
  </charProp>
  <mapping type="unicode">U+3B88</mapping>
  <mapping cb:dec="983218" type="PUA">U+F00B2</mapping>
</char>
</syntaxhighlight>
參考 mapping 元素.
===悉曇字===
T18n0848
<syntaxhighlight lang="XML">
<char xml:id="SD-A6A9">
  <charName>CBETA CHARACTER SD-A6A9</charName>
  <charProp>
   <localName>Romanized form in CBETA transcription</localName>
   <value>haa</value>
  </charProp>
  <charProp>
   <!-- 對應到造字字型檔裡所用的字碼 -->
   <localName>Character in the Siddham font</localName>
   <value>扣</value>
  </charProp>
  <charProp>
   <localName>Romanized form in Unicode transcription</localName>
   <value>hā</value>
  </charProp>
  <mapping cb:dec="1066665" type="PUA">U+1046A9</mapping>
</char>
</syntaxhighlight>
SD-A6A9 是 CBETA 缺字碼, 悉曇字都以 SD 開頭.
===蘭札體===
<syntaxhighlight lang="XML">
<char xml:id="RJ-C947">
  <charName>CBETA CHARACTER RJ-C947</charName>
  <charProp>
   <localName>Romanized form in CBETA transcription</localName>
   <value>aa.h</value>
  </charProp>
  <charProp>
   <localName>rjchar</localName>
   <value>乇</value>
  </charProp>
  <charProp>
   <localName>Romanized form in Unicode transcription</localName>
   <value>āḥ</value>
  </charProp>
  <mapping cb:dec="1100103" type="PUA">U+10C947</mapping>
</char>
</syntaxhighlight>
RJ-C947 是 CBETA 缺字碼, 蘭札體都以 RJ 開頭.
==g==
缺字
T01n0001, p. 3c01
<syntaxhighlight lang="XML">
波羅<g ref="#CB00178">㮈</g>
</syntaxhighlight>
CB00178 是 CBETA 的缺字碼, 參考 char 元素.
==mapping==
===type="unicode"===
該缺字對應的 Unicode 碼
<syntaxhighlight lang="XML">
<mapping type="unicode">U+3B88</mapping>
</syntaxhighlight>
===type="PUA"===
<syntaxhighlight lang="XML">
<char xml:id="CB00178">
  ...
  <mapping cb:dec="983218" type="PUA">U+F00B2</mapping>
</char>
</syntaxhighlight>
U+F00B2 是這個缺字對應的 Unicode 造字區 (PUA, Private Use Area) 的編碼.<br/>
由 U+F0000 起算, 加上十進位 178 (來自 CB00178), 也就是十六進位 B2, 所以是 U+F00B2.<br/>
983218 是 F00B2 的十進位值.
====悉曇字====
T18n0848
<syntaxhighlight lang="XML">
<char xml:id="SD-A6A9">
  ...
  <mapping cb:dec="1066665" type="PUA">U+1046A9</mapping>
</char>
</syntaxhighlight>
U+1046A9 是 U+FA000 起算, 加上十六進位 A6A9.<br/>
1066665 是 U+1046A9 的十進位值.
====蘭札體====
<syntaxhighlight lang="XML">
<char xml:id="RJ-C947">
  ...
  <mapping cb:dec="1100103" type="PUA">U+10C947</mapping>
</char>
</syntaxhighlight>
U+10C947 是由 U+100000 起算, 加上 C947, 所以是 U+10C947
==ref==
==ref==
T43n1829, p. 1a03
T43n1829, p. 1a03

於 2013年8月27日 (二) 16:12 的修訂

Elements

char

缺字宣告

T01n0001

<char xml:id="CB00178">
  <charName>CBETA CHARACTER CB00178</charName>
  <charProp>
    <localName>composition</localName>
    <value>[木*奈]</value><!-- CBETA 組字式 -->
  </charProp>
  <mapping type="unicode">U+3B88</mapping>
  <mapping cb:dec="983218" type="PUA">U+F00B2</mapping>
</char>

參考 mapping 元素.

悉曇字

T18n0848

<char xml:id="SD-A6A9">
  <charName>CBETA CHARACTER SD-A6A9</charName>
  <charProp>
    <localName>Romanized form in CBETA transcription</localName>
    <value>haa</value>
  </charProp>
  <charProp>
    <!-- 對應到造字字型檔裡所用的字碼 -->
    <localName>Character in the Siddham font</localName>
    <value></value>
  </charProp>
  <charProp>
    <localName>Romanized form in Unicode transcription</localName>
    <value></value>
  </charProp>
  <mapping cb:dec="1066665" type="PUA">U+1046A9</mapping>
</char>

SD-A6A9 是 CBETA 缺字碼, 悉曇字都以 SD 開頭.

蘭札體

<char xml:id="RJ-C947">
  <charName>CBETA CHARACTER RJ-C947</charName>
  <charProp>
    <localName>Romanized form in CBETA transcription</localName>
    <value>aa.h</value>
  </charProp>
  <charProp>
    <localName>rjchar</localName>
    <value></value>
  </charProp>
  <charProp>
    <localName>Romanized form in Unicode transcription</localName>
    <value>āḥ</value>
  </charProp>
  <mapping cb:dec="1100103" type="PUA">U+10C947</mapping>
</char>

RJ-C947 是 CBETA 缺字碼, 蘭札體都以 RJ 開頭.

g

缺字

T01n0001, p. 3c01

波羅<g ref="#CB00178"></g>

CB00178 是 CBETA 的缺字碼, 參考 char 元素.

mapping

type="unicode"

該缺字對應的 Unicode 碼

<mapping type="unicode">U+3B88</mapping>

type="PUA"

<char xml:id="CB00178">
  ...
  <mapping cb:dec="983218" type="PUA">U+F00B2</mapping>
</char>

U+F00B2 是這個缺字對應的 Unicode 造字區 (PUA, Private Use Area) 的編碼.
由 U+F0000 起算, 加上十進位 178 (來自 CB00178), 也就是十六進位 B2, 所以是 U+F00B2.
983218 是 F00B2 的十進位值.

悉曇字

T18n0848

<char xml:id="SD-A6A9">
  ...
  <mapping cb:dec="1066665" type="PUA">U+1046A9</mapping>
</char>

U+1046A9 是 U+FA000 起算, 加上十六進位 A6A9.
1066665 是 U+1046A9 的十進位值.

蘭札體

<char xml:id="RJ-C947">
  ...
  <mapping cb:dec="1100103" type="PUA">U+10C947</mapping>
</char>

U+10C947 是由 U+100000 起算, 加上 C947, 所以是 U+10C947

ref

T43n1829, p. 1a03

<note place="inline"><ref target="../T30/T30n1579.xml#xpath2(//0279a03)">論本第一第二</ref></note>