Add some missing mappings for Big5 code points or fix some definitions.
[chise/xemacs-chise.git-] / lisp / utf-2000 / README.en
1 UTF-2000 character database format (S-expression version)
2
3 * Relations between Characters
4
5 Property-name `->foo' or `<-foo' mean relations between characters.
6
7 If there are a relationship `foo', property-name `->foo' means a value
8 of the property is a `foo' of the defined character.  Likewise,
9 property-name `<-foo' means the defined character is a `foo' of each
10 value of the property.
11
12 Example 1: If a character ?A has property `(->lowercase . (?a))', the
13 character ?a specified in the property is a lowercase of the defined
14 character ?A.
15
16 Example 2: If a character ?a has property `(<-lowercase . (?A))', the
17 defined character ?a is a lowercase of the character ?A specified in
18 the property.
19
20
21 ** Character Reference
22
23 Type of property `->foo' or `<-foo' are list of ``char-ref''.  The
24 type char-ref is defined by following definition:
25
26    char-ref = character | char-spec | char-property-list
27
28 The type ``char-property-list'' is a property-list.  In the
29 property-list, some keys are defined in the specification.  Following
30 list describes meaning of keys:
31
32   :char
33         Required.  It indicates a character.
34         [type] character | char-spec
35
36   :sources
37         Optional.  It indicates a source of the relation.
38         [type] list of symbols
39
40         Meaning of some symbols are defined.  Following list describes
41         meaning of symbols:
42
43                 chuuka-daijiten                 \e$BCf\e$(Ga^DKGsL(\e(B
44                 doubun-tsuukou                  \e$BF1J8DL9M\e(B
45                 gyokuhen                        \e$B6LJS\e(B
46                 henkai                          \e$BJS3$\e(B
47                 inkai                           \e$B1$2q\e(B
48                 inkaiho                         \e$B1$2qJd\e(B
49                 jii                             \e$B;zWC\e(B
50                 jiiho                           \e$B;zWCJd\e(B
51                 jiyun (shuuin)                  \e$B=81$\e(B
52                 kaihen                          \e$B3$JS\e(B
53                 kangxi                          \e$B9/_f;zE5\e(B
54                 kouin                           \e$B9-1$\e(B
55                 morohashi-daikanwa              \e$BBg4AOB<-E5\e(B
56                 ruishuu-meigishou               \e$BN`f\L>5A>6\e(B
57                 seiin                           \e$B@51$\e(B
58                 seiji-tsuu                      \e$B@5;z\e$(CwW\e(B
59                 setsumon-tsuukun-teisei         \e$B@bJ8DL71Djfa\e(B
60                 shuowen                         \e$B@bJ8\e(B
61                 sougen-irai-zokujifu            \e$BAW850JMhB/;zIh\e(B
62                 yuquan                          \e$B6L@t\e(B
63                 senhen                          \e$B@n\e$(Gn2\e(B
64
65 ** Properties
66
67 ->lowercase     indicates lowercase
68 ->uppercase     indicates uppercase
69 ->titlecase     indicates titlecase (uppercase + lowercase)
70
71 ->fullwidth     indicates ``fullwidth'' characters
72 <-fullwidth     a ``fullwidth'' character of
73
74 ->same-ideograph        indicates same characters (\e$BF1;z\e(B)
75 ->original-ideograph    indicates ``original'' characters (\e$BK\;z\e(B)
76 <-original-ideograph    an ``original'' character of
77 ->ancient-ideograph     indicates ``ancient'' characters (\e$B8E;z\e(B)
78 <-ancient-ideograph     an ``ancient'' character of
79 ->vulgar-ideograph      indicates ``vulgar'' characters (\e$BB/;z\e(B)
80 <-vulgar-ideograph      a ``vulgar'' character of
81 ->wrong-ideograph       indicates ``wrong'' characters (\e$Bkw;z\e(B)
82 <-wrong-ideograph       a ``wrong'' character of
83 ->simplified-ideograph  indicates simplified characters
84 <-simplified-ideograph  a simplified character of
85 ->ideographic-variants  indicates variants (\e$B0[BN;z\e(B)
86 ->synonyms              indicates synonyms
87
88 ->radical               indicates corresponding radical forms
89 <-radical               a radical form of
90
91 ->bopomofo              indicates bopomofo characters
92 <-bopomofo              a bopomofo character of
93
94 ->ideographic           indicates corresponding characters used with
95                         Ideographic script
96 <-ideographic           an Ideographic symbol of